سیویلیکا را در شبکه های اجتماعی دنبال نمایید.

DataBay: A Unified Platform for Automating DataWarehouse Management, Real-Time Data Processing,and Ensuring Data Quality and Monitoring

Publish Year: 1403
Type: Conference paper
Language: English
View: 74

This Paper With 8 Page And PDF Format Ready To Download

Export:

Link to this Paper:

Document National Code:

DATAGOV01_032

Index date: 2 February 2025

DataBay: A Unified Platform for Automating DataWarehouse Management, Real-Time Data Processing,and Ensuring Data Quality and Monitoring abstract

As organizations increasingly depend on large-scaledata for strategic decision-making, managing data warehouses hasbecome a complex and resource-intensive challenge. This paperintroduces DataBay, a unified platform designed to automate theentire data warehouse lifecycle, from data ingestion andtransformation to real-time processing, monitoring, and ensuringdata quality. By streamlining these processes, DataBay reduces theneed for specialized technical expertise, enabling fasterimplementation and more efficient data management. Theplatform integrates critical components such as Change DataCapture (CDC), Kafka, and Prometheus to ensure high-performance data processing and real-time monitoring withoutdisrupting production environments. DataBay leverages Avro fordata serialization, providing optimal throughput and storageefficiency compared to traditional formats like JSON.Additionally, its automated data pipeline orchestration, along withbuilt-in data quality checks, enhances the reliability and accuracyof insights derived from the data. The platform’s architecture ishighly scalable, supporting enterprise-level datasets and adaptingto evolving business needs. Through its seamless integration andflexibility, DataBay helps businesses make timely, data-drivendecisions and enables continuous optimization of data workflows.This paper discusses the platform’s architecture, itsimplementation in real-world industry settings, and the significantbusiness value it delivers by enhancing operational efficiency andempowering data-driven decision-making across organizations.

DataBay: A Unified Platform for Automating DataWarehouse Management, Real-Time Data Processing,and Ensuring Data Quality and Monitoring Keywords:

DataBay: A Unified Platform for Automating DataWarehouse Management, Real-Time Data Processing,and Ensuring Data Quality and Monitoring authors

Mostafa Ghadimi

Databurst.techTehran, Iran

Niyusha Baghayi

Databurst.techTehran, Iran

Alireza Shateri

Databurst.techTehran, Iran