International Journal For Multidisciplinary Research

E-ISSN: 2582-2160     Impact Factor: 9.24

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal


Optimizing ETL Pipelines with Delta Lake and Medallion Architecture: A Scalable Approach for Large-Scale Data

Author(s): Praveen Kumar Reddy Gujjala
Country: United States
Abstract: The exponential growth of enterprise data has created demand for efficient, scalable, and reliable Extract–Transform–Load (ETL) pipelines. Traditional ETL approaches often struggle to handle massive datasets while maintaining transactional consistency, supporting schema evolution, and integrating with real-time workloads. This paper presents a technical exploration of combining Delta Lake with the Medallion Architecture to address these challenges. Delta Lake's ACID (Atomicity, Consistency, Isolation, Durability) transaction guarantees provide a resilient data foundation, while the Medallion Architecture curates data in successive layers: Bronze, Silver, and Gold. The proposed methodology incorporates schema evolution, time travel, and optimized partitioning strategies to adapt to changing business requirements. Performance evaluation through longitudinal studies and controlled simulations demonstrates significant improvements in data throughput, governance, and system uptime. This work provides a blueprint for designing ETL pipelines that support both batch and streaming workloads at scale.
Field: Engineering
Published In: Volume 6, Issue 6, November-December 2024
Published On: 2024-12-04
DOI: https://doi.org/10.36948/ijfmr.2024.v06i06.55445
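
The layered curation described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation and does not use the Delta Lake API: it uses plain in-memory Python lists as stand-ins for Bronze, Silver, and Gold tables, and the record fields (order_id, region, amount, ts), the sample data, and the function names are all hypothetical, chosen only to show how each layer refines the previous one.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical raw events; field names and values are illustrative only.
RAW_EVENTS = [
    {"order_id": "1", "region": "east", "amount": "10.50", "ts": "2024-01-01T10:00:00"},
    {"order_id": "1", "region": "east", "amount": "10.50", "ts": "2024-01-01T10:00:00"},  # duplicate
    {"order_id": "2", "region": "west", "amount": "oops", "ts": "2024-01-01T11:00:00"},   # malformed amount
    {"order_id": "3", "region": "east", "amount": "4.50", "ts": "2024-01-02T09:30:00"},
]


def to_bronze(raw_events):
    """Bronze: keep every record as received, adding only an ingestion sequence number."""
    return [dict(event, _ingest_seq=i) for i, event in enumerate(raw_events)]


def to_silver(bronze_rows):
    """Silver: enforce types, skip malformed rows, and deduplicate on order_id."""
    seen = set()
    silver = []
    for row in bronze_rows:
        try:
            clean = {
                "order_id": row["order_id"],
                "region": row["region"],
                "amount": float(row["amount"]),
                "ts": datetime.fromisoformat(row["ts"]),
            }
        except (KeyError, ValueError):
            # Assumption: malformed rows are simply skipped here; a production
            # pipeline would more likely route them to a quarantine table.
            continue
        if clean["order_id"] in seen:
            continue
        seen.add(clean["order_id"])
        silver.append(clean)
    return silver


def to_gold(silver_rows):
    """Gold: a business-level aggregate, here the total amount per region."""
    totals = defaultdict(float)
    for row in silver_rows:
        totals[row["region"]] += row["amount"]
    return dict(totals)


bronze = to_bronze(RAW_EVENTS)
silver = to_silver(bronze)
gold = to_gold(silver)
print(len(bronze), len(silver), gold)
```

In this sketch the Bronze layer retains all four raw records, the Silver layer keeps only the two valid, unique orders, and the Gold layer reduces them to a per-region total. In a Delta Lake deployment each layer would typically be a separate Delta table, so that the transactional guarantees, schema evolution, and time travel mentioned in the abstract apply at every stage.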
