Apache Spark Data Pipeline
Designed and implemented a scalable ETL pipeline using Apache Spark and Delta Lake (Bronze–Silver–Gold architecture) to process multi-source datasets. Improved data accessibility and enabled structured reporting by cleaning, standardizing, and aggregating raw data into analytics-ready formats.