Spotify Data Pipeline
Built a scalable azure data platform with dynamic, metadata driven ingestion, incremental cdc pipelines, and databricks silver gold transformations using autoloader, unity catalog, and delta live tables
Built a scalable azure data platform with dynamic, metadata driven ingestion, incremental cdc pipelines, and databricks silver gold transformations using autoloader, unity catalog, and delta live tables
Built an end-to-end Azure data pipeline using Data Factory, Databricks, Synapse, and ADLS Gen2 to automate ingestion, transformation, and analytics with Medallion Architecture.
Comprehensive data warehousing and analytics project, from building a data warehouse to generating actionable insights
Analyzed Blinkit's sales, outlets, and fat content, item types data using Power BI to uncover insights on product performance, outlet efficiency, and customer preferences across locations.
Developed an interactive Power BI dashboard visualizing PhonePe transactions across services, loan categories, and payment modes, uncovering digital payment trends and user engagement patterns.
Analyzed customer financial data to identify key predictors of loan default. Leveraged Python libraries like Pandas, Matplotlib, and Seaborn to uncover high-risk profiles, enabling more informed lending decisions.
Analyzed IMDB movie data using advanced SQL like CTEs, joins, subqueries, and window functions to uncover insights on genres, ratings, directors, actors, and production trends.
Developed an interactive Power BI dashboard analyzing credit card transactions and customer insights to visualize spending patterns and revenue trends.
This project analyzes the Titanic disaster to uncover key survival factors using Python, turning raw data into meaningful insights and visualizations.