FAANG Stock Data Pipeline
Developed a fully containerized data platform to track FAANG stock performance. Built for scalability and reproducibility, the project uses Terraform to provision GCP infrastructure and Docker for containerization. Airflow orchestrates the pipeline, dbt transforms and models the data in BigQuery, and Looker Studio visualizes market volatility and historical growth.
MIT Data Science & ML Capstone
Portfolio of machine learning models and statistical projects completed during the MIT Professional Education program.
NYC Taxi Data Pipeline
Automated ingestion of public NYC taxi datasets into PostgreSQL using Python. Built for batch ingestion and optimized for scalable database performance.
Price Analytics Automation
Automated dynamic pricing system for e-commerce clients that adjusted prices based on real-time competitor data.
Azure Sales Analytics Platform
An end-to-end platform that moves data from on-premises SQL databases to Azure Synapse using Azure Data Factory (ADF), Databricks, and Azure Data Lake Storage Gen2. Results are visualized in Power BI, with automated triggers.