Featured Projects

A showcase of data engineering pipelines, cloud architecture, and analytical platforms.

Data Engineering

FAANG Stock Data Pipeline

Developed a fully containerized data platform to track FAANG stock data performance. Built with a focus on scalability and reproducibility, the project leverages Terraform for GCP provisioning and Docker for containerization. Data is orchestrated using Airflow, transformed within BigQuery using dbt for robust modeling, and visualized in Looker Studio to provide a comprehensive view of market volatility and historical growth.

Airflow BigQuery dbt Terraform
View on GitHub
MIT Professional Education

MIT Data Science & ML Capstone

Comprehensive portfolio showcasing Machine Learning models and statistical projects completed during the MIT Professional Education program.

Predictive Modeling Feature Engineering Regression Collaborative Filtering Matrix Factorization Recommendation Systems Descriptive Statistics Data Visualization EDA
Access MIT Portfolio
ETL / Docker

NYC Taxi Data Pipeline

Automated ingestion of public NYC taxi datasets into PostgreSQL. Built to handle batch ingestion and optimized for scalable database performance using Python.

Python PostgreSQL Docker
View on GitHub

Price Analytics Automation

Automated dynamic pricing system for E-commerce clients that adjusted prices based on real-time competitor data.

Enterprise Architecture

Azure Sales Analytics Platform

An end-to-end platform orchestrating data from on-prem SQL databases to Azure Synapse via ADF, Databricks, and Data Lake Gen2. Visualized in Power BI with automated triggers.

Azure ADF Databricks Synapse Power BI