Real-World Impact

Data Science Portfolio

These projects show quantifiable business impact across healthcare analytics, quantitative finance, and AI/ML implementation. Each one addresses a real-world problem with advanced statistical methods and reports measured results.

7
Completed Projects
3
Industries Covered
100%
Healthcare Sensitivity
57.9%
Poverty Model R²

🎯 Results-Driven Approach: Combining healthcare analytics with statistical modeling to deliver measurable business impact across diverse industries.

💻 Technology

Istanbul Tourism & Retail Analytics

December 2024

💼 Business Impact: Actionable customer segmentation strategies for tourism and retail optimization

Applied K-means clustering and Random Forest regression to 2021-2023 customer data, identifying four distinct segments with spending ranging from $471 to $14,511. Item price was the top predictor, accounting for 50% of feature importance.

👥
4 Distinct
Customer Segments
💰
$471-$14,511
Spending Range
🏷️
Item Price (50%)
Top Predictor

Technologies Used:

Python K-means Clustering Random Forest Customer Segmentation Predictive Modeling

🎯 Key Challenge Solved:

Balancing model complexity with interpretability for business stakeholder communication
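
As a rough illustration of the K-means step named in this project, the sketch below runs Lloyd's algorithm on a single spending feature using only the Python standard library. The function name `kmeans_1d`, the evenly spaced initialization, and the spending values are all assumptions made for illustration; the project description does not state which features, library, or initialization were actually used.

```python
def kmeans_1d(values, k, iterations=100):
    """Lloyd's algorithm on one-dimensional data; returns (centroids, labels)."""
    ordered = sorted(set(values))
    if not 1 <= k <= len(ordered):
        raise ValueError("k must be between 1 and the number of distinct values")
    # Deterministic start: spread initial centroids evenly over the distinct values.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break  # converged: assignments can no longer change
        centroids = updated
    return centroids, labels


# Hypothetical customer spending totals, invented for illustration only.
spend = [480, 520, 610, 2100, 2300, 2500, 6900, 7200, 13900, 14400]
centroids, labels = kmeans_1d(spend, k=4)
print(centroids)
print(labels)
```

A real segmentation would normally use several scaled features and a library implementation; the one-dimensional version is kept here only because it makes the assignment and update steps easy to follow.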

💻 Technology

NYC Transit Equity Analysis - MHC × MTA Datathon

September 2024

💼 Business Impact: Data-driven policy recommendations for expanding transit accessibility programs

Participated in the inaugural MHC × MTA Datathon, analyzing 10GB+ of Fair Fares ridership data across 6 NYC neighborhoods and identifying a 98% correlation between bus/subway usage and peak patterns to support policy recommendations.

💾
10GB+
Data Volume
🔗
98%
Usage Correlation
🏙️
6 NYC Areas
Neighborhoods

Technologies Used:

Python SQL Tableau Big Data Processing Data Visualization Policy Analysis

🎯 Key Challenge Solved:

Processing massive real-time transit datasets while maintaining query performance and accuracy
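
The usage correlation reported for this project is not defined further in the description. Assuming it refers to a Pearson coefficient, a minimal standard-library sketch of that calculation might look like the following. The function name `pearson_r` and the hourly tap counts are hypothetical and do not come from the Fair Fares data.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of cross-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Hypothetical hourly tap counts, invented for illustration only.
bus_taps = [120, 340, 560, 410, 230, 380, 590, 300]
subway_taps = [200, 610, 980, 700, 400, 650, 1010, 520]
r = pearson_r(bus_taps, subway_taps)
print(f"r = {r:.3f}")
```

A coefficient near 1 only says the two series rise and fall together; it does not by itself show which mode drives the other.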

💻 Technology

Pokémon Franchise Analytics - Foundation Project

September 2024

💼 Business Impact: Personal skill development and validation of programming capabilities

My first comprehensive Python project: an analysis of global video game sales using the VGChartz dataset and PokeAPI integration, demonstrating OOP principles and revealing Game Boy platform dominance (80M+ units).

🐍
First Python Project
Learning Milestone
🎮
Game Boy 80M+
Platform Analysis
🏗️
Foundation Built
Skill Development

Technologies Used:

Python API Integration OOP Data Visualization Web Scraping

🎯 Key Challenge Solved:

Learning foundational programming concepts while handling real-world data integration complexities

Technical Expertise

Proven Skills Across Industries

Every project demonstrates measurable impact through advanced analytics, from 66% of variance explained in marketplace modeling to 100% sensitivity in healthcare anomaly detection.

📊

Statistical Modeling

  • Ridge Regression: 66% variance explained in marketplace analysis
  • Time Series Analysis: CUSUM & ETS models for anomaly detection
  • Econometric Modeling: 57.9% R² across 54 years of global data
  • Panel Data Analysis: 2,705+ observations, 100+ countries
  • Interaction Effects: Growth-distribution policy modeling
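
To make the CUSUM item above concrete, here is a minimal sketch of a two-sided tabular CUSUM detector in plain Python. The list does not say which CUSUM variant or parameters the projects used, so the function name `cusum_alarms`, the `target`, `slack`, and `threshold` parameters, and the example series are assumptions chosen for illustration.

```python
def cusum_alarms(values, target, slack, threshold):
    """Return indices where the upper or lower CUSUM statistic exceeds threshold."""
    upper = lower = 0.0
    alarms = []
    for i, x in enumerate(values):
        # Accumulate deviations beyond the slack band; never drop below zero.
        upper = max(0.0, upper + (x - target - slack))
        lower = max(0.0, lower + (target - x - slack))
        if upper > threshold or lower > threshold:
            alarms.append(i)
            upper = lower = 0.0  # restart accumulation after signaling
    return alarms


# Hypothetical series: stable near 10, then an upward shift to about 13.
series = [10.1, 9.8, 10.2, 9.9, 10.0, 13.1, 12.9, 13.2, 13.0, 12.8]
alarms = cusum_alarms(series, target=10.0, slack=0.5, threshold=4.0)
print(alarms)
```

The slack value sets how much drift is tolerated before deviations start to accumulate, and the threshold trades detection delay against false alarms; both would need tuning on real data.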
🤖

Machine Learning & AI

  • RAG Implementation: LangChain + OpenAI for grant assistance
  • Cambio Labs Project: Production-ready AI grant assistant
  • Customer Segmentation: K-means clustering, 4 distinct personas
  • Random Forest: Feature importance analysis for business insights
  • Neural Networks: 95% accuracy on image classification (ML Fellowship)
📊

Data Engineering & Analytics

  • Clinical Data Analysis: 150+ hours at NYU Langone Health
  • Anomaly Detection: 100% sensitivity, 91% specificity rates
  • Big Data Processing: 10GB+ transit datasets, optimized SQL
  • Multi-dataset Integration: Six complementary data sources
  • Domain Expertise: Finance, policy analysis, and data science
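
For readers unfamiliar with the sensitivity and specificity rates quoted above, the sketch below shows how the two are computed from binary labels. The function name `sensitivity_specificity` and the label vectors are hypothetical; they are invented for illustration and are not the clinical data behind the reported figures.

```python
def sensitivity_specificity(actual, predicted):
    """Return (sensitivity, specificity) for binary labels where 1 marks an anomaly."""
    tp = fn = tn = fp = 0
    for a, p in zip(actual, predicted):
        if a and p:
            tp += 1  # anomaly correctly flagged
        elif a:
            fn += 1  # anomaly missed
        elif p:
            fp += 1  # normal case wrongly flagged
        else:
            tn += 1  # normal case correctly passed
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one anomaly and one normal case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical labels, invented for illustration only.
actual = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
sens, spec = sensitivity_specificity(actual, predicted)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

In this toy case all three anomalies are caught, so sensitivity is 100%, while one of the seven normal cases is flagged, so specificity is about 86%. A detector tuned to miss no anomalies usually accepts some false alarms in exchange.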
🎯 Results-Driven Data Science

Ready to Drive Business Impact with Data?

From 66% of variance explained in marketplace optimization to 100% sensitivity in anomaly detection, these projects demonstrate measurable ROI through advanced analytics. Let's discuss how similar methodologies can accelerate your business objectives.

66%
Care.com Model R²
100%
Healthcare Sensitivity
57.9%
Poverty Model R²
435
Districts Analyzed