1
🔗

Data Integration

Combined violation records with bus stop locations using spatial joins

Technical Implementation:
GeoPandas spatial join with 50m tolerance, matched 97.8% of records
2

Temporal Analysis

Identified peak violation patterns by hour, day, and academic calendar

Technical Implementation:
Pandas groupby operations, datetime indexing, statistical significance testing
3
🎓

CUNY Proximity Mapping

Calculated distances from violations to CUNY campuses using buffer analysis

Technical Implementation:
Haversine distance formula, 500m buffer zones, spatial indexing
4
📊

Enforcement Paradox Calculation

Developed metric combining violation intensity with speed improvement

Technical Implementation:
Custom scoring: (violations × intensity) / speed_improvement_factor
5
📈

Impact Quantification

Estimated time lost and academic disruption for student populations

Technical Implementation:
Monte Carlo simulation with ridership data, confidence intervals

Statistical Methods

Significance Testing

  • • Chi-square tests for categorical associations
  • • Mann-Whitney U tests for non-parametric comparisons
  • • Bonferroni correction for multiple comparisons
  • • Bootstrap confidence intervals (n=10,000)

Spatial Analysis

  • • Haversine distance calculations
  • • K-means clustering for hotspot identification
  • • Moran's I for spatial autocorrelation
  • • Buffer analysis with 50m, 100m, 500m tolerances