NYC Traffic & Road Infrastructure Analysis

CIS 9440 - Data Warehousing & Analytics | Group 3

NYC Traffic & Road Infrastructure Analysis

Investigating the relationship between traffic density and road infrastructure degradation across New York City's five boroughs (2020-2025).

CIS 9440 · Data Warehousing Baruch College · Zicklin School of Business Spring 2026 Group 3

Introduction

New York City’s road infrastructure handles some of the highest traffic volumes in the United States. When roads degrade, potholes form, traffic signals fail, street lights go dark, and the burden falls on residents, delivery networks, and emergency services alike. Yet maintenance resources are finite, and response times vary widely across boroughs.

This project builds a data warehouse that integrates two NYC Open Data sources to investigate whether traffic volume is a meaningful predictor of infrastructure complaints, and whether boroughs with higher traffic also experience slower repair response times.

Analytics Questions

Q1: Does traffic volume correlate with complaint frequency?
To what extent does high traffic volume (specifically heavy vehicle counts) correlate with the frequency of “Street Condition,” “Street Light Condition,” and related 311 complaints?

Q2: Does response time vary across boroughs?
How does agency response time for road repairs vary between high-traffic zones and low-traffic zones? Is there a measurable “maintenance lag” in high-traffic areas?

Key Metrics / KPIs

KPI Description
Pothole Incidence Rate (PIR) Total “Street Condition” complaints ÷ avg daily traffic volume per segment
Traffic-to-Repair Correlation (T2R) Statistical measure across zip codes: Do higher traffic volumes correlate with faster or slower resolution?
Peak-Load Service Lag Avg days-to-close during peak traffic months vs. annual average

Data Sources

Source Description Link
NYC 311 Service Requests Community-reported road/infrastructure issues, filtered to street-related complaint types, 2020-2025 NYC Open Data
DOT Automatic Traffic Volume Counts 15-minute interval automated traffic counts at DOT sensor locations, 2020-2025 NYC Open Data

Potential Stakeholders

  • NYC Department of Transportation (DOT): prioritize preventive maintenance on high-traffic / low-complaint roads before failure occurs.
  • Logistics & Delivery Companies: identify high-incident routes that increase vehicle maintenance costs or delivery delays.
  • Urban Planners: assess whether current road materials are sufficient for the actual traffic load detected by sensors.

Project Summary

944,395
Total street complaints
11 days
Avg citywide close time
5
Boroughs analyzed
2020-2025
Data time range
250+
ZIP codes covered
9 SQL
Analytical queries

Group Members

Name Role GitHub
Zesen Chen Technician See Portfolio
Bridgette Wang Detail Manager See Portfolio
Paven Oommen Communicator See Portfolio
Sung Ik Park Technician See Portfolio
Raúl J. Solá Navarro Facilitator See Portfolio