The original data files are too large to upload to GitHub, but they are publicly available on Kaggle:
🔗 NFL Play-by-Play 2009–2016 Dataset
This project consists of two main components:
-
EDA & Cleaning Notebook
- Ingests raw data
- Performs exploratory data analysis
- Cleans and prepares the data for modeling
-
Regression Analysis Notebook
- Builds and evaluates logistic regression models
- Optimizes predictors and model performance