NCAA Analytics Challenge

Machine Learning · Gradient Boosting · Feature Engineering · Tableau
Won the NCAA Final Four Analytics Challenge - Predicting seeds for 2026 NCAA March Madness.
Predicted NCAA Tournament seedings for over 360 college basketball teams using five seasons of historical data. The real challenge was reverse-engineering how the selection committee weighs NET rankings, quadrant records, and conference strength. We built a seven-model gradient-boosting ensemble over 104 engineered features that reached 78% accuracy, cutting prediction error by 43% versus the baseline. I then used Tableau dashboards to turn the findings into a clear narrative for NCAA stakeholders.
Honest notes - what was traded away
- –Minimizing RMSE on seeds sounds clean on paper, but committee logic is inconsistent - most of the work was iterative error analysis to find where the model was systematically wrong, then encoding those patterns as features.
- –The Tableau narrative ended up mattering as much as the model when presenting to judges - a lesson in how far accuracy alone gets you.



