Accurate prediction of FIFA World Cup match outcomes is valuable to analysts, coaches, bettors, and fans. We introduce a machine-learning framework that fuses player-level performance metrics (goals, assists, passing accuracy, tackles, …) with team-level historical data to forecast winners. Multi-year information is distilled into year-specific team profiles, dimensionality reduction is applied to control feature explosion, and an ensemble of classifiers is optimized via cross-validation. On the FIFA 2022 World Cup dataset our model outperforms a head-to-head baseline, underscoring the importance of granular player attributes and offering insights into player synergy and strategic match-ups.
The proposed pipeline (Figure 1) comprises:
The code and pre-processed datasets are released under the
MIT License. See the LICENSE
file for details.
We thank the FIFA data community and prior works on sports analytics for inspiration. This project was supported in part by the University of Michigan-Dearborn ECE and CIS departments.