Enhancing Predictive Accuracy in Forecasting Football World Cup 2022 Outcomes

Authors

  • Umme Habiba Department of Mathematics, Mawlana Bhashani Science and Technology University, Bangladesh Author

DOI:

https://doi.org/10.69728/jst.v11.60

Keywords:

Simple Linear Regression Model, Multiple Linear Regression Model, Pearson Regression Model, Log-transformation, Football forecast 2022, Soccer Power Index

Abstract

This study evaluates key predictive metrics for the 2022 FIFA World Cup outcomes, comparing the predictive accuracy of the simple linear regression (SLR) and multiple linear regression (MLR) models. The analysis examines model fit and accuracy using statistical metrics and addresses key assumptions such as homoscedasticity, autocorrelation, and multicollinearity. Significant variables include the Soccer Power Index (SPI), offensive and defensive strengths, and simulated goal differences. To improve model assumptions, log-transformation is employed for the dependent variable. The findings demonstrate that MLR models outperform SLR in predictive accuracy, contributing to advances in sports forecasting methodologies. Recommendations are also provided for future sports analytics applications. 

Author Biography

  • Umme Habiba, Department of Mathematics, Mawlana Bhashani Science and Technology University, Bangladesh

    School of Mathematical and Statistical Science with Interdisciplinary Applications, The University of Texas Rio Grande Valley, Texas, USA

References

Baio, G., & Blangiardo, M. (2010). Bayesian hierarchical model for the prediction of football results. Journal of Applied Statistics, 37(2), 253–264.

Dufera, A. G., Liu, T., & Xu, J. (2023). Regression models of Pearson correlation coefficient. Statistical Theory and Related Fields, 7(2), 97–106.

Efron, B., & Tibshirani, R. J. (1994). An introduction to the bootstrap. Chapman and Hall/CRC.

Fox, J. (2015). Applied regression analysis and generalized linear models (3rd ed.). Sage Publications.

Jingzheng, D. (2023). Linear regression to predict World Cup. Highlights in Science, Engineering and Technology, 61, 144–151.

Király, F. J., & Qian, Z. (2017). Modelling competitive sports: Bradley-Terry-Elo models for supervised and on-line learning of paired competition outcomes. arXiv preprint, arXiv:1701.08055.

Maher, M. J. (1982). Modelling association football scores. Statistica Neerlandica, 36(3), 109–118.

Mukaka, M. M. (2012). A guide to appropriate use of correlation coefficient in medical research. Malawi Medical Journal, 24(3), 69–71.

Osborne, J. (2010). Improving your data transformations: Applying the Box-Cox transformation. Practical Assessment, Research, and Evaluation, 15(1).

O’Brien, R. M. (2007). A caution regarding rules of thumb for variance inflation factors. Quality & Quantity, 41, 673–690.

Peters, G., & Pacheco, D. (2022). Betting the system: Using lineups to predict football scores. arXiv preprint, arXiv:2210.06327.

Simon, L., Heckhard, R., Weisner, A., Young, D., & Pardoe, I. (2021). STAT 501: Regression Methods. Online Notes, The Pennsylvania State University, Eberly College of Science.

Wang, S., Chen, J., & Chen, H. (2023). Player score prediction based on multiple linear regression model. Advances in Engineering Technology Research, 4(1), 246.

Downloads

Published

30-06-2025

How to Cite

Habiba, U. (2025). Enhancing Predictive Accuracy in Forecasting Football World Cup 2022 Outcomes. MBSTU Journal of Science and Technology, 11(1), 67-74. https://doi.org/10.69728/jst.v11.60