Science, Technology, Engineering and Mathematics.
Open Access

OLYMPIC MEDAL QUANTITY FORECASTING: A RANDOM FOREST ALGORITHM-BASED MODEL CONSTRUCTION

Download as PDF

Volume 3, Issue 3, Pp 19-25, 2025

DOI: https://doi.org/10.61784/wjit3038

Author(s)

JunBo Zhu*, LinFeng Li

Affiliation(s)

School of Mathematical and Physical SciencesChongqing University of Science and TechnologyChongqing 401331, China.

Corresponding Author

JunBo Zhu

ABSTRACT

Against the backdrop of the unstoppable wave of globalization in sports, the competition for Olympic medals has shown an increasingly fierce trend. Countries have invested a lot of resources to improve their performance in the Olympic Games in order to be in a favorable position in the medal competition. In this study, a random forest model is developed to predict the number of gold medals and the total number of medals of each country in the 2028 Olympic Games. Firstly, the data were obtained from the official website of the Olympic Games and data preprocessing was carried out. After completing data cleaning and organizing, a series of key influence indicators such as whether it is the host country, the number of athletes, the total score and so on are introduced, and then a random forest model is built to predict the total number of medals and gold medals of each country. Finally, based on the prediction results, it was determined that in the 2028 Olympic Games, countries such as Cuba, Germany and Slovakia have the potential to achieve breakthroughs, while countries such as Belgium, Ecuador and Israel may experience a decline in the acquisition of medals. This study breaks through the limitations of linear assumptions in traditional econometric models, utilizes the nonlinear fitting ability of the Random Forest algorithm to capture complex variable interactions, and quantifies the dynamic impact of the 'host effect' on the distribution of medals, and reveals the role weights of the core factors such as historical performance and participation size through characteristic contribution analysis. Meanwhile, the prediction results can provide scientific basis for the National Olympic Committees to optimize resource allocation and formulate strategies, sports economic research and event public opinion prediction.

KEYWORDS

Random forest model; Olympic medal prediction; Data preprocessing; Prediction accuracy

CITE THIS PAPER

JunBo Zhu, LinFeng Li. Olympic medal quantity forecasting: a random forest algorithm-based model construction. World Journal of Information Technology. 2025, 3(3): 19-25. DOI: https://doi.org/10.61784/wjit3038.

REFERENCES

[1] Scelles N, Andreff W, Bonnal L, et al. Forecasting national medal totals at the Summer Olympic Games reconsidered. Social science quarterly, 2020, 101(2): 697-711.

[2] Bredtmann J, Crede C J, Otten S. Olympic medals: Does the past predict the future?. Significance, 2016, 13(3): 22-25.

[3] Andreff W. Economic development as major determinant of Olympic medal wins: predicting performances of Russian and Chinese teams at Sochi Games. International Journal of Economic Policy in Emerging Economies, 2013, 6(4): 314-340.

[4] Vagenas G, Vlachokyriakou E. Olympic medals and demo-economic factors: Novel predictors, the ex-host effect, the exact role of team size, and the “population-GDP” model revisited. Sport Management Review, 2012, 15(2): 211-217.

[5] Cheng H R, Lü J, Yuan T G. Prediction of China's track and field results in the Tokyo Olympic Games from the world top 20 national rankings of track and field events in 2018. Bulletin of Sports Science & Technology, 2020, 28(04): 4-8.

[6] Liu C Y, Wu M Q, Zhang A A, et al. Study on the temporal and spatial differentiation of Chinese Olympic medals from 1984 to 2016. Journal of Physical Education, 2019, 26(01): 75-82.

[7] Ding W Z. Data mining model of Olympic medals based on comprehensive national strength. Information Recording Materials, 2018, 19(03): 231-233.

[8] Balmer N J, Nevill A M, Williams A M. Modelling home advantage in the Summer Olympic Games. Journal of sports sciences, 2003, 21(6): 469-478.

[9] Bao Y, Meng X, Ustin S, et al. Vis-SWIR spectral prediction model for soil organic matter with different grouping strategies. Catena, 2020, 195: 104703.

[10] Hafezi M H, Liu L, Millward H. Learning daily activity sequences of population groups using random forest theory. Transportation research record, 2018, 2672(47): 194-207.

[11] Zhang Y D, Senjyu T, Chakchai S, et al. Smart Trends in Computing and Communications. Springer Singapore, 2022.

[12] Shi H M, Zhang D Y, Zhang Y H. Can Olympic medals be predicted? - From the perspective of interpretable machine learning. Journal of Shanghai University of Sport, 2024, 48(04): 26-36.

[13] Sun J, Zou X K, Zhu X B, et al. Research on random forest algorithm in the field of online scalper prediction. Computer Simulation, 2025: 1-6.

[14] Yang Q F, Li T, Jia Z Q. Consumption behavior prediction algorithm based on parameter-optimized random forest model. Computer & Digital Engineering, 2024, 52(07): 1959-1965.

[15] Gan M, Liu P F, Yue D B, et al. Prospecting prediction by geoelectrochemical technology in and around the Murong lithium mining area, western Sichuan based on random forest algorithm. Geology and Exploration, 2025, 61(02): 359-370. 

All published work is licensed under a Creative Commons Attribution 4.0 International License. sitemap
Copyright © 2017 - 2025 Science, Technology, Engineering and Mathematics.   All Rights Reserved.