OLYMPIC MEDAL QUANTITY FORECASTING: A RANDOM FOREST ALGORITHM-BASED MODEL CONSTRUCTION
Volume 3, Issue 3, Pp 19-25, 2025
DOI: https://doi.org/10.61784/wjit3038
Author(s)
JunBo Zhu*, LinFeng Li
Affiliation(s)
School of Mathematical and Physical Sciences, Chongqing University of Science and Technology, Chongqing 401331, China.
Corresponding Author
JunBo Zhu
ABSTRACT
Against the backdrop of the unstoppable wave of globalization in sports, the competition for Olympic medals has shown an increasingly fierce trend. Countries have invested a lot of resources to improve their performance in the Olympic Games in order to be in a favorable position in the medal competition. In this study, a random forest model is developed to predict the number of gold medals and the total number of medals of each country in the 2028 Olympic Games. Firstly, the data were obtained from the official website of the Olympic Games and data preprocessing was carried out. After completing data cleaning and organizing, a series of key influence indicators such as whether it is the host country, the number of athletes, the total score and so on are introduced, and then a random forest model is built to predict the total number of medals and gold medals of each country. Finally, based on the prediction results, it was determined that in the 2028 Olympic Games, countries such as Cuba, Germany and Slovakia have the potential to achieve breakthroughs, while countries such as Belgium, Ecuador and Israel may experience a decline in the acquisition of medals. This study breaks through the limitations of linear assumptions in traditional econometric models, utilizes the nonlinear fitting ability of the Random Forest algorithm to capture complex variable interactions, and quantifies the dynamic impact of the 'host effect' on the distribution of medals, and reveals the role weights of the core factors such as historical performance and participation size through characteristic contribution analysis. Meanwhile, the prediction results can provide scientific basis for the National Olympic Committees to optimize resource allocation and formulate strategies, sports economic research and event public opinion prediction.
KEYWORDS
Random forest model; Olympic medal prediction; Data preprocessing; Prediction accuracy
CITE THIS PAPER
JunBo Zhu, LinFeng Li. Olympic medal quantity forecasting: a random forest algorithm-based model construction. World Journal of Information Technology. 2025, 3(3): 19-25. DOI: https://doi.org/10.61784/wjit3038.
REFERENCES
[1] Scelles N, Andreff W, Bonnal L, et al. Forecasting national medal totals at the Summer Olympic Games reconsidered. Social science quarterly, 2020, 101(2): 697-711.
[2] Bredtmann J, Crede C J, Otten S. Olympic medals: Does the past predict the future?. Significance, 2016, 13(3): 22-25.
[3] Andreff W. Economic development as major determinant of Olympic medal wins: predicting performances of Russian and Chinese teams at Sochi Games. International Journal of Economic Policy in Emerging Economies, 2013, 6(4): 314-340.
[4] Vagenas G, Vlachokyriakou E. Olympic medals and demo-economic factors: Novel predictors, the ex-host effect, the exact role of team size, and the “population-GDP” model revisited. Sport Management Review, 2012, 15(2): 211-217.
[5] Cheng H R, Lü J, Yuan T G. Prediction of China's track and field results in the Tokyo Olympic Games from the world top 20 national rankings of track and field events in 2018. Bulletin of Sports Science & Technology, 2020, 28(04): 4-8.
[6] Liu C Y, Wu M Q, Zhang A A, et al. Study on the temporal and spatial differentiation of Chinese Olympic medals from 1984 to 2016. Journal of Physical Education, 2019, 26(01): 75-82.
[7] Ding W Z. Data mining model of Olympic medals based on comprehensive national strength. Information Recording Materials, 2018, 19(03): 231-233.
[8] Balmer N J, Nevill A M, Williams A M. Modelling home advantage in the Summer Olympic Games. Journal of sports sciences, 2003, 21(6): 469-478.
[9] Bao Y, Meng X, Ustin S, et al. Vis-SWIR spectral prediction model for soil organic matter with different grouping strategies. Catena, 2020, 195: 104703.
[10] Hafezi M H, Liu L, Millward H. Learning daily activity sequences of population groups using random forest theory. Transportation research record, 2018, 2672(47): 194-207.
[11] Zhang Y D, Senjyu T, Chakchai S, et al. Smart Trends in Computing and Communications. Springer Singapore, 2022.
[12] Shi H M, Zhang D Y, Zhang Y H. Can Olympic medals be predicted? - From the perspective of interpretable machine learning. Journal of Shanghai University of Sport, 2024, 48(04): 26-36.
[13] Sun J, Zou X K, Zhu X B, et al. Research on random forest algorithm in the field of online scalper prediction. Computer Simulation, 2025: 1-6.
[14] Yang Q F, Li T, Jia Z Q. Consumption behavior prediction algorithm based on parameter-optimized random forest model. Computer & Digital Engineering, 2024, 52(07): 1959-1965.
[15] Gan M, Liu P F, Yue D B, et al. Prospecting prediction by geoelectrochemical technology in and around the Murong lithium mining area, western Sichuan based on random forest algorithm. Geology and Exploration, 2025, 61(02): 359-370.