Abstract
The emerging field of computer science research that combines Data Mining and Machine Learning in sports analytics, has revolutionized performance analysis and predictive modeling. This study presents a predictive model leveraging the XGBoost algorithm to forecast scores in Twenty20 (T20) cricket matches, with a comparative analysis against other popular regression algorithms such as Linear Regression, Decision Trees, and Random Forests. The model incorporates a comprehensive dataset of historical T20 match statistics, including player performance, match conditions, and venue changes. Rigorous preprocessing and feature selection techniques were employed to optimize model performance. The results demonstrate that the XGBoost algorithm outperforms other methods, achieving a Mean Absolute Error (MAE) of 1.73 on the test dataset. This margin of error highlights the model's precision in predicting T20 match outcomes, providing valuable insights for strategic decision-making in cricket analytics. The findings suggest that advanced machine learning techniques like XGBoost can significantly enhance predictive accuracy in sports analytics, paving the way for future research and practical applications in the field.



Similar content being viewed by others
Data Availability
The datasets generated and analyzed during the current study are available on the platform Kaggle, [https://www.kaggle.com/veeralakrishna/cricsheet-a-retrosheet-for-cricket?select=t20s]. The data includes “historical T20 cricket match statistics, including player performance, match conditions, and venue changes”].
References
Tejinder Singh, Vishal Singla, Parteek Bhatia;—score and winning prediction in cricket through data mining; Oct 8–10, 2015.
A.N.Wickramasinghe, Roshan D.Yapa, Cricket match outcome prediction using tweets and prediction of the man of the match using social network analysis: case study using IPL data; 2018.
Animal Islam Anik, Sakif yeaser, A.G.M. Emam Hussain, Amitabha Chakraborty, Player's performance prediction in ODI cricket using machine learning algorithms;2018.
Jalaz Kumar, Rajeev Kumar, Pushpender Kumar; Outcome prediction of ODI cricket matches using decision trees and MLP networks;2018.
D. Thenmozhi, P. Mirunalini, S. M. Jaisakthi, Srivatsan Vasudevan , Veeramani Kannan V, SagubarSadiq S, Moneyball—data mining on cricket dataset; 2019.
Nigel Rodrigues1, Nelson Sequeira2, Stephen Rodrigues3, Varsha Shrivastava4, Cricket squad analysis using multiple random forest regression;2019
Siyamalan Manivannan, Mogan Kausik, Convolutional neural network and feature encoding for predicting the outcome of cricket matches;2019
Manuka madranga hatharasinghe, guhanathan poravi data mining and machine learning in cricket match outcome prediction: missing link;2019.
G. Sudhamathy and G. Raja Meenakshi, Prediction On Ipl data using machine learning techniques In R package;2020.
Nimmi Weeraddana, Saminda Premaratne, Unique approach for cricket match outcome prediction using Xgboost algorithms;2021.
Nikhil Dhonge, Shraddha Dhole, Nikita Wavre, Mandar Pardakhe, Amit Nagarale, Ipl cricket score and winning prediction using machine learning techniques; 2021.
Dr. Mrs. Jayshree Pansare, Prof. Mrs. Shubhangi Khande, Ankush Oswal, Zuhayr Munsiff, Suraj Choudhary, Vishal Kumbhar ;Cricket score prediction using Xgboost regression; 2022.
Preetham HK, Prajwal R, Prince Kumar, Naveen Kumar;Cricket score prediction using machine learning; 2023.
Dataset Link—https://www.kaggle.com/veeralakrishna/cricsheet-a-retrosheet-for-cricket?select=t20s. Accessed 10 July 2023.
Funding
Not Applicable.
Author information
Authors and Affiliations
Contributions
Conceptualization: [Gandharv Mohan]. Methodology: [Akhil Tyagi]. Software: [Aryan Kamboj]. Formal analysis:[ Gandharv Mohan, Manpreet Singh]. Investigation: [Chayandeep Chaulia]. Resources: [Aryan Kamboj, Chayandeep Chaulia]. Writing—Review & Editing: [Akhil Tyagi]. Supervision: [Dr. Amandeep Kaur].
Corresponding author
Ethics declarations
Conflict of interest
No.
Research involving human and/or animals
Not Applicable.
Informed consent
Not Applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tyagi, A., Kaur, A., Kamboj, A. et al. XGBoosting Cricket: Enhancing Predictive Modeling for Twenty20 Match Results Using Machine Learning and Statistical Techniques. SN COMPUT. SCI. 5, 1036 (2024). https://doi.org/10.1007/s42979-024-03385-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-024-03385-0