Abstract
In recent years, technological advancements have been replicated in various industries, including sports medicine. Recent developments, such as big data analytics and data mining, which have revolutionized medical services in sports, are apparent in this transformation. This technological shift is motivated by the need to enhance athletic performance, prevent injuries, and offer individualized health advice. Modern lifestyles have simultaneously increased people’s attention to their health, creating a demand for better medical services. However, China’s ability to provide superior medical care needs to be improved due to a lack of medical resources and an ever-increasing patient population. To address these challenges, this research paper presents an integrated framework that leverages Spark-based big data analytics and the XGBoost algorithm. The framework aims to provide a robust sports medical service encompassing real-time health monitoring and data-driven insights. Powered by the formidable distributed computing platform Spark, it adeptly manages extensive sports data generated during training and events, facilitating instant health evaluations. Incorporating the XGBoost algorithm for data mining amplifies health prediction and recommendation capabilities. Renowned for its predictive prowess, XGBoost excels in discerning intricate sports data patterns and trends. Its proficiency in tackling intricates feature selection and modeling tasks ensures precision and actionable insights. Empirical findings underscore substantial enhancements in sports medical services. When applied to chronic disease datasets, the XGBoost algorithm garnered an impressive 93% trust rate. In contrast to conventional methods like K-Nearest Neighbors (KNN), Random Forest (RF), Decision Trees (DT), Support Vector Machines (SVM), Naïve Bayes (NB), and Logistic Regression (LR), the proposed framework consistently outperforms these established techniques. This remarkable performance underscores the transformative potential of the integrated framework in revolutionizing sports medical services.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author, upon reasonable request.
References
Ali M, Yin B, Kumar A, Sheikh AM et al (2020) Reduction of multiplications in convolutional neural networks. In: 2020 39th Chinese Control Conference (CCC). IEEE, pp 7406–7411. https://doi.org/10.23919/CCC50068.2020.9188843
Ali M, Yin B, Bilal H et al (2023) Advanced efficient strategy for detection of dark objects based on spiking network with multi-box detection. Multimed Tools Appl. https://doi.org/10.1007/s11042-023-16852-2
Aminizadeh S, Heidari A, Toumaj S, Darbandi M, Navimipour NJ, Rezaei M, Talebi S, Azad P, Unal M (2023) The applications of machine learning techniques in medical data processing based on distributed computing and the internet of things. Comput Methods Programs Biomed 241:107745
Ashraf S, Afify YM, Ismail R (2022) Big data for real-time processing on streaming data: state-of-the-art and future challenges. In: 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME). IEEE, pp 1–8
Aslam MS, Dai X, Hou J, Li Q, Ullah R, Ni Z, Liu Y (2020) Reliable control design for composite-driven scheme based on delay networked T-S fuzzy system. Int J Robust Nonlinear Control 30(4):1622–1642
Aslam MS, Qaisar I, Majid A, Shamrooz S (2023) Adaptive event-triggered robust H∞ control for Takagi-Sugeno fuzzy networked Markov jump systems with time-varying delay. Asian J Control 25(1):213–228
Bao N, Zhang T, Huang R, Biswal S, Su J, Wang Y (2023) A deep transfer learning network for structural condition identification with limited real-world training data. Struct Control Health Monit. https://doi.org/10.1155/2023/8899806
Chen Z (2019) Observer-based dissipative output feedback control for network T-S fuzzy systems under time delays with mismatch premise. Nonlinear Dyn 95:2923–2941
Chen G, Chen P, Huang W, Zhai J (2022) Continuance intention mechanism of middle school student users on online learning platform based on qualitative comparative analysis method. Math Probl Eng 2022:1–12
Cheng B, Wang M, Zhao S, Zhai Z, Zhu D, Chen J (2017) Situation-aware dynamic service coordination in an IoT environment. IEEE/ACM Trans Netw 25(4):2082–2095
Debie A, Khatri RB, Assefa Y (2022) Successes and challenges of health systems governance towards universal health coverage and global health security: a narrative review and synthesis of the literature. Health Res Policy Syst 20(1):50
Dong J, Hu J, Zhao Y, Peng Y (2023) Opinion formation analysis for expressed and private opinions (EPOs) models: reasoning private opinions from behaviors in group decision-making systems. Expert Syst Appl 236:121292
Dou H, Liu Y, Chen S et al (2023) A hybrid CEEMD-GMM scheme for enhancing the detection of traffic flow on highways. Soft Comput 27:16373–16388. https://doi.org/10.1007/s00500-023-09164-y
Dwivedi YK, Hughes L, Baabdullah AM, Ribeiro-Navarrete S, Giannakis M, Al-Debei MM, Dennehy D, Metri B, Buhalis D, Cheung CM, Conboy K (2022) Metaverse beyond the hype: multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int J Inf Manag 66:102542
Fan W, Yang L, Bouguila N (2022) Unsupervised grouped axial data modeling via hierarchical Bayesian nonparametric models with watson distributions. IEEE Trans Pattern Anal Mach Intell 44(12):9654–9668
Gilvesy A, Husen E, Magloczky Z, Mihaly O, Hortobágyi T, Kanatani S, Heinsen H, Renier N, Hökfelt T, Mulder J, Uhlen M (2022) Spatiotemporal characterization of cellular tau pathology in the human locus coeruleus–pericoerulear complex by three-dimensional imaging. Acta Neuropathol 144(4):651–676
Holm PM, Simonÿ C, Brydegaard NK, Høgsgaard D, Thorborg K, Møller M, Whittaker JL, Roos EM, Skou ST (2023) An early care void: the injury experience and perceptions of treatment among knee-injured individuals and healthcare providers–a qualitative interview study. Phys Ther Sport 64:32–40
Hosseini M, Wieczorek M, Gordijn B (2022) Ethical issues in social science research employing big data. Sci Eng Ethics 28(3):29
Hu Z, Ren L, Wei G, Qian Z, Liang W, Chen W, Lu X, Ren L, Wang K (2022) Energy flow and functional behavior of individual muscles at different speeds during human walking. IEEE Trans Neural Syst Rehabil Eng 31:294–303
Jensen CB, Norbye B, Dahlgren MA, Iversen A (2023) Getting real in interprofessional clinical placements: patient-centeredness in student teams’ collaborative learning. Adv Health Sci Educ 28(3):687–703
Johnson WR, Mian A, Robinson MA, Verheul J, Lloyd DG, Alderson JA (2020) Multidimensional ground reaction forces and moments from wearable sensor accelerations via deep learning. IEEE Trans Biomed Eng 68(1):289–297
Kumar A, Shaikh AM, Li Y et al (2021) Pruning filters with L1-norm and capped L1-norm for CNN compression. Appl Intell 51:1152–1160. https://doi.org/10.1007/s10489-020-01894-y
Kumar V, Babubhai PJ, Fayaz FA, Dhobal K, Rai PK, Rachapalli A (2023) Role of artificial intelligence in the next generation wearable devices. In: 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE). IEEE, pp 1180–1184
Liu X, Shi T, Zhou G, Liu M, Yin Z, Yin L, Zheng W (2023a) Emotion classification for short texts: an improved multi-label method. Humanit Soc Sci Commun 10(1):1–9
Liu X, Zhou G, Kong M, Yin Z, Li X, Yin L, Zheng W (2023b) Developing multi-labelled corpus of twitter short texts: a semi-automatic method. Systems 11(8):390
Lu S, Liu M, Yin L, Yin Z, Liu X, Zheng W (2023) The multi-modal fusion in visual question answering: a review of attention mechanisms. PeerJ Comput Sci 9:e1400
Lv Z, Qiao L (2020) Analysis of healthcare big data. Futur Gener Comput Syst 109:103–110
Lv Z, Qiao L, Hossain MS, Choi BJ (2021) Analysis of using blockchain to protect the privacy of drone big data. IEEE Netw 35(1):44–49
Ni Q, Guo J, Wu W, Wang H, Wu J (2021) Continuous influence-based community partition for social networks. IEEE Trans Netw Sci Eng 9(3):1187–1197
Phatak AA, Wieland FG, Vempala K, Volkmar F, Memmert D (2021) Artificial intelligence based body sensor network framework—narrative review: proposing an end-to-end framework using wearable sensors, real-time location systems and artificial intelligence/machine learning algorithms for data collection, data mining and knowledge discovery in sports and healthcare. Sports Med-Open 7:1–15
Qolomany B, Al-Fuqaha A, Gupta A, Benhaddou D, Alwajidi S, Qadir J, Fong AC (2019) Leveraging machine learning and big data for smart buildings: a comprehensive survey. IEEE Access 7:90316–90356
Rajeashwari S, Arunesh K (2022) performance analysis for chronic disease prediction using various data mining techniques. In: 2022 3rd International Conference on Electronics and Sustainable Communication Systems (ICESC). IEEE, pp 721–727
Shamrooz M, Li Q, Hou J (2021) Fault detection for asynchronous T-S fuzzy networked Markov jump systems with new event-triggered scheme. IET Control Theory Appl 15(11):1461–1473
Shan Y, Wang H, Yang Y, Wang J, Zhao W, Huang Y, Wang H, Han B, Pan N, Jin X, Fan X (2023) Evidence of a large current of transcranial alternating current stimulation directly to deep brain regions. Mol Psychiatry. https://doi.org/10.1038/s41380-023-02150-8
Shao Z, Zhai Q, Guan X (2023) Physical-model-aided data-driven linear power flow model: an approach to address missing training data. IEEE Trans Power Syst 38(3):2970–2973
Shen Y, Ding N, Zheng HT, Li Y, Yang M (2020) Modeling relation paths for knowledge graph completion. IEEE Trans Knowl Data Eng 33(11):3607–3617
Shen X, Du SC, Sun YN, Sun PZ, Law R, Wu EQ (2023) Advance scheduling for chronic care under online or offline revisit uncertainty. IEEE Trans Autom Sci Eng. https://doi.org/10.1109/TASE.2023.3310116
Shi J, Niu W, Li Z, Shen C, Zhang J, Yu S, Chi N (2022) Optimal adaptive waveform design utilizing an end-to-end learning-based pre-equalization neural network in an UVLC system. J Lightwave Technol 41(6):1626–1636
Ullah R, Dai X, Sheng A (2020) Event-triggered scheme for fault detection and isolation of non-linear system with time-varying delay. IET Control Theory Appl 14(16):2429–2438
Wang L, Zhai Q, Yin B, et al (2019) Second-order convolutional network for crowd counting. In: Proc. SPIE 11198, Fourth International Workshop on Pattern Recognition, p 111980T. https://doi.org/10.1117/12.2540362
Wang F, Wang H, Zhou X, Fu R (2022) A driving fatigue feature detection method based on multifractal theory. IEEE Sens J 22(19):19046–19059
Wu H, Jin S, Yue W (2022) Pricing policy for a dynamic spectrum allocation scheme with batch requests and impatient packets in cognitive radio networks. J Syst Sci Syst Eng 31(2):133–149
Wu Q, Li X, Wang K et al (2023) Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles. Soft Comput 27:18195–18213. https://doi.org/10.1007/s00500-023-09278-3
Xiao Z, Fang H, Jiang H, Bai J, Havyarimana V, Chen H, Jiao L (2021) Understanding private car aggregation effect via spatio-temporal analysis of trajectory data. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2021.3117705
Xie X, Xie B, Xiong D, Hou M, Zuo J, Wei G, Chevallier J (2023) New theoretical ISM-K2 Bayesian network model for evaluating vaccination effectiveness. J Ambient Intell Humaniz Comput 14(9):12789–12805
Xiong Z, Liu Q, Huang X (2022) The influence of digital educational games on preschool children’s creative thinking. Comput Educ 189:104578
Xu H, Sun Z, Cao Y et al (2023) A data-driven approach for intrusion and anomaly detection using automated machine learning for the internet of things. Soft Comput. https://doi.org/10.1007/s00500-023-09037-4
Yan L, Yin-He S, Qian Y, Zhi-Yu S, Chun-Zi W, Zi-Yun L (2021) Method of reaching consensus on probability of food safety based on the integration of finite credible data on block chain. IEEE Access 9:123764–123776
Yao W, Guo Y, Wu Y, Guo J (2017) Experimental validation of fuzzy PID control of flexible joint system in presence of uncertainties. In: 2017 36th Chinese Control Conference (CCC). IEEE, pp 4192–4197. https://doi.org/10.23919/ChiCC.2017.8028015
Yin B, Khan J, Wang L, Zhang J, Kumar A (2019) Real-time lane detection and tracking for advanced driver assistance systems. In: 2019 Chinese Control Conference (CCC). IEEE, pp 6772–6777. https://doi.org/10.23919/ChiCC.2019.8866334
Funding
This study was funded by the Ministry of Education Industry Education Collaborative Education Project (Research on the Practice Path of College Students' Innovation and Entrepreneurship Education from the Perspective of Project Achievement Transformation; Project No. 202002201011).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Consent to participate
This article does not contain any studies with human participants or animals performed by any of the authors.
Consent for publication
Authors give consent to the journal to publish their article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhao, Y., Ramos, M.F. & Li, B. Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm. Soft Comput 28, 1585–1608 (2024). https://doi.org/10.1007/s00500-023-09450-9
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-023-09450-9