Skip to main content
Log in

Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm

  • Application of soft computing
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

In recent years, technological advancements have been replicated in various industries, including sports medicine. Recent developments, such as big data analytics and data mining, which have revolutionized medical services in sports, are apparent in this transformation. This technological shift is motivated by the need to enhance athletic performance, prevent injuries, and offer individualized health advice. Modern lifestyles have simultaneously increased people’s attention to their health, creating a demand for better medical services. However, China’s ability to provide superior medical care needs to be improved due to a lack of medical resources and an ever-increasing patient population. To address these challenges, this research paper presents an integrated framework that leverages Spark-based big data analytics and the XGBoost algorithm. The framework aims to provide a robust sports medical service encompassing real-time health monitoring and data-driven insights. Powered by the formidable distributed computing platform Spark, it adeptly manages extensive sports data generated during training and events, facilitating instant health evaluations. Incorporating the XGBoost algorithm for data mining amplifies health prediction and recommendation capabilities. Renowned for its predictive prowess, XGBoost excels in discerning intricate sports data patterns and trends. Its proficiency in tackling intricates feature selection and modeling tasks ensures precision and actionable insights. Empirical findings underscore substantial enhancements in sports medical services. When applied to chronic disease datasets, the XGBoost algorithm garnered an impressive 93% trust rate. In contrast to conventional methods like K-Nearest Neighbors (KNN), Random Forest (RF), Decision Trees (DT), Support Vector Machines (SVM), Naïve Bayes (NB), and Logistic Regression (LR), the proposed framework consistently outperforms these established techniques. This remarkable performance underscores the transformative potential of the integrated framework in revolutionizing sports medical services.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25

Similar content being viewed by others

Data availability

The data that support the findings of this study are available from the corresponding author, upon reasonable request.

References

  • Ali M, Yin B, Kumar A, Sheikh AM et al (2020) Reduction of multiplications in convolutional neural networks. In: 2020 39th Chinese Control Conference (CCC). IEEE, pp 7406–7411. https://doi.org/10.23919/CCC50068.2020.9188843

  • Ali M, Yin B, Bilal H et al (2023) Advanced efficient strategy for detection of dark objects based on spiking network with multi-box detection. Multimed Tools Appl. https://doi.org/10.1007/s11042-023-16852-2

    Article  Google Scholar 

  • Aminizadeh S, Heidari A, Toumaj S, Darbandi M, Navimipour NJ, Rezaei M, Talebi S, Azad P, Unal M (2023) The applications of machine learning techniques in medical data processing based on distributed computing and the internet of things. Comput Methods Programs Biomed 241:107745

    Article  Google Scholar 

  • Ashraf S, Afify YM, Ismail R (2022) Big data for real-time processing on streaming data: state-of-the-art and future challenges. In: 2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME). IEEE, pp 1–8

  • Aslam MS, Dai X, Hou J, Li Q, Ullah R, Ni Z, Liu Y (2020) Reliable control design for composite-driven scheme based on delay networked T-S fuzzy system. Int J Robust Nonlinear Control 30(4):1622–1642

    Article  MathSciNet  Google Scholar 

  • Aslam MS, Qaisar I, Majid A, Shamrooz S (2023) Adaptive event-triggered robust H∞ control for Takagi-Sugeno fuzzy networked Markov jump systems with time-varying delay. Asian J Control 25(1):213–228

    Article  MathSciNet  Google Scholar 

  • Bao N, Zhang T, Huang R, Biswal S, Su J, Wang Y (2023) A deep transfer learning network for structural condition identification with limited real-world training data. Struct Control Health Monit. https://doi.org/10.1155/2023/8899806

    Article  Google Scholar 

  • Chen Z (2019) Observer-based dissipative output feedback control for network T-S fuzzy systems under time delays with mismatch premise. Nonlinear Dyn 95:2923–2941

    Article  Google Scholar 

  • Chen G, Chen P, Huang W, Zhai J (2022) Continuance intention mechanism of middle school student users on online learning platform based on qualitative comparative analysis method. Math Probl Eng 2022:1–12

    Google Scholar 

  • Cheng B, Wang M, Zhao S, Zhai Z, Zhu D, Chen J (2017) Situation-aware dynamic service coordination in an IoT environment. IEEE/ACM Trans Netw 25(4):2082–2095

    Article  Google Scholar 

  • Debie A, Khatri RB, Assefa Y (2022) Successes and challenges of health systems governance towards universal health coverage and global health security: a narrative review and synthesis of the literature. Health Res Policy Syst 20(1):50

    Article  Google Scholar 

  • Dong J, Hu J, Zhao Y, Peng Y (2023) Opinion formation analysis for expressed and private opinions (EPOs) models: reasoning private opinions from behaviors in group decision-making systems. Expert Syst Appl 236:121292

    Article  Google Scholar 

  • Dou H, Liu Y, Chen S et al (2023) A hybrid CEEMD-GMM scheme for enhancing the detection of traffic flow on highways. Soft Comput 27:16373–16388. https://doi.org/10.1007/s00500-023-09164-y

    Article  Google Scholar 

  • Dwivedi YK, Hughes L, Baabdullah AM, Ribeiro-Navarrete S, Giannakis M, Al-Debei MM, Dennehy D, Metri B, Buhalis D, Cheung CM, Conboy K (2022) Metaverse beyond the hype: multidisciplinary perspectives on emerging challenges, opportunities, and agenda for research, practice and policy. Int J Inf Manag 66:102542

    Article  Google Scholar 

  • Fan W, Yang L, Bouguila N (2022) Unsupervised grouped axial data modeling via hierarchical Bayesian nonparametric models with watson distributions. IEEE Trans Pattern Anal Mach Intell 44(12):9654–9668

    Article  Google Scholar 

  • Gilvesy A, Husen E, Magloczky Z, Mihaly O, Hortobágyi T, Kanatani S, Heinsen H, Renier N, Hökfelt T, Mulder J, Uhlen M (2022) Spatiotemporal characterization of cellular tau pathology in the human locus coeruleus–pericoerulear complex by three-dimensional imaging. Acta Neuropathol 144(4):651–676

    Article  Google Scholar 

  • Holm PM, Simonÿ C, Brydegaard NK, Høgsgaard D, Thorborg K, Møller M, Whittaker JL, Roos EM, Skou ST (2023) An early care void: the injury experience and perceptions of treatment among knee-injured individuals and healthcare providers–a qualitative interview study. Phys Ther Sport 64:32–40

    Article  Google Scholar 

  • Hosseini M, Wieczorek M, Gordijn B (2022) Ethical issues in social science research employing big data. Sci Eng Ethics 28(3):29

    Article  Google Scholar 

  • Hu Z, Ren L, Wei G, Qian Z, Liang W, Chen W, Lu X, Ren L, Wang K (2022) Energy flow and functional behavior of individual muscles at different speeds during human walking. IEEE Trans Neural Syst Rehabil Eng 31:294–303

    Article  Google Scholar 

  • Jensen CB, Norbye B, Dahlgren MA, Iversen A (2023) Getting real in interprofessional clinical placements: patient-centeredness in student teams’ collaborative learning. Adv Health Sci Educ 28(3):687–703

    Article  Google Scholar 

  • Johnson WR, Mian A, Robinson MA, Verheul J, Lloyd DG, Alderson JA (2020) Multidimensional ground reaction forces and moments from wearable sensor accelerations via deep learning. IEEE Trans Biomed Eng 68(1):289–297

    Article  Google Scholar 

  • Kumar A, Shaikh AM, Li Y et al (2021) Pruning filters with L1-norm and capped L1-norm for CNN compression. Appl Intell 51:1152–1160. https://doi.org/10.1007/s10489-020-01894-y

    Article  Google Scholar 

  • Kumar V, Babubhai PJ, Fayaz FA, Dhobal K, Rai PK, Rachapalli A (2023) Role of artificial intelligence in the next generation wearable devices. In: 2023 3rd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE). IEEE, pp 1180–1184

  • Liu X, Shi T, Zhou G, Liu M, Yin Z, Yin L, Zheng W (2023a) Emotion classification for short texts: an improved multi-label method. Humanit Soc Sci Commun 10(1):1–9

    Google Scholar 

  • Liu X, Zhou G, Kong M, Yin Z, Li X, Yin L, Zheng W (2023b) Developing multi-labelled corpus of twitter short texts: a semi-automatic method. Systems 11(8):390

    Article  Google Scholar 

  • Lu S, Liu M, Yin L, Yin Z, Liu X, Zheng W (2023) The multi-modal fusion in visual question answering: a review of attention mechanisms. PeerJ Comput Sci 9:e1400

    Article  Google Scholar 

  • Lv Z, Qiao L (2020) Analysis of healthcare big data. Futur Gener Comput Syst 109:103–110

    Article  Google Scholar 

  • Lv Z, Qiao L, Hossain MS, Choi BJ (2021) Analysis of using blockchain to protect the privacy of drone big data. IEEE Netw 35(1):44–49

    Article  Google Scholar 

  • Ni Q, Guo J, Wu W, Wang H, Wu J (2021) Continuous influence-based community partition for social networks. IEEE Trans Netw Sci Eng 9(3):1187–1197

    Article  MathSciNet  Google Scholar 

  • Phatak AA, Wieland FG, Vempala K, Volkmar F, Memmert D (2021) Artificial intelligence based body sensor network framework—narrative review: proposing an end-to-end framework using wearable sensors, real-time location systems and artificial intelligence/machine learning algorithms for data collection, data mining and knowledge discovery in sports and healthcare. Sports Med-Open 7:1–15

    Article  Google Scholar 

  • Qolomany B, Al-Fuqaha A, Gupta A, Benhaddou D, Alwajidi S, Qadir J, Fong AC (2019) Leveraging machine learning and big data for smart buildings: a comprehensive survey. IEEE Access 7:90316–90356

    Article  Google Scholar 

  • Rajeashwari S, Arunesh K (2022) performance analysis for chronic disease prediction using various data mining techniques. In: 2022 3rd International Conference on Electronics and Sustainable Communication Systems (ICESC). IEEE, pp 721–727

  • Shamrooz M, Li Q, Hou J (2021) Fault detection for asynchronous T-S fuzzy networked Markov jump systems with new event-triggered scheme. IET Control Theory Appl 15(11):1461–1473

    Article  MathSciNet  Google Scholar 

  • Shan Y, Wang H, Yang Y, Wang J, Zhao W, Huang Y, Wang H, Han B, Pan N, Jin X, Fan X (2023) Evidence of a large current of transcranial alternating current stimulation directly to deep brain regions. Mol Psychiatry. https://doi.org/10.1038/s41380-023-02150-8

    Article  Google Scholar 

  • Shao Z, Zhai Q, Guan X (2023) Physical-model-aided data-driven linear power flow model: an approach to address missing training data. IEEE Trans Power Syst 38(3):2970–2973

    Article  Google Scholar 

  • Shen Y, Ding N, Zheng HT, Li Y, Yang M (2020) Modeling relation paths for knowledge graph completion. IEEE Trans Knowl Data Eng 33(11):3607–3617

    Article  Google Scholar 

  • Shen X, Du SC, Sun YN, Sun PZ, Law R, Wu EQ (2023) Advance scheduling for chronic care under online or offline revisit uncertainty. IEEE Trans Autom Sci Eng. https://doi.org/10.1109/TASE.2023.3310116

    Article  Google Scholar 

  • Shi J, Niu W, Li Z, Shen C, Zhang J, Yu S, Chi N (2022) Optimal adaptive waveform design utilizing an end-to-end learning-based pre-equalization neural network in an UVLC system. J Lightwave Technol 41(6):1626–1636

    Article  Google Scholar 

  • Ullah R, Dai X, Sheng A (2020) Event-triggered scheme for fault detection and isolation of non-linear system with time-varying delay. IET Control Theory Appl 14(16):2429–2438

    Article  MathSciNet  Google Scholar 

  • Wang L, Zhai Q, Yin B, et al (2019) Second-order convolutional network for crowd counting. In: Proc. SPIE 11198, Fourth International Workshop on Pattern Recognition, p 111980T. https://doi.org/10.1117/12.2540362

  • Wang F, Wang H, Zhou X, Fu R (2022) A driving fatigue feature detection method based on multifractal theory. IEEE Sens J 22(19):19046–19059

    Article  Google Scholar 

  • Wu H, Jin S, Yue W (2022) Pricing policy for a dynamic spectrum allocation scheme with batch requests and impatient packets in cognitive radio networks. J Syst Sci Syst Eng 31(2):133–149

    Article  Google Scholar 

  • Wu Q, Li X, Wang K et al (2023) Regional feature fusion for on-road detection of objects using camera and 3D-LiDAR in high-speed autonomous vehicles. Soft Comput 27:18195–18213. https://doi.org/10.1007/s00500-023-09278-3

    Article  Google Scholar 

  • Xiao Z, Fang H, Jiang H, Bai J, Havyarimana V, Chen H, Jiao L (2021) Understanding private car aggregation effect via spatio-temporal analysis of trajectory data. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2021.3117705

    Article  Google Scholar 

  • Xie X, Xie B, Xiong D, Hou M, Zuo J, Wei G, Chevallier J (2023) New theoretical ISM-K2 Bayesian network model for evaluating vaccination effectiveness. J Ambient Intell Humaniz Comput 14(9):12789–12805

    Article  Google Scholar 

  • Xiong Z, Liu Q, Huang X (2022) The influence of digital educational games on preschool children’s creative thinking. Comput Educ 189:104578

    Article  Google Scholar 

  • Xu H, Sun Z, Cao Y et al (2023) A data-driven approach for intrusion and anomaly detection using automated machine learning for the internet of things. Soft Comput. https://doi.org/10.1007/s00500-023-09037-4

    Article  Google Scholar 

  • Yan L, Yin-He S, Qian Y, Zhi-Yu S, Chun-Zi W, Zi-Yun L (2021) Method of reaching consensus on probability of food safety based on the integration of finite credible data on block chain. IEEE Access 9:123764–123776

    Article  Google Scholar 

  • Yao W, Guo Y, Wu Y, Guo J (2017) Experimental validation of fuzzy PID control of flexible joint system in presence of uncertainties. In: 2017 36th Chinese Control Conference (CCC). IEEE, pp 4192–4197. https://doi.org/10.23919/ChiCC.2017.8028015

  • Yin B, Khan J, Wang L, Zhang J, Kumar A (2019) Real-time lane detection and tracking for advanced driver assistance systems. In: 2019 Chinese Control Conference (CCC). IEEE, pp 6772–6777. https://doi.org/10.23919/ChiCC.2019.8866334

Download references

Funding

This study was funded by the Ministry of Education Industry Education Collaborative Education Project (Research on the Practice Path of College Students' Innovation and Entrepreneurship Education from the Perspective of Project Achievement Transformation; Project No. 202002201011).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bin Li.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Consent to participate

This article does not contain any studies with human participants or animals performed by any of the authors.

Consent for publication

Authors give consent to the journal to publish their article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, Y., Ramos, M.F. & Li, B. Integrated framework to integrate Spark-based big data analytics and for health monitoring and recommendation in sports using XGBoost algorithm. Soft Comput 28, 1585–1608 (2024). https://doi.org/10.1007/s00500-023-09450-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-09450-9

Keywords

Navigation