Abstract
Early risk assessment is essential for addressing cardiovascular disease, a major healthcare issue, and accurate diagnosis is a prerequisite for prompt medical care and medication. Deep learning (DL) approaches have yielded promising outcomes in the detection of coronary artery disease (CAD). Because previous work [Waqar et al. Sci Programm. 2021;2021:1–12; Muntasir Nishat et al. Sci Programm. 2022;2022:1–17; Krishnan et al. Int J Electr Comput Eng. 2021;11(6):2088–8708] is mostly based on open repository data, real-world data are needed to observe the performance of DL-based models. For this study, real-world electronic health records (EHR) were collected from Excelcare Hospital in Guwahati, Assam, India, at three timelines spaced at six-month intervals and concatenated into multiclass sequential curated-electronic health records (MSC-EHR). The FRS risk-estimation method is used to frame the task as multiclass classification with four risk labels on the curated dataset. This work aims to harness the benefits of MSC-EHR data with age-specific clusters (ASC) over an age-agnostic cluster (AAC) for improved CAD prediction. It proposes two hybrid models based on deep learning and is divided into two phases that analyze the AAC and ASC datasets independently. The purpose is to analyze the performance of the hybrid models on the curated dataset and to investigate the impact of data pre-processing and balancing techniques on model performance. In phase 1, Hybrid-Model1 (RNN+GRU) achieved 93.27% accuracy and Hybrid-Model2 (LSTM+GRU) achieved 94.01%; in phase 2, Hybrid-Model1 attained 96.93% accuracy (ASC2) and Hybrid-Model2 attained 97.28% (ASC2), highlighting the importance of ASC data over AAC data in CAD prediction.
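As a rough illustration of the data layout described above, the sketch below arranges visit records from three six-month timelines into one time-ordered sequence per patient (the age-agnostic view) and then partitions those sequences into age bands (the age-specific view). The field names, the sample values, the three cluster labels and the age cut-offs are all assumptions made for the example; the abstract does not specify them, and this is not the authors' pipeline.

```python
from collections import defaultdict

# Hypothetical visit records: one row per patient per six-month timeline.
# Field names and values are illustrative only, not taken from the study data.
VISITS = [
    {"patient_id": "P1", "timeline": 2, "age": 43, "features": [130.0, 201.0]},
    {"patient_id": "P1", "timeline": 0, "age": 42, "features": [120.0, 190.0]},
    {"patient_id": "P1", "timeline": 1, "age": 42, "features": [124.0, 195.0]},
    {"patient_id": "P2", "timeline": 0, "age": 67, "features": [145.0, 240.0]},
    {"patient_id": "P2", "timeline": 1, "age": 67, "features": [150.0, 238.0]},
    {"patient_id": "P2", "timeline": 2, "age": 68, "features": [148.0, 236.0]},
    {"patient_id": "P3", "timeline": 0, "age": 55, "features": [135.0, 210.0]},
]

# Assumed age bands (inclusive); the abstract gives no cut-offs or cluster count.
AGE_BANDS = {"ASC1": (0, 49), "ASC2": (50, 64), "ASC3": (65, 200)}


def build_sequences(visits, n_timelines=3):
    """Group visits per patient into a time-ordered list of feature vectors."""
    by_patient = defaultdict(dict)
    for visit in visits:
        by_patient[visit["patient_id"]][visit["timeline"]] = visit

    sequences = {}
    for pid, steps in by_patient.items():
        if len(steps) != n_timelines:
            continue  # drop patients without a complete set of timelines
        ordered = [steps[t] for t in sorted(steps)]
        sequences[pid] = {
            "age": ordered[0]["age"],  # age at the first timeline
            "x": [step["features"] for step in ordered],
        }
    return sequences


def split_by_age(sequences, bands=AGE_BANDS):
    """Partition per-patient sequences into age-specific clusters."""
    clusters = {name: {} for name in bands}
    for pid, seq in sequences.items():
        for name, (low, high) in bands.items():
            if low <= seq["age"] <= high:
                clusters[name][pid] = seq
                break
    return clusters


aac = build_sequences(VISITS)  # age-agnostic: all complete sequences together
asc = split_by_age(aac)        # age-specific: the same sequences, split by band
```

Each sequence in `aac` has three time steps, which is the shape a recurrent layer such as an LSTM or GRU would consume. Under this layout, one model would be trained on `aac` as a whole and separate models on each entry of `asc`.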








Data Availability and Access
The data that support the findings of this study are available from Excelcare Hospital, Guwahati, India, but restrictions apply to their availability, so they are not publicly available. The data are, however, available from the authors upon reasonable request and with the permission of the Managing Director and HOD of Cardiology, Excelcare Hospital, Guwahati, India.
References
Manjurul Ahsan M, Siddique Z. Machine learning-based heart disease diagnosis: a systematic literature review. arXiv e-prints, 2021;2112.
Kaur I, Doja M, Ahmad T, Ahmad M, Hussain A, Nadeem A, El-Latif A, Ahmed A, et al. An integrated approach for cancer survival prediction using data mining techniques. Comput Intell Neurosci 2021;2021:14 Article ID 6342226.
Muhammad Y, Tahir M, Hayat M, Chong KT. Early and accurate detection and diagnosis of heart disease using intelligent computational model. Sci Reports. 2020;10(1):19747.
Chokwijitkul T, Nguyen A, Hassanzadeh H, Perez S. Identifying risk factors for heart disease in electronic medical records: a deep learning approach. In: Proceedings of the BioNLP 2018 Workshop, 2018; pp. 18–27.
Johri AM, Mantella LE, Jamthikar AD, Saba L, Laird JR, Suri JS. Role of artificial intelligence in cardiovascular risk prediction and outcomes: comparison of machine-learning and conventional statistical approaches for the analysis of carotid ultrasound features and intra-plaque neovascularization. Int J Cardiovasc Imaging. 2021;37(11):3145–56.
Firdous N, Din NMU, Assad A. An imbalanced classification approach for establishment of cause-effect relationship between heart-failure and pulmonary embolism using deep reinforcement learning. Eng Appl Artif Intell. 2023;126: 107004.
Xie S, Yu Z, Lv Z. Multi-disease prediction based on deep learning: a survey. Comput Model Eng Sci. 2021;128(2):489–522.
Swathy M, Saruladha K. A comparative study of classification and prediction of cardio-vascular diseases (cvd) using machine learning and deep learning techniques. ICT Express. 2022;8(1):109–16.
Ishaq A, Sadiq S, Umer M, Ullah S, Mirjalili S, Rupapara V, Nappi M. Improving the prediction of heart failure patients’ survival using smote and effective data mining techniques. IEEE Access. 2021;9:39707–16.
Jonnagaddala J, Liaw S-T, Ray P, Kumar M, Chang N-W, Dai H-J. Coronary artery disease risk assessment from unstructured electronic health records using text mining. J Biomed Inform. 2015;58:203–10.
Morid MA, Sheng ORL, Dunbar J. Time series prediction using deep learning methods in healthcare. ACM Trans Manage Inf Syst. 2023;14(1):1–29.
Shukla PK, Stalin S, Joshi S, Shukla PK, Pareek PK. Optimization assisted bidirectional gated recurrent unit for healthcare monitoring system in big-data. Appl Soft Comput. 2023;138: 110178.
Wang L, Han M, Li X, Zhang N, Cheng H. Review of classification methods on unbalanced data sets. IEEE Access. 2021;9:64606–28.
Bhavekar GS, Goswami AD. A hybrid model for heart disease prediction using recurrent neural network and long short term memory. Int J Inf Technol. 2022;14(4):1781–9.
Min X, Yu B, Wang F. Predictive modeling of the hospital readmission risk from patients’ claims data using machine learning: a case study on copd. Sci Reports. 2019;9(1):2362.
Smita, Kumar E. Probabilistic decision support system using machine learning techniques: a case study of cardiovascular diseases. J Discrete Math Sci Cryptogr. 2021;24(5):1487–96.
Shamshirband S, Fathi M, Dehzangi A, Chronopoulos AT, Alinejad-Rokny H. A review on deep learning approaches in healthcare systems: taxonomies, challenges, and open issues. J Biomed Inform. 2021;113: 103627.
Bhatt CM, Patel P, Ghetia T, Mazzeo PL. Effective heart disease prediction using machine learning techniques. Algorithms. 2023;16(2):88.
Petrazzini BO, Chaudhary K, Márquez-Luna C, Forrest IS, Rocheleau G, Cho J, Narula J, Nadkarni G, Do R. Coronary risk estimation based on clinical data in electronic health records. J Am Coll Cardiol. 2022;79(12):1155–66.
Alaa AM, Bolton T, Di Angelantonio E, Rudd JH, Schaar M. Cardiovascular disease risk prediction using automated machine learning: a prospective study of 423,604 UK biobank participants. PloS One. 2019;14(5):0213653.
Waqar M, Dawood H, Dawood H, Majeed N, Banjar A, Alharbey R. An efficient smote-based deep learning model for heart attack prediction. Sci Programm. 2021;2021:1–12.
Muntasir Nishat M, Faisal F, Jahan Ratul I, Al-Monsur A, Ar-Rafi AM, Nasrullah SM, Reza MT, Khan MRH. A comprehensive investigation of the performances of different machine learning classifiers with smote-enn oversampling technique and hyperparameter optimization for imbalanced heart failure dataset. Sci Programm. 2022;2022:1–17.
Baral S, Alsadoon A, Prasad P, Al Aloussi S, Alsadoon OH. A novel solution of using deep learning for early prediction cardiac arrest in sepsis patient: enhanced bidirectional long short-term memory (lstm). Multimed Tools Appl. 2021;80:32639–64.
Sharma N, Malviya L, Jadhav A, Lalwani P. A hybrid deep neural net learning model for predicting coronary heart disease using randomized search cross-validation optimization. Decis Anal J. 2023;9: 100331.
Baccouche A, Garcia-Zapirain B, Castillo Olea C, Elmaghraby A. Ensemble deep learning models for heart disease classification: a case study from Mexico. Information. 2020;11(4):207.
Johri AM, Singh KV, Mantella LE, Saba L, Sharma A, Laird JR, Utkarsh K, Singh IM, Gupta S, Kalra MS, et al. Deep learning artificial intelligence framework for multiclass coronary artery disease prediction using combination of conventional risk factors, carotid ultrasound, and intraplaque neovascularization. Comput Biol Med. 2022;150: 106018.
Jamthikar AD, Gupta D, Mantella LE, Saba L, Laird JR, Johri AM, Suri JS. Multiclass machine learning vs. conventional calculators for stroke/cvd risk assessment using carotid plaque predictors with coronary angiography scores as gold standard: A 500 participants study. Int J Cardiovasc Imaging. 2021;37:1171–87.
Krishnan S, Magalingam P, Ibrahim R. Hybrid deep learning model using recurrent neural network and gated recurrent unit for heart disease prediction. Int J Electr Comput Eng. 2021;11(6):2088–8708.
Ayoobi N, Sharifrazi D, Alizadehsani R, Shoeibi A, Gorriz JM, Moosaei H, Khosravi A, Nahavandi S, Chofreh AG, Goni FA, et al. Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods. Results Phys. 2021;27: 104495.
Alizadehsani R, Khosravi A, Roshanzamir M, Abdar M, Sarrafzadegan N, Shafie D, Khozeimeh F, Shoeibi A, Nahavandi S, Panahiazar M, et al. Coronary artery disease detection using artificial intelligence techniques: a survey of trends, geographical differences and diagnostic features 1991–2020. Comput Biol Med. 2021;128: 104095.
Ahsan MM, Siddique Z. Machine learning-based heart disease diagnosis: a systematic literature review. Artif Intell Med. 2022;128: 102289.
Chen S-F, Loguercio S, Chen K-Y, Lee SE, Park J-B, Liu S, Sadaei HJ, Torkamani A. Artificial intelligence for risk assessment on primary prevention of coronary artery disease. Curr Cardiovasc Risk Rep. 2023;17(12):215–31.
Solares JRA, Raimondi FED, Zhu Y, Rahimian F, Canoy D, Tran J, Gomes ACP, Payberah AH, Zottoli M, Nazarzadeh M, et al. Deep learning for electronic health records: a comparative review of multiple deep neural architectures. J Biomed Inform. 2020;101: 103337.
Rani S, Ahmad T, Masood S. Handling class imbalance problem using oversampling techniques for breast cancer prediction. In: 2023 International Conference on Recent Advances in Electrical, Electronics & Digital Healthcare Technologies (REEDCON), 2023; pp. 693–8. IEEE.
Kotsiantis S, Kanellopoulos D, Pintelas P, et al. Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng. 2006;30(1):25–36.
Gosain A, Sardana S. Handling class imbalance problem using oversampling techniques: a review. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2017; pp. 79–85. IEEE.
Ebenezer AB, Boyinbode O, Idowu OM. A comprehensive analysis of handling imbalanced dataset. Int J. 2021;10(2):454–63.
Durstewitz D. A state space approach for piecewise-linear recurrent neural networks for identifying computational dynamics from neural measurements. PLoS Comput Biol. 2017;13(6):1005542.
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–80.
Zhao R, Wang D, Yan R, Mao K, Shen F, Wang J. Machine health monitoring using local feature-based gated recurrent unit networks. IEEE Trans Ind Electron. 2017;65(2):1539–48.
Tohka J, Van Gils M. Evaluation of machine learning algorithms for health and wellness applications: a tutorial. Comput Biol Med. 2021;132: 104324.
Aladeyelu AC, Adekunle GT. Predicting heart disease using machine learning. Mach Learn. 2023;10(4):15837–41.
Arooj S, Rehman SU, Imran A, Almuhaimeed A, Alzahrani AK, Alzahrani A. A deep convolutional neural network for the early detection of heart disease. Biomedicines. 2022;10(11):2796.
Junsomboon N, Phienthrakul T. Combining over-sampling and under-sampling techniques for imbalance dataset. In: Proceedings of the 9th International Conference on Machine Learning and Computing, 2017; pp. 243–7.
Desuky AS, Hussain S. An improved hybrid approach for handling class imbalance problem. Arab J Sci Eng. 2021;46:3853–64.
Acknowledgements
The authors would like to thank the Managing Director and HOD of Cardiology Excelcare Hospital, Guwahati, India, for help in collecting heart disease data as well as the many members of the department, including laboratory personnel and management.
Funding
No funding.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study’s conception and design. Material preparation, data collection and analysis were performed by Ms. Smita. The first draft of the manuscript was written by Ms. Smita, and other authors commented on previous versions of the manuscript.
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare no potential conflict of interest. All authors have read and approved the work for submission to the journal.
Ethical and Informed Consent for Data Used
Study-specific ethical approval for the use of clinical samples and the retrieval of clinical data was granted by the Managing Director and HOD of Cardiology, Excelcare Hospital, Guwahati, India. Clinical data were collected during the COVID period, and approval was received through Dr. NEIL BARDOLOI (MBBS, MD, DM (AIIMS), FACC, FESC, Managing Director and HOD, Cardiology, Excelcare Hospital, Guwahati, India), e-mail: drbardoloineil@gmail.com.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Security for Communication and Computing Application” guest edited by Karan Singh, Ali Ahmadian, Ahmed Mohamed Aziz Ismail, R S Yadav, Md. Akbar Hossain, D. K. Lobiyal, Mohamed Abdel-Basset, Soheil Salahshour, Anura P. Jayasumana, Satya P. Singh, Walid Osamy, Mehdi Salimi and Norazak Senu.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Smita, Kumar, E. Age Specific Analysis on Multiclass Sequential Curated-Electronic Health Records (MSC-EHR) for CAD Survival Prediction using Deep Learning Techniques. SN COMPUT. SCI. 5, 603 (2024). https://doi.org/10.1007/s42979-024-02946-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-024-02946-7