Evaluation of Machine Learning Techniques in Predicting Acute Coronary Syndrome Outcome

Jaafar, Juliana; Atwell, Eric; Johnson, Owen; Clamp, Susan; Ahmad, Wan Azman Wan

doi:10.1007/978-3-319-02621-3_24

Juliana Jaafar³,
Eric Atwell⁴,
Owen Johnson³,
Susan Clamp³ &
…
Wan Azman Wan Ahmad⁵

Included in the following conference series:

International Conference on Innovative Techniques and Applications of Artificial Intelligence

777 Accesses
3 Citations

Abstract

Data mining using machine learning techniques may aid in the development of prediction models for Acute Coronary Syndrome (ACS) patients. ACS prediction models such as TIMI and GRACE have been developed using traditional statistical techniques such as linear regression. In this paper, different machine learning techniques were evaluated to present the potential use of machine learning techniques in classification tasks as basis for future medical prediction model development. A dataset of 960 of ACS patients from the Malaysian National Cardiovascular Disease Database registry was employed and trained on three popular machine learning classifiers i.e. Naïve Bayes, Decision Tree and Neural Network to predict ACS outcome. The outcome being evaluated was whether the patient is dead or alive. An open source tool—Waikato Environment for Knowledge Analysis (WEKA) were used in executing these classification tasks. A 10-folds cross validation technique was used to evaluate the models. The performance of classifiers was presented by their accuracy rate, confusion matrix and area under the receiver operating characteristic curve (AUC). Naïve Bayes and Neural Network show generally convincing results with an average of 0.8 AUC values and 90 % accuracy rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bassand, J.-P., Hamm, C. W., Ardissino, D., Boersma, E., Budaj, A., Fernandez-Aviles, F., Fox, K. A., Hasdai, D., Ohman, E. M. & Wallentin, L. 2007. Guidelines for the diagnosis and treatment of non-ST-segment elevation acute coronary syndromes The Task Force for the Diagnosis and Treatment of Non-ST-Segment Elevation Acute Coronary Syndromes of the European Society of Cardiology. European Heart Journal, 28, 1598–1660.
Google Scholar
SIGN 2007 (Updated 2013). Acute coronary syndromes : A national clinical guideline. Edinburgh, UK: Scottish Intercollegiate Guidelines Network.
Google Scholar
Chin, C. T., Chua, T. & LIM, S. 2010. Risk assessment models in acute coronary syndromes and their applicability in Singapore. Ann Acad Med Singapore, 39, 216–20.
Google Scholar
Antman, E. M., Cohen, M., Bernink, P. J., Mccabe, C. H., Horacek, T., Papuchis, G., Mautner, B., Corbalan, R., Radley, D. & Braunwald, E. 2000. The TIMI risk score for unstable angina/non-ST elevation MI. JAMA: the, journal of the American Medical Association, 284, 835–842.
Google Scholar
Cooney, M. T., Dudina, A. L. & Graham, I. M. 2009. Value and limitations of existing scores for the assessment of cardiovascular risk: a review for clinicians. Journal of the American College of Cardiology, 54, 1209–1227.
Google Scholar
Delen, D., Oztekin, A. & Tomak, L. 2012. An analytic approach to better understanding and management of coronary surgeries. Decision Support Systems, 52, 698–705.
Google Scholar
Cruz, J. A. & Wishart, D. S. 2006. Applications of machine learning in cancer prediction and prognosis. Cancer Informatics, 2, 59.
Google Scholar
Khalilia, M., Chakraborty, S. & Popescu, M. 2011. Predicting disease risks from highly imbalanced data using random forest. Bmc Medical Informatics and Decision Making, 11, 51.
Google Scholar
Westreich, D., Lessler, J. & Funk, M. J. 2010. Propensity score estimation: machine learning and classification methods as alternatives to logistic regression. Journal of clinical epidemiology, 63, 826.
Google Scholar
Song, X., Mitnitski, A., Cox, J. & Rockwood, K. 2004. Comparison of machine learning techniques with classical statistical models in predicting health outcomes. Medinfo, 11, 736–40.
Google Scholar
Oztekin, A., Delen, D. & Kong, Z. J. 2009. Predicting the graft survival for heart-lung transplantation patients: an integrated data mining methodology. International Journal of Medical Informatics, 78, e84.
Google Scholar
Shillabeer, A. & Roddick, J. F. Establishing a lineage for medical knowledge discovery. 2007. Australian Computer Society, Inc., 29–37.
Google Scholar
Li, J., Fu, A. W.-C., He, H., Chen, J., Jin, H., Mcaullay, D., Williams, G., Sparks, R. & Kelman, C. Mining risk patterns in medical data. Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, 2005. ACM, 770–775.
Google Scholar
Hippisley-Cox, J., C. Coupland, Y. Vinogradova, J. Robson and P. Brindle. 2008a. Performance of the QRISK cardiovascular risk prediction algorithm in an independent UK sample of patients from general practice: a validation study. Heart, 94(1), pp. 34–39.
Google Scholar
Chia, C. C., Rubinfeld, I., Scirica, B. M., McMillan, S., Gurm, H. S. & Syed, Z. 2012. Looking Beyond Historical Patient Outcomes to Improve Clinical Models. Science Translational Medicine, 4, 131ra49-131ra49.
Google Scholar
Stolba, N. and A. M. Tjoa. 2006. The Relevance of Data Warehousing and Data Mining in the Field of Evidence-based Medicine to Support Healthcare Decision Making. In: C. ARDIL, ed. Proceedings of World Academy of Science, Engineering and Technology, Vol 11. pp. 12–17.
Google Scholar
Horvitz, E. 2010. From Data to Predictions and Decisions: Enabling Evidence-Based Healthcare. Computing Community Consortium, 6.
Google Scholar
Kotsiantis, S., Zaharakis, I. & Pintelas, P. 2007. Supervised machine learning: A review of classification techniques. Frontiers in Artificial Intelligence and Applications, 160, 3.
Google Scholar
Mitchell., T. M. 1997. Machine Learning New York; London, McGraw-Hill.
Google Scholar
Quinlan., J. R. 1993. C4.5 : Programs for Machine Learning, San Mateo, California, Morgan Kaufmann.
Google Scholar
Lau., C. 1992. Neural networks : theoretical foundations and analysis, New York, IEEE Press.
Google Scholar
Chin, S., Jeyaindran, S., Azhari, R., Wan Azman, W., Omar, I., Robaayah, Z. & SIM, K. 2008. Acute coronary syndrome (ACS) registry-leading the charge for National Cardiovascular Disease (NCVD) Database. Med J Malaysia, 63, 29–36.
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P. & Witten, I. H. 2009. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter, 11, 10–18.
Google Scholar
Saeys, Y., Inza, I. & Larranaga, P. 2007. A review of feature selection techniques in bioinformatics. Bioinformatics, 23, 2507–2517.
Google Scholar
Cios, K. J. & Moore, G. W. 2002. Uniqueness of medical data mining. Artificial Intelligence in Medicine, 26, 1–24.
Google Scholar
Bellazzi, R. & Zupan, B. 2008. Predictive data mining in clinical medicine: Current issues and guidelines. International Journal of Medical Informatics, 77, 81–97.
Google Scholar
OlsonLSON, D. L. & DELEN, D. 2008. Advanced data mining techniques, Springer Verlag.
Google Scholar
Ian H. Witten, E. F. 2005. Data mining : practical machine learning tools and techniques, Amsterdam; London : Elsevier, c2005.
Google Scholar
Han, J. & Kamber, M. 2001. Data mining : concepts and techniques, San Francisco Morgan Kaufmann Publishers.
Google Scholar
Fawcett, T. 2004. ROC graphs: Notes and practical considerations for researchers. Machine Learning, 31, 1–38.
Google Scholar
Guo, X., Yin, Y., Dong, C., Yang, G. & Zhou, G. On the class imbalance problem. Natural Computation, 2008. ICNC’08. Fourth International Conference on, 2008. IEEE, 192–201.
Google Scholar
Chawla, N. V. C4. 5 and imbalanced data sets: investigating the effect of sampling method, probabilistic estimate, and decision tree structure. Proceedings of the ICML, 2003.
Google Scholar
Folorunso, S. & Adeyemo, A. 2013. Alleviating Classification Problem of Imbalanced Dataset. African Journal of Computing & ICT, 6.
Google Scholar
Weng, C. G. & Poon, J. A new evaluation measure for imbalanced datasets. Proceedings of the 7th Australasian Data Mining Conference-Volume 87, 2008. Australian Computer Society, Inc., 27–32.
Google Scholar
Kurz, D. J., Bernstein, A., Hunt, K., Radovanovic, D., Erne, P., Siudak, Z. & Bertel, O. 2009. Simple point-of-care risk stratification in acute coronary syndromes: the AMIS model. Heart, 95, 662–668.
Google Scholar
Kononenko, I., Bratko, I. & Kukar, M. 1997. Application of machine learning to medical diagnosis. Machine Learning and Data Mining: Methods and Applications, 389, 408.
Google Scholar
Wyatt, J. C. & Douglas G Altman, H. 1995. Commentary: Prognostic models: clinically useful or quickly forgotten? BMJ, 311, 1539.
Google Scholar
Kononenko, I. 2001. Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in Medicine, 23, 89–109.
Google Scholar

Download references

Acknowledgments

The authors would like to thank the Governance Board member of Malaysian National Cardiovascular Disease Database Registry for providing us the Acute Coronary Syndrome (ACS) dataset to be used for the research. Also, University Kuala Lumpur (UniKL), Malaysia for funding the PHD of the main author.

Author information

Authors and Affiliations

Yorkshire Centre for Health Informatics (YCHI), University of Leeds, Leeds, UK
Juliana Jaafar, Owen Johnson & Susan Clamp
School of Computing, University of Leeds, Leeds, UK
Eric Atwell
Department of Medicine, University Malaya Medical Centre, Kuala Lumpur, Malaysia
Wan Azman Wan Ahmad

Authors

Juliana Jaafar
View author publications
You can also search for this author in PubMed Google Scholar
Eric Atwell
View author publications
You can also search for this author in PubMed Google Scholar
Owen Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Susan Clamp
View author publications
You can also search for this author in PubMed Google Scholar
Wan Azman Wan Ahmad
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juliana Jaafar .

Editor information

Editors and Affiliations

School of Computing, University of Portsmouth, Portsmouth, United Kingdom
Max Bramer
School of Computing, Engineering and Mathematics, University of Brighton, Brighton, West Sussex, United Kingdom
Miltos Petridis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jaafar, J., Atwell, E., Johnson, O., Clamp, S., Ahmad, W.A. (2013). Evaluation of Machine Learning Techniques in Predicting Acute Coronary Syndrome Outcome. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXX. SGAI 2013. Springer, Cham. https://doi.org/10.1007/978-3-319-02621-3_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-02621-3_24
Published: 07 November 2013
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02620-6
Online ISBN: 978-3-319-02621-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics