Skip to main content

Evaluation of Machine Learning Techniques in Predicting Acute Coronary Syndrome Outcome

  • Conference paper
  • First Online:
Research and Development in Intelligent Systems XXX (SGAI 2013)

Abstract

Data mining using machine learning techniques may aid in the development of prediction models for Acute Coronary Syndrome (ACS) patients. ACS prediction models such as TIMI and GRACE have been developed using traditional statistical techniques such as linear regression. In this paper, different machine learning techniques were evaluated to present the potential use of machine learning techniques in classification tasks as basis for future medical prediction model development. A dataset of 960 of ACS patients from the Malaysian National Cardiovascular Disease Database registry was employed and trained on three popular machine learning classifiers i.e. Naïve Bayes, Decision Tree and Neural Network to predict ACS outcome. The outcome being evaluated was whether the patient is dead or alive. An open source tool—Waikato Environment for Knowledge Analysis (WEKA) were used in executing these classification tasks. A 10-folds cross validation technique was used to evaluate the models. The performance of classifiers was presented by their accuracy rate, confusion matrix and area under the receiver operating characteristic curve (AUC). Naïve Bayes and Neural Network show generally convincing results with an average of 0.8 AUC values and 90 % accuracy rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bassand, J.-P., Hamm, C. W., Ardissino, D., Boersma, E., Budaj, A., Fernandez-Aviles, F., Fox, K. A., Hasdai, D., Ohman, E. M. & Wallentin, L. 2007. Guidelines for the diagnosis and treatment of non-ST-segment elevation acute coronary syndromes The Task Force for the Diagnosis and Treatment of Non-ST-Segment Elevation Acute Coronary Syndromes of the European Society of Cardiology. European Heart Journal, 28, 1598–1660.

    Google Scholar 

  2. SIGN 2007 (Updated 2013). Acute coronary syndromes : A national clinical guideline. Edinburgh, UK: Scottish Intercollegiate Guidelines Network.

    Google Scholar 

  3. Chin, C. T., Chua, T. & LIM, S. 2010. Risk assessment models in acute coronary syndromes and their applicability in Singapore. Ann Acad Med Singapore, 39, 216–20.

    Google Scholar 

  4. Antman, E. M., Cohen, M., Bernink, P. J., Mccabe, C. H., Horacek, T., Papuchis, G., Mautner, B., Corbalan, R., Radley, D. & Braunwald, E. 2000. The TIMI risk score for unstable angina/non-ST elevation MI. JAMA: the, journal of the American Medical Association, 284, 835–842.

    Google Scholar 

  5. Cooney, M. T., Dudina, A. L. & Graham, I. M. 2009. Value and limitations of existing scores for the assessment of cardiovascular risk: a review for clinicians. Journal of the American College of Cardiology, 54, 1209–1227.

    Google Scholar 

  6. Delen, D., Oztekin, A. & Tomak, L. 2012. An analytic approach to better understanding and management of coronary surgeries. Decision Support Systems, 52, 698–705.

    Google Scholar 

  7. Cruz, J. A. & Wishart, D. S. 2006. Applications of machine learning in cancer prediction and prognosis. Cancer Informatics, 2, 59.

    Google Scholar 

  8. Khalilia, M., Chakraborty, S. & Popescu, M. 2011. Predicting disease risks from highly imbalanced data using random forest. Bmc Medical Informatics and Decision Making, 11, 51.

    Google Scholar 

  9. Westreich, D., Lessler, J. & Funk, M. J. 2010. Propensity score estimation: machine learning and classification methods as alternatives to logistic regression. Journal of clinical epidemiology, 63, 826.

    Google Scholar 

  10. Song, X., Mitnitski, A., Cox, J. & Rockwood, K. 2004. Comparison of machine learning techniques with classical statistical models in predicting health outcomes. Medinfo, 11, 736–40.

    Google Scholar 

  11. Oztekin, A., Delen, D. & Kong, Z. J. 2009. Predicting the graft survival for heart-lung transplantation patients: an integrated data mining methodology. International Journal of Medical Informatics, 78, e84.

    Google Scholar 

  12. Shillabeer, A. & Roddick, J. F. Establishing a lineage for medical knowledge discovery. 2007. Australian Computer Society, Inc., 29–37.

    Google Scholar 

  13. Li, J., Fu, A. W.-C., He, H., Chen, J., Jin, H., Mcaullay, D., Williams, G., Sparks, R. & Kelman, C. Mining risk patterns in medical data. Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, 2005. ACM, 770–775.

    Google Scholar 

  14. Hippisley-Cox, J., C. Coupland, Y. Vinogradova, J. Robson and P. Brindle. 2008a. Performance of the QRISK cardiovascular risk prediction algorithm in an independent UK sample of patients from general practice: a validation study. Heart, 94(1), pp. 34–39.

    Google Scholar 

  15. Chia, C. C., Rubinfeld, I., Scirica, B. M., McMillan, S., Gurm, H. S. & Syed, Z. 2012. Looking Beyond Historical Patient Outcomes to Improve Clinical Models. Science Translational Medicine, 4, 131ra49-131ra49.

    Google Scholar 

  16. Stolba, N. and A. M. Tjoa. 2006. The Relevance of Data Warehousing and Data Mining in the Field of Evidence-based Medicine to Support Healthcare Decision Making. In: C. ARDIL, ed. Proceedings of World Academy of Science, Engineering and Technology, Vol 11. pp. 12–17.

    Google Scholar 

  17. Horvitz, E. 2010. From Data to Predictions and Decisions: Enabling Evidence-Based Healthcare. Computing Community Consortium, 6.

    Google Scholar 

  18. Kotsiantis, S., Zaharakis, I. & Pintelas, P. 2007. Supervised machine learning: A review of classification techniques. Frontiers in Artificial Intelligence and Applications, 160, 3.

    Google Scholar 

  19. Mitchell., T. M. 1997. Machine Learning New York; London, McGraw-Hill.

    Google Scholar 

  20. Quinlan., J. R. 1993. C4.5 : Programs for Machine Learning, San Mateo, California, Morgan Kaufmann.

    Google Scholar 

  21. Lau., C. 1992. Neural networks : theoretical foundations and analysis, New York, IEEE Press.

    Google Scholar 

  22. Chin, S., Jeyaindran, S., Azhari, R., Wan Azman, W., Omar, I., Robaayah, Z. & SIM, K. 2008. Acute coronary syndrome (ACS) registry-leading the charge for National Cardiovascular Disease (NCVD) Database. Med J Malaysia, 63, 29–36.

    Google Scholar 

  23. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P. & Witten, I. H. 2009. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter, 11, 10–18.

    Google Scholar 

  24. Saeys, Y., Inza, I. & Larranaga, P. 2007. A review of feature selection techniques in bioinformatics. Bioinformatics, 23, 2507–2517.

    Google Scholar 

  25. Cios, K. J. & Moore, G. W. 2002. Uniqueness of medical data mining. Artificial Intelligence in Medicine, 26, 1–24.

    Google Scholar 

  26. Bellazzi, R. & Zupan, B. 2008. Predictive data mining in clinical medicine: Current issues and guidelines. International Journal of Medical Informatics, 77, 81–97.

    Google Scholar 

  27. OlsonLSON, D. L. & DELEN, D. 2008. Advanced data mining techniques, Springer Verlag.

    Google Scholar 

  28. Ian H. Witten, E. F. 2005. Data mining : practical machine learning tools and techniques, Amsterdam; London : Elsevier, c2005.

    Google Scholar 

  29. Han, J. & Kamber, M. 2001. Data mining : concepts and techniques, San Francisco Morgan Kaufmann Publishers.

    Google Scholar 

  30. Fawcett, T. 2004. ROC graphs: Notes and practical considerations for researchers. Machine Learning, 31, 1–38.

    Google Scholar 

  31. Guo, X., Yin, Y., Dong, C., Yang, G. & Zhou, G. On the class imbalance problem. Natural Computation, 2008. ICNC’08. Fourth International Conference on, 2008. IEEE, 192–201.

    Google Scholar 

  32. Chawla, N. V. C4. 5 and imbalanced data sets: investigating the effect of sampling method, probabilistic estimate, and decision tree structure. Proceedings of the ICML, 2003.

    Google Scholar 

  33. Folorunso, S. & Adeyemo, A. 2013. Alleviating Classification Problem of Imbalanced Dataset. African Journal of Computing & ICT, 6.

    Google Scholar 

  34. Weng, C. G. & Poon, J. A new evaluation measure for imbalanced datasets. Proceedings of the 7th Australasian Data Mining Conference-Volume 87, 2008. Australian Computer Society, Inc., 27–32.

    Google Scholar 

  35. Kurz, D. J., Bernstein, A., Hunt, K., Radovanovic, D., Erne, P., Siudak, Z. & Bertel, O. 2009. Simple point-of-care risk stratification in acute coronary syndromes: the AMIS model. Heart, 95, 662–668.

    Google Scholar 

  36. Kononenko, I., Bratko, I. & Kukar, M. 1997. Application of machine learning to medical diagnosis. Machine Learning and Data Mining: Methods and Applications, 389, 408.

    Google Scholar 

  37. Wyatt, J. C. & Douglas G Altman, H. 1995. Commentary: Prognostic models: clinically useful or quickly forgotten? BMJ, 311, 1539.

    Google Scholar 

  38. Kononenko, I. 2001. Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in Medicine, 23, 89–109.

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank the Governance Board member of Malaysian National Cardiovascular Disease Database Registry for providing us the Acute Coronary Syndrome (ACS) dataset to be used for the research. Also, University Kuala Lumpur (UniKL), Malaysia for funding the PHD of the main author.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Juliana Jaafar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Jaafar, J., Atwell, E., Johnson, O., Clamp, S., Ahmad, W.A. (2013). Evaluation of Machine Learning Techniques in Predicting Acute Coronary Syndrome Outcome. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXX. SGAI 2013. Springer, Cham. https://doi.org/10.1007/978-3-319-02621-3_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-02621-3_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02620-6

  • Online ISBN: 978-3-319-02621-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics