Skip to main content

Machine Learning: Multi-site Evidence-Based Best Practice Discovery

  • Conference paper
  • First Online:
Book cover Machine Learning, Optimization, and Big Data (MOD 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10122))

Included in the following conference series:

Abstract

This study establishes interoperability among electronic medical records from 737 healthcare sites and performs machine learning for best practice discovery. A mapping algorithm is designed to disambiguate free text entries and to provide a unique and unified way to link content to structured medical concepts despite the extreme variations that can occur during clinical diagnosis documentation. Redundancy is reduced through concept mapping. A SNOMED-CT graph database is created to allow for rapid data access and queries. These integrated data can be accessed through a secured web-based portal. A classification model (DAMIP) is then designed to uncover discriminatory characteristics that can predict the quality of treatment outcome. We demonstrate system usability by analyzing Type II diabetic patients. DAMIP establishes a classification rule on a training set which results in greater than 80% blind predictive accuracy on an independent set of patients. By including features obtained from structured concept mapping, the predictive accuracy is improved to over 88%. The results facilitate evidence-based treatment and optimization of site performance through best practice dissemination and knowledge transfer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Jensen, P.B., Jensen, L.J., Brunak, S.: Mining electronic health records: towards better research applications and clinical care. Nat. Rev. Genet. 13(6), 395–405 (2012)

    Article  Google Scholar 

  2. Park, H., Hardiker, N.: Clinical terminologies: a solution for semantic interoperability. J. Korean Soc. Med. Inform. 15(1), 1–11 (2009)

    Article  Google Scholar 

  3. Rosenbloom, S.T., et al.: Interface terminologies: facilitating direct entry of clinical data into electronic health record systems. J. Am. Med. Inform. Assoc. 13(3), 277–288 (2006)

    Article  Google Scholar 

  4. Donnelly, K.: SNOMED-CT: the advanced terminology and coding system for eHealth. Stud. Health Technol. Inform. 121, 279 (2006)

    Google Scholar 

  5. McDonald, C.J., et al.: LOINC, a universal standard for identifying laboratory observations: a 5-year update. Clin. Chem. 49(4), 624–633 (2003)

    Article  Google Scholar 

  6. Liu, S., et al.: RxNorm: prescription for electronic drug information exchange. IT Prof. 7(5), 17–23 (2005)

    Article  Google Scholar 

  7. Bodenreider, O.: The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32(suppl 1), D267–D270 (2004)

    Article  Google Scholar 

  8. Hernandez, P., et al.: Automated mapping of pharmacy orders from two electronic health record systems to RxNorm within the STRIDE clinical data warehouse. American Medical Informatics Association (2009)

    Google Scholar 

  9. Carlo, L., Chase, H.S., Weng, C.: Aligning structured and unstructured medical problems using umls. American Medical Informatics Association (2010)

    Google Scholar 

  10. Patel, C.O., Cimino, J.J.: Using semantic and structural properties of the unified medical language system to discover potential terminological relationships. J. Am. Med. Inform. Assoc. 16(3), 346–353%@ 1067–5027 (2009)

    Google Scholar 

  11. Gallagher, R.J., Lee, E.K.: Mixed integer programming optimization models for brachytherapy treatment planning. In: Proceedings of the AMIA Annual Fall Symposium. American Medical Informatics Association (1997)

    Google Scholar 

  12. Haffner, S.M.: Epidemiology of type 2 diabetes: risk factors. Diab. Care 21(Suppl. 3), C3–C6 (1998)

    Article  Google Scholar 

  13. Chan, J.M., et al.: Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diab. Care 17(9), 961–969 (1994)

    Article  Google Scholar 

  14. Estacio, R.O., et al.: Effect of blood pressure control on diabetic microvascular complications in patients with hypertension and type 2 diabetes. Diab. Care 23, B54 (2000)

    Google Scholar 

  15. Aronson, A.R.: Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proceedings of the AMIA Symposium. American Medical Informatics Association (2001)

    Google Scholar 

  16. Aronson, A.R.: Metamap: Mapping Text to the UMLS Metathesaurus. NLM, NIH, DHHS, Bethesda, pp. 1–26 (2006)

    Google Scholar 

  17. Aronson, A.R., Lang, F.-M.: An overview of MetaMap: historical perspective and recent advances. J. Am. Med. Inform. Assoc. 17(3), 229–236 (2010)

    Article  Google Scholar 

  18. National Center for Health Statistics (U.S.). ICD-9-CM: The International Classification of Diseases, 9th Revision, Clinical Modification. 1978: Commission on Professional and Hospital Activities

    Google Scholar 

  19. Caiado, J., Crato, N., Peña, D.: Comparison of times series with unequal length in the frequency domain. Commun. Stat. Simul. Comput.® 38(3), 527–540 (2009)

    Google Scholar 

  20. Lee, E.K.: Large-scale optimization-based classification models in medicine and biology. Ann. Biomed. Eng. 35(6), 1095–1109 (2007)

    Article  Google Scholar 

  21. Lee, E.K., et al.: A clinical decision tool for predicting patient care characteristics: patients returning within 72 hours in the emergency department. In: AMIA Annual Symposium Proceedings. American Medical Informatics Association (2012)

    Google Scholar 

  22. Brooks, J.P., Lee, E.K.: Solving a multigroup mixed-integer programming-based constrained discrimination model. INFORMS J. Comput. 26(3), 567–585 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  23. Brooks, J.P., Lee, E.K.: Analysis of the consistency of a mixed integer programming-based multi-category constrained discriminant model. Ann. Oper. Res. 174(1), 147–168 (2010)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgment

This paper receives the 2016 NSF Health Organization Transformation award (second place). The work is partially supported by a grant from the National Science Foundation IIP-1361532. Findings and conclusions in this paper are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eva K. Lee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Lee, E.K., Wang, Y., Hagen, M.S., Wei, X., Davis, R.A., Egan, B.M. (2016). Machine Learning: Multi-site Evidence-Based Best Practice Discovery. In: Pardalos, P., Conca, P., Giuffrida, G., Nicosia, G. (eds) Machine Learning, Optimization, and Big Data. MOD 2016. Lecture Notes in Computer Science(), vol 10122. Springer, Cham. https://doi.org/10.1007/978-3-319-51469-7_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-51469-7_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-51468-0

  • Online ISBN: 978-3-319-51469-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics