Skip to main content

Mining Professional Knowledge from Medical Records

  • Conference paper
Brain Informatics and Health (BIH 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8609))

Included in the following conference series:

Abstract

The paper aims at two tasks of electronic medical record (EMR) processing: EMR retrieval and medical term extraction. The linguistic phenomena in EMRs in different departments are analyzed in depth including record size, vocabulary, entropy of medical languages, grammaticality, and so on. We explore various techniques of information retrieval for EMR retrieval, including five retrieval models with six pre-processing strategies on different parts of EMRs. The learning to rank algorithm is also adopted to improve the retrieval performance. Finally, our retrieval model is applied to extract medical terms from EMRs. Both coarse-grained relevance evaluation on department level and fine-grained relevance evaluation on treatment level are conducted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jensen, L.J., Saric, J., Bork, P.: Literature mining for the biologist: from information retrieval to biological discovery. Nature Reviews Genetics 7, 119–129 (2006)

    Article  Google Scholar 

  2. Goth, G.: Analyzing medical data. Communications of the ACM 55(6), 13–15 (2012)

    Article  Google Scholar 

  3. Heinze, D.T., Morsch, M.L., Holbrook, J.: Mining free-text medical records. In: AMIA Annual Symposium, pp. 254–258 (2001)

    Google Scholar 

  4. Ramos, P.: Acute myocardial infarction patient data to assess healthcare utilization and treatments. ProQuest, UMI Dissertation Publishing (2011)

    Google Scholar 

  5. Huang, H.-H., Lee, C.-C., Chen, H.-H.: Outpatient department recommendation based on medical summaries. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 518–527. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  6. Hersh, W.: Information retrieval: A health and biomedical perspective, 3rd edn. Springer (2009)

    Google Scholar 

  7. Voorhees, E., Tong, R.: Overview of the TREC 2011 Medical Records Track. In: TREC (2011)

    Google Scholar 

  8. Voorhees, E., Hersh, W.: Overview of the TREC 2012 Medical Records Track. In: TREC (2012)

    Google Scholar 

  9. Koopman, B., Lawley, M., Bruza, P.: AEHRC & QUT at TREC 2011 Medical Track: A Concept-Based Information Retrieval. In: TREC (2011)

    Google Scholar 

  10. Dinh, D., Tamine, L.: IRIT at TREC 2011: Evaluation of Query Expansion Techniques for Medical Record Retrieval. In: TREC (2011)

    Google Scholar 

  11. Demner-Fushman, D., Abhyankar, S., Jimeno-Yepes, A., Loane, R., Rance, B., Lang, F., Ide, N., Apostolova, E., Aronson, A.R.: A Knowledge-Based Approach to Medical Records Retrieval. In: TREC (2011)

    Google Scholar 

  12. Shannon, C.E.: Prediction and entropy of printed English. Bell System Tech. J. 30(1), 50–64 (1950)

    Article  MathSciNet  Google Scholar 

  13. Grignetti, M.C.: A note on the entropy of words in printed English. Information and Control 7, 304–306 (1964)

    Article  MATH  MathSciNet  Google Scholar 

  14. Li, H.: A Short Introduction to Learning to Rank. IEICE Trans. Inf. & Syst. E-94D(10), 1–9 (2011)

    Google Scholar 

  15. Abacha, A.B., Zweigenbaum, P.: Medical entity recognition: a comparison of semantic and statistical methods. In: Workshop on Biomedical Natural Language Processing, pp. 56–64 (2011)

    Google Scholar 

  16. Chen, H.-B., Huang, H.-H., Chen, H.-H., Tan, C.-T.: A Simplification-Translation-Restoration Framework for Cross-Domain SMT Applications. In: 24th International Conference on Computational Linguistics, pp. 545–560 (2012)

    Google Scholar 

  17. Chen, H.-B., Huang, H.-H., Tjiu, J., Tan, C.-T., Chen, H.-H.: A statistical medical summary translation system. In: ACM SIGHIT International Health Informatics Symposium, pp. 101–110 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Huang, HH., Lee, CC., Chen, HH. (2014). Mining Professional Knowledge from Medical Records. In: Ślȩzak, D., Tan, AH., Peters, J.F., Schwabe, L. (eds) Brain Informatics and Health. BIH 2014. Lecture Notes in Computer Science(), vol 8609. Springer, Cham. https://doi.org/10.1007/978-3-319-09891-3_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09891-3_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09890-6

  • Online ISBN: 978-3-319-09891-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics