Abstract
The paper aims at two tasks of electronic medical record (EMR) processing: EMR retrieval and medical term extraction. The linguistic phenomena in EMRs in different departments are analyzed in depth including record size, vocabulary, entropy of medical languages, grammaticality, and so on. We explore various techniques of information retrieval for EMR retrieval, including five retrieval models with six pre-processing strategies on different parts of EMRs. The learning to rank algorithm is also adopted to improve the retrieval performance. Finally, our retrieval model is applied to extract medical terms from EMRs. Both coarse-grained relevance evaluation on department level and fine-grained relevance evaluation on treatment level are conducted.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Jensen, L.J., Saric, J., Bork, P.: Literature mining for the biologist: from information retrieval to biological discovery. Nature Reviews Genetics 7, 119–129 (2006)
Goth, G.: Analyzing medical data. Communications of the ACM 55(6), 13–15 (2012)
Heinze, D.T., Morsch, M.L., Holbrook, J.: Mining free-text medical records. In: AMIA Annual Symposium, pp. 254–258 (2001)
Ramos, P.: Acute myocardial infarction patient data to assess healthcare utilization and treatments. ProQuest, UMI Dissertation Publishing (2011)
Huang, H.-H., Lee, C.-C., Chen, H.-H.: Outpatient department recommendation based on medical summaries. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 518–527. Springer, Heidelberg (2012)
Hersh, W.: Information retrieval: A health and biomedical perspective, 3rd edn. Springer (2009)
Voorhees, E., Tong, R.: Overview of the TREC 2011 Medical Records Track. In: TREC (2011)
Voorhees, E., Hersh, W.: Overview of the TREC 2012 Medical Records Track. In: TREC (2012)
Koopman, B., Lawley, M., Bruza, P.: AEHRC & QUT at TREC 2011 Medical Track: A Concept-Based Information Retrieval. In: TREC (2011)
Dinh, D., Tamine, L.: IRIT at TREC 2011: Evaluation of Query Expansion Techniques for Medical Record Retrieval. In: TREC (2011)
Demner-Fushman, D., Abhyankar, S., Jimeno-Yepes, A., Loane, R., Rance, B., Lang, F., Ide, N., Apostolova, E., Aronson, A.R.: A Knowledge-Based Approach to Medical Records Retrieval. In: TREC (2011)
Shannon, C.E.: Prediction and entropy of printed English. Bell System Tech. J. 30(1), 50–64 (1950)
Grignetti, M.C.: A note on the entropy of words in printed English. Information and Control 7, 304–306 (1964)
Li, H.: A Short Introduction to Learning to Rank. IEICE Trans. Inf. & Syst. E-94D(10), 1–9 (2011)
Abacha, A.B., Zweigenbaum, P.: Medical entity recognition: a comparison of semantic and statistical methods. In: Workshop on Biomedical Natural Language Processing, pp. 56–64 (2011)
Chen, H.-B., Huang, H.-H., Chen, H.-H., Tan, C.-T.: A Simplification-Translation-Restoration Framework for Cross-Domain SMT Applications. In: 24th International Conference on Computational Linguistics, pp. 545–560 (2012)
Chen, H.-B., Huang, H.-H., Tjiu, J., Tan, C.-T., Chen, H.-H.: A statistical medical summary translation system. In: ACM SIGHIT International Health Informatics Symposium, pp. 101–110 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Huang, HH., Lee, CC., Chen, HH. (2014). Mining Professional Knowledge from Medical Records. In: Ślȩzak, D., Tan, AH., Peters, J.F., Schwabe, L. (eds) Brain Informatics and Health. BIH 2014. Lecture Notes in Computer Science(), vol 8609. Springer, Cham. https://doi.org/10.1007/978-3-319-09891-3_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-09891-3_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09890-6
Online ISBN: 978-3-319-09891-3
eBook Packages: Computer ScienceComputer Science (R0)