Abstract
In this paper, based on the real electronic medical record data of the hospital, a customized method of rule-based learning and information extraction is designed, and three steps are adopted to realize the extraction of Chinese information: sampling and labeling. The medical history information of 600 electronic medical records (including current medical history, past history, personal history, family history, etc.) were randomly selected, and the information needed to be extracted (taking diabetes history as an example) was marked by the labeling platform developed in this study. According to the annotation results, the extraction template is summarized, and the extraction template can be directly used to extract the regular expression extraction rules, and these rules can be used to extract the actual information. The method of manual verification and automatic verification is used to verify the effectiveness of the method. By using the method of natural language processing and rule-based information extraction, an algorithm for extracting customized information from unstructured Chinese electronic medical record text data is designed and implemented. Aiming at the extraction of diabetes history in the hospital, the field verification of a single department has achieved good results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Circular on the issuance of 57 health industry standards, such as the specification for shared documents of Electronic Medical Records Part 1: Summary of Medical Records, State Health and Family Planning Commission of the People’s Republic of China, Shanghai, September 2016
Feng, Z.: A Concise Course on Natural Language Processing. Shanghai Foreign Language Education Press (2012)
Cheng, X., Zhu, Q., Wang, J.: Principle and Application of Chinese Information Extraction. Science Press, Beijing (2010)
Meystre, S.M., et al.: Extracting information from textual documents in the electronic health record: a review of recent research. Yearb. Med. Inform. 35(6), 128 (2008)
Hirschberg, J., Manning, C.D.: Advances in natural language processing. Science 349(6245), 261 (2015)
Chang, Y.C., Manning, C.D., et al.: A hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries. J. Biomed. Inform. 46(Suppl.), 54–62 (2013)
Jindal, P., Manning, C.D.: Extraction of events and temporal expressions from clinical narratives. J. Biomed. Inform. 46(Suppl.), 13–19 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Niu, C., Zhao, X. (2020). Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record. In: Shen, H., Sang, Y. (eds) Parallel Architectures, Algorithms and Programming. PAAP 2019. Communications in Computer and Information Science, vol 1163. Springer, Singapore. https://doi.org/10.1007/978-981-15-2767-8_13
Download citation
DOI: https://doi.org/10.1007/978-981-15-2767-8_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2766-1
Online ISBN: 978-981-15-2767-8
eBook Packages: Computer ScienceComputer Science (R0)