Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record

Niu, Chengzhi; Zhao, Xiaofan

doi:10.1007/978-981-15-2767-8_13

Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record

Chengzhi Niu⁸ &
Xiaofan Zhao⁹

Conference paper
First Online: 26 January 2020

1367 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1163))

Abstract

In this paper, based on the real electronic medical record data of the hospital, a customized method of rule-based learning and information extraction is designed, and three steps are adopted to realize the extraction of Chinese information: sampling and labeling. The medical history information of 600 electronic medical records (including current medical history, past history, personal history, family history, etc.) were randomly selected, and the information needed to be extracted (taking diabetes history as an example) was marked by the labeling platform developed in this study. According to the annotation results, the extraction template is summarized, and the extraction template can be directly used to extract the regular expression extraction rules, and these rules can be used to extract the actual information. The method of manual verification and automatic verification is used to verify the effectiveness of the method. By using the method of natural language processing and rule-based information extraction, an algorithm for extracting customized information from unstructured Chinese electronic medical record text data is designed and implemented. Aiming at the extraction of diabetes history in the hospital, the field verification of a single department has achieved good results.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Circular on the issuance of 57 health industry standards, such as the specification for shared documents of Electronic Medical Records Part 1: Summary of Medical Records, State Health and Family Planning Commission of the People’s Republic of China, Shanghai, September 2016
Google Scholar
Feng, Z.: A Concise Course on Natural Language Processing. Shanghai Foreign Language Education Press (2012)
Google Scholar
Cheng, X., Zhu, Q., Wang, J.: Principle and Application of Chinese Information Extraction. Science Press, Beijing (2010)
Google Scholar
Meystre, S.M., et al.: Extracting information from textual documents in the electronic health record: a review of recent research. Yearb. Med. Inform. 35(6), 128 (2008)
Google Scholar
Hirschberg, J., Manning, C.D.: Advances in natural language processing. Science 349(6245), 261 (2015)
Article MathSciNet Google Scholar
Chang, Y.C., Manning, C.D., et al.: A hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries. J. Biomed. Inform. 46(Suppl.), 54–62 (2013)
Article Google Scholar
Jindal, P., Manning, C.D.: Extraction of events and temporal expressions from clinical narratives. J. Biomed. Inform. 46(Suppl.), 13–19 (2013)
Article Google Scholar

Download references

Author information

Authors and Affiliations

The First Affiliated Hospital of Zhengzhou University, Beijing, China
Chengzhi Niu
People’s Public Security University of China, Zhengzhou, China
Xiaofan Zhao

Authors

Chengzhi Niu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofan Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chengzhi Niu .

Editor information

Editors and Affiliations

Sun Yat-sen University, Guangzhou, China
Hong Shen
Sun Yat-sen University, Guangzhou, China
Yingpeng Sang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Niu, C., Zhao, X. (2020). Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record. In: Shen, H., Sang, Y. (eds) Parallel Architectures, Algorithms and Programming. PAAP 2019. Communications in Computer and Information Science, vol 1163. Springer, Singapore. https://doi.org/10.1007/978-981-15-2767-8_13

Download citation

DOI: https://doi.org/10.1007/978-981-15-2767-8_13
Published: 26 January 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2766-1
Online ISBN: 978-981-15-2767-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics