Skip to main content

Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record

  • Conference paper
  • First Online:
  • 1367 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1163))

Abstract

In this paper, based on the real electronic medical record data of the hospital, a customized method of rule-based learning and information extraction is designed, and three steps are adopted to realize the extraction of Chinese information: sampling and labeling. The medical history information of 600 electronic medical records (including current medical history, past history, personal history, family history, etc.) were randomly selected, and the information needed to be extracted (taking diabetes history as an example) was marked by the labeling platform developed in this study. According to the annotation results, the extraction template is summarized, and the extraction template can be directly used to extract the regular expression extraction rules, and these rules can be used to extract the actual information. The method of manual verification and automatic verification is used to verify the effectiveness of the method. By using the method of natural language processing and rule-based information extraction, an algorithm for extracting customized information from unstructured Chinese electronic medical record text data is designed and implemented. Aiming at the extraction of diabetes history in the hospital, the field verification of a single department has achieved good results.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Circular on the issuance of 57 health industry standards, such as the specification for shared documents of Electronic Medical Records Part 1: Summary of Medical Records, State Health and Family Planning Commission of the People’s Republic of China, Shanghai, September 2016

    Google Scholar 

  2. Feng, Z.: A Concise Course on Natural Language Processing. Shanghai Foreign Language Education Press (2012)

    Google Scholar 

  3. Cheng, X., Zhu, Q., Wang, J.: Principle and Application of Chinese Information Extraction. Science Press, Beijing (2010)

    Google Scholar 

  4. Meystre, S.M., et al.: Extracting information from textual documents in the electronic health record: a review of recent research. Yearb. Med. Inform. 35(6), 128 (2008)

    Google Scholar 

  5. Hirschberg, J., Manning, C.D.: Advances in natural language processing. Science 349(6245), 261 (2015)

    Article  MathSciNet  Google Scholar 

  6. Chang, Y.C., Manning, C.D., et al.: A hybrid method of rule and machine learning for temporal relation extraction in patient discharge summaries. J. Biomed. Inform. 46(Suppl.), 54–62 (2013)

    Article  Google Scholar 

  7. Jindal, P., Manning, C.D.: Extraction of events and temporal expressions from clinical narratives. J. Biomed. Inform. 46(Suppl.), 13–19 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chengzhi Niu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Niu, C., Zhao, X. (2020). Study on the Method of Extracting Diabetes History from Unstructured Chinese Electronic Medical Record. In: Shen, H., Sang, Y. (eds) Parallel Architectures, Algorithms and Programming. PAAP 2019. Communications in Computer and Information Science, vol 1163. Springer, Singapore. https://doi.org/10.1007/978-981-15-2767-8_13

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-2767-8_13

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-2766-1

  • Online ISBN: 978-981-15-2767-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics