skip to main content
10.1145/3364908.3365290acmotherconferencesArticle/Chapter ViewAbstractPublication PagessspsConference Proceedingsconference-collections
research-article

Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks

Published:20 September 2019Publication History

ABSTRACT

In the field of medicine, the Chinese electronic medical records are complicated and professional, so the extraction of the knowledge on them is a challenging task. In this paper, a new annotation collection of electronic Chinese medical records is proposed, which aims at reducing the complexity of electronic medical record analysis through the universal annotations of the electronic medical records. Meanwhile, the Word2vec+Char2vec+BiLSTM+CRF (WCBC) model is proposed for the annotations of the electronic medical recordings. To verify the validity of the proposed method, we annotated 20000 Chinese electronic medical records manually under the support of medical experts. On this dataset, we compare the WCBC model with the state-of-the-art models such as Hidden Markov Model (HMM) and Support Vector Machine (SVM) based methods. Experiments show that the comprehensive annotation accuracy of our method is up to 85%, which outperforms other methods obviously.

References

  1. I. K. Ampomah, S.-B. Park, and S.-J. Lee. A sentence-to-sentence relation network for recognizing textual entailment. World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering, 10(12):2060--2063, 2016.Google ScholarGoogle Scholar
  2. O.Bodenreider.Theunifiedmedicallanguagesystem(umls): int egrating biomedical terminology. Nucleic acids research, 32(suppl 1):D267-- D270, 2004.Google ScholarGoogle Scholar
  3. F. Chang, J. Guo, W. Xu, and S. R. Chung. Application of word embeddings in biomedical named entity recognition tasks. Journal of Digital Information Management, 13(5), 2015.Google ScholarGoogle Scholar
  4. C. Friedman and S. B. Johnson. Natural language and text processing in biomedicine. In Biomedical Informatics, pages 312--343. Springer, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  5. B. Gann. Giving patients choice and control: health informatics on the patient journey. Yearbook of medical informatics, 7:70--3, 2012.Google ScholarGoogle Scholar
  6. J. Groopman. How doctors think. Houghton Mifflin Harcourt, 2008.Google ScholarGoogle Scholar
  7. Z.Huang, W.Xu, and K.Yu.Bidirectionallstm-crfmodelsforsequence tagging. arXiv preprint arXiv:1508.01991, 2015.Google ScholarGoogle Scholar
  8. V. Jatav, R. Teja, S. Bharadwaj, and V. Srinivasan. Improving part- of-speech tagging for nlp pipelines. arXiv preprint arXiv:1708.00241, 2017.Google ScholarGoogle Scholar
  9. X. Kong, Y. Li, H. Li, and X. Lu. Structuralization of digestive endoscopy report based on nlp. Zhongguo yi liao qi xie za zhi= Chinese journal of medical instrumentation, 32(5):348--351, 2008.Google ScholarGoogle Scholar
  10. G.Lample, M.Ballesteros, S.Subramanian, K.Kawakami, and C. Dyer. Neural architectures for named entity recognition. arXiv preprint arXiv:1603.01360, 2016.Google ScholarGoogle Scholar
  11. L. Li, L. Jin, Y. Jiang, and D. Huang. Recognizing biomedical named entities based on the sentence vector/twin word embeddings conditioned bidirectional lstm. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, pages 165--176. Springer, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  12. P. Liu, X. Wang, X. Sun, X. Shen, X. Chen, Y. Sun, and Y. Pan. Hkdp: A hybrid knowledge graph based pediatric disease prediction system. In International Conference on Smart Health, pages 78--90. Springer, 2016.Google ScholarGoogle Scholar
  13. S. M. Meystre and P. J. Haug. Comparing natural language processing tools to extract medical problems from narrative text. In AMIA annual symposium proceedings, volume 2005, page 525. American Medical Informatics Association, 2005.Google ScholarGoogle Scholar
  14. T. Nakagawa, T. Kudo, and Y. Matsumoto. Unknown word guessing and part-of-speech tagging using support vector machines. In NLPRS, pages 325--331. Citeseer, 2001.Google ScholarGoogle Scholar
  15. J. Paparrizos, R. W. White, and E. Horvitz. Screening for pancreatic adenocarcinoma using signals from web search logs: Feasibility study and results. Journal of Oncology Practice, 12(8):737--744, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Rotmensch, Y. Halpern, A. Tlimat, S. Horng, and D. Sontag. Learning a health knowledge graph from electronic medical records. Scientific reports, 7(1):5994, 2017.Google ScholarGoogle ScholarCross RefCross Ref
  17. D. S. Sachan and Petuum. Revisiting lstm networks for semi-supervised text classification via mixed objective function. KDD-18 Deep Learning Day, 2018.Google ScholarGoogle Scholar
  18. E. H. Shortliffe. Mycin: Computer-based medical consultations, 1976.Google ScholarGoogle Scholar
  19. H. Tang and J. H. K. Ng. Googling for a diagnosisa use of google as a diagnostic aid: internet based study. Bmj, 333(7579): 1143--1145, 2006. [20] S. M. Thede and M. P. Harper. A second-order hidden markov model for part-of-speech tagging. In Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pages 175--182. Association for Computational Linguistics, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  20. R. W. White and E. Horvitz. Cyberchondria: studies of the escalation of medical concerns in web search. ACM Transactions on Information Systems (TOIS), 27(4):23, 2009.Google ScholarGoogle Scholar

Index Terms

  1. Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing Systems
      September 2019
      188 pages
      ISBN:9781450362412
      DOI:10.1145/3364908

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 September 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader