research-article

Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks

Authors:
Haiming Zhou

SSM, Daxing Industrial Development Zone, Beijing, China

SSM, Daxing Industrial Development Zone, Beijing, China
View Profile

,
Wenhai Guo

SSM, Daxing Industrial Development Zone, Beijing, China

SSM, Daxing Industrial Development Zone, Beijing, China
View Profile

,
Dengfeng Ke

CAISA, Beijing, China

CAISA, Beijing, China
View Profile

,
Ning Liu

AI Doctor Co. Ltd., Beijing, China

AI Doctor Co. Ltd., Beijing, China
View Profile

,
Xiufang Zhao

AI Doctor Co. Ltd., Beijing, China

AI Doctor Co. Ltd., Beijing, China
View Profile

,
Changjin Li

SSM, Beijing, China

SSM, Beijing, China
View Profile

SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing SystemsSeptember 2019Pages 131–135https://doi.org/10.1145/3364908.3365290

Published:20 September 2019Publication History

SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing Systems

Pages 131–135

ABSTRACT

In the field of medicine, the Chinese electronic medical records are complicated and professional, so the extraction of the knowledge on them is a challenging task. In this paper, a new annotation collection of electronic Chinese medical records is proposed, which aims at reducing the complexity of electronic medical record analysis through the universal annotations of the electronic medical records. Meanwhile, the Word2vec+Char2vec+BiLSTM+CRF (WCBC) model is proposed for the annotations of the electronic medical recordings. To verify the validity of the proposed method, we annotated 20000 Chinese electronic medical records manually under the support of medical experts. On this dataset, we compare the WCBC model with the state-of-the-art models such as Hidden Markov Model (HMM) and Support Vector Machine (SVM) based methods. Experiments show that the comprehensive annotation accuracy of our method is up to 85%, which outperforms other methods obviously.

References

I. K. Ampomah, S.-B. Park, and S.-J. Lee. A sentence-to-sentence relation network for recognizing textual entailment. World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering, 10(12):2060--2063, 2016.Google Scholar
O.Bodenreider.Theunifiedmedicallanguagesystem(umls): int egrating biomedical terminology. Nucleic acids research, 32(suppl 1):D267-- D270, 2004.Google Scholar
F. Chang, J. Guo, W. Xu, and S. R. Chung. Application of word embeddings in biomedical named entity recognition tasks. Journal of Digital Information Management, 13(5), 2015.Google Scholar
C. Friedman and S. B. Johnson. Natural language and text processing in biomedicine. In Biomedical Informatics, pages 312--343. Springer, 2006.Google ScholarCross Ref
B. Gann. Giving patients choice and control: health informatics on the patient journey. Yearbook of medical informatics, 7:70--3, 2012.Google Scholar
J. Groopman. How doctors think. Houghton Mifflin Harcourt, 2008.Google Scholar
Z.Huang, W.Xu, and K.Yu.Bidirectionallstm-crfmodelsforsequence tagging. arXiv preprint arXiv:1508.01991, 2015.Google Scholar
V. Jatav, R. Teja, S. Bharadwaj, and V. Srinivasan. Improving part- of-speech tagging for nlp pipelines. arXiv preprint arXiv:1708.00241, 2017.Google Scholar
X. Kong, Y. Li, H. Li, and X. Lu. Structuralization of digestive endoscopy report based on nlp. Zhongguo yi liao qi xie za zhi= Chinese journal of medical instrumentation, 32(5):348--351, 2008.Google Scholar
G.Lample, M.Ballesteros, S.Subramanian, K.Kawakami, and C. Dyer. Neural architectures for named entity recognition. arXiv preprint arXiv:1603.01360, 2016.Google Scholar
L. Li, L. Jin, Y. Jiang, and D. Huang. Recognizing biomedical named entities based on the sentence vector/twin word embeddings conditioned bidirectional lstm. In Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, pages 165--176. Springer, 2016.Google ScholarCross Ref
P. Liu, X. Wang, X. Sun, X. Shen, X. Chen, Y. Sun, and Y. Pan. Hkdp: A hybrid knowledge graph based pediatric disease prediction system. In International Conference on Smart Health, pages 78--90. Springer, 2016.Google Scholar
S. M. Meystre and P. J. Haug. Comparing natural language processing tools to extract medical problems from narrative text. In AMIA annual symposium proceedings, volume 2005, page 525. American Medical Informatics Association, 2005.Google Scholar
T. Nakagawa, T. Kudo, and Y. Matsumoto. Unknown word guessing and part-of-speech tagging using support vector machines. In NLPRS, pages 325--331. Citeseer, 2001.Google Scholar
J. Paparrizos, R. W. White, and E. Horvitz. Screening for pancreatic adenocarcinoma using signals from web search logs: Feasibility study and results. Journal of Oncology Practice, 12(8):737--744, 2016.Google ScholarCross Ref
M. Rotmensch, Y. Halpern, A. Tlimat, S. Horng, and D. Sontag. Learning a health knowledge graph from electronic medical records. Scientific reports, 7(1):5994, 2017.Google ScholarCross Ref
D. S. Sachan and Petuum. Revisiting lstm networks for semi-supervised text classification via mixed objective function. KDD-18 Deep Learning Day, 2018.Google Scholar
E. H. Shortliffe. Mycin: Computer-based medical consultations, 1976.Google Scholar
H. Tang and J. H. K. Ng. Googling for a diagnosisa use of google as a diagnostic aid: internet based study. Bmj, 333(7579): 1143--1145, 2006. [20] S. M. Thede and M. P. Harper. A second-order hidden markov model for part-of-speech tagging. In Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pages 175--182. Association for Computational Linguistics, 1999.Google ScholarCross Ref
R. W. White and E. Horvitz. Cyberchondria: studies of the escalation of medical concerns in web search. ACM Transactions on Information Systems (TOIS), 27(4):23, 2009.Google Scholar

Index Terms

Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

A BERT-Based Named Entity Recognition in Chinese Electronic Medical Record
ICCPR '20: Proceedings of the 2020 9th International Conference on Computing and Pattern Recognition

Named entity recognition, aiming at identifying and classifying named entity mentioned in the structured or unstructured text, is a fundamental subtask for information extraction in natural language processing (NLP). With the development of electronic ...
Read More
A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition
ACAI '18: Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence

With the application of electronic medical records in medical field, more and more people are paying attention to how to use these data efficiently. In this paper, the BiLSTM-CRF model is applied to Chinese electronic medical records to recognize ...
Read More
Semantic-Based Exchanger for Electronic Medical Record
ICCIT '08: Proceedings of the 2008 Third International Conference on Convergence and Hybrid Information Technology - Volume 01

Considering the importance of the patient's medical information for the caregivers to ensure that patients receive appropriate and safe treatment, especially the emergency room (ER) patients, thus, sharing distributed medical information among ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing Systems
September 2019
188 pages
ISBN:9781450362412
DOI:10.1145/3364908

Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 20 September 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Automatic Annotation
BiLSTM
Chinese Medical Record
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 68
  Total Downloads
- Downloads (Last 12 months)4
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks

SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

A BERT-Based Named Entity Recognition in Chinese Electronic Medical Record

A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition

Semantic-Based Exchanger for Electronic Medical Record

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Annotations of Chinese Electronic Medical Record using BiLSTM-CRF based Networks

SSPS '19: Proceedings of the 2019 International Symposium on Signal Processing Systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

A BERT-Based Named Entity Recognition in Chinese Electronic Medical Record

A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition

Semantic-Based Exchanger for Electronic Medical Record

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media