Overview of CCKS 2018 Task 1: Named Entity Recognition in Chinese Electronic Medical Records

Zhang, Jiangtao; Li, Juanzi; Jiao, Zengtao; Yan, Jun

doi:10.1007/978-981-15-1956-7_14

Conference paper
First Online: 03 January 2020

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1134)

Abstract

CCKS 2018 presented a named entity recognition (NER) task focusing on Chinese electronic medical records (EMRs). The Knowledge Engineering Group of Tsinghua University and Yidu Cloud Beijing Technology Co., Ltd. provided an annotated dataset for this task, which is the only publicly available dataset in the field of Chinese EMRs. Using this dataset, 69 systems were developed for the task. The submitted systems showed that traditional CRF and Bi-LSTM models were the most popular choices for the task. The best-performing system combined a CRF or Bi-LSTM model with complex feature engineering, indicating that feature engineering is still indispensable. The results also showed that performance on the task could be improved by adding rule-based systems for identifying clinical named entities.

Notes

1. The annotated dataset has not been deposited in a public repository but is available to the research community under data use agreements from the corresponding author on request.

Author information

Authors and Affiliations

The 305 Hospital of PLA, Beijing, 100017, China
Jiangtao Zhang
Tsinghua University, Beijing, 100084, China
Jiangtao Zhang & Juanzi Li
Yidu Cloud Beijing Technology Co., Ltd., Beijing, 100191, China
Zengtao Jiao & Jun Yan

Corresponding author

Correspondence to Jiangtao Zhang.

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Beijing, China
Xiaoyan Zhu
Harbin Institute of Technology, Harbin, China
Bing Qin
Queen's University, Kingston, Canada
Xiaodan Zhu
Harbin Institute of Technology, Harbin, China
Ming Liu
Soochow University, Soochow, China
Longhua Qian

About this paper

Cite this paper

Zhang, J., Li, J., Jiao, Z., Yan, J. (2019). Overview of CCKS 2018 Task 1: Named Entity Recognition in Chinese Electronic Medical Records. In: Zhu, X., Qin, B., Zhu, X., Liu, M., Qian, L. (eds) Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding. CCKS 2019. Communications in Computer and Information Science, vol 1134. Springer, Singapore. https://doi.org/10.1007/978-981-15-1956-7_14

DOI: https://doi.org/10.1007/978-981-15-1956-7_14
Published: 03 January 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1955-0
Online ISBN: 978-981-15-1956-7
eBook Packages: Computer Science, Computer Science (R0)
