Skip to main content

Overview of CCKS 2018 Task 1: Named Entity Recognition in Chinese Electronic Medical Records

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1134))

Abstract

The CCKS 2018 presented a named entity recognition (NER) task focusing on Chinese electronic medical records (EMR). The Knowledge Engineering Group of Tsinghua University and Yidu Cloud Beijing Technology Co., Ltd. provided an annotated dataset for this task, which is the only publicly available dataset in the field of Chinese EMR. Using this dataset, 69 systems were developed for the task. The performance of the systems showed that the traditional CRF and Bi-LSTM model were the most popular models for the task. The system achieved the highest performance by combining CRF or Bi-LSTM model with complex feature engineering, indicating that feature engineering is still indispensable. These results also showed that the performance of the task could be augmented with rule-based systems to determine clinical named entities.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The annotated dataset has not been deposited in a public repository but is available to the research community under data use agreements from the corresponding author on request.

References

  1. de Bruijn, B., Cherry, C., Kiritchenko, S., Martin, J., Zhu, X.: Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. J. Am. Med. Inf. Assoc. 18(5), 557 (2011)

    Article  Google Scholar 

  2. Jiang, M., et al.: A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. JAMIA 18, 601–606 (2011)

    Google Scholar 

  3. Kundeti, S.R., Vijayananda, J., Mujjiga, S., Kalyan, M.: Clinical named entity recognition: challenges and opportunities. In: IEEE International Conference on Big Data, pp. 1937–1945 (2016)

    Google Scholar 

  4. Luo, L., Li, N., Li, S.S., Yang, Z., Lin, H.: Dutir at the ccks-2018 task1: a neural network ensemble approach for chinese clinical named entity recognition. In: CCKS Tasks (2018)

    Google Scholar 

  5. Meystre, S.M., Savova, G.K., Kipper-Schuler, K.C., Hurdle, J.F.: Extracting information from textual documents in the electronic health record: a review of recent research. In: Yearbook of Medical Informatics, pp. 128–144, January 2008

    Google Scholar 

  6. Pradhan, S., Elhadad, N., Chapman, W.W., Manandhar, S., Savova, G.: Semeval-2014 task 7: analysis of clinical text. In: SemEval@COLING, pp. 54–62 (2014)

    Google Scholar 

  7. Qiu, W., Chen, M., Ding, R., Xie, P.: Heiheihahei at ccks clinical entity recognition task: a neural-based ensemble approach. In: CCKS Tasks (2018)

    Google Scholar 

  8. Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: CoNLL, June 2009

    Google Scholar 

  9. Settles, B.: Biomedical named entity recognition using conditional random fields and rich feature sets. In: JNLPBA, pp. 104–107 (2004)

    Google Scholar 

  10. Suominen, H., et al.: Overview of the ShARe/CLEF eHealth evaluation lab 2013. In: Forner, P., Müller, H., Paredes, R., Rosso, P., Stein, B. (eds.) CLEF 2013. LNCS, vol. 8138, pp. 212–231. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40802-1_24

    Chapter  Google Scholar 

  11. Uzuner, O., South, B.R., Shen, S., DuVall, S.L.: 2010 i2b2/va challenge on concepts, assertions, and relations in clinical text. J. Am. Med. Inf. Assoc. 18(5), 552 (2011)

    Article  Google Scholar 

  12. Yang, X., Huang, W.: A conditional random fields approach to clinical name entity recognition. In: CCKS Tasks (2018)

    Google Scholar 

  13. Zhang, J., et al.: Category multi-representation: a unified solution for named entity recognition in clinical texts. In: PAKDD, pp. 275–287 (2018)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiangtao Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, J., Li, J., Jiao, Z., Yan, J. (2019). Overview of CCKS 2018 Task 1: Named Entity Recognition in Chinese Electronic Medical Records. In: Zhu, X., Qin, B., Zhu, X., Liu, M., Qian, L. (eds) Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding. CCKS 2019. Communications in Computer and Information Science, vol 1134. Springer, Singapore. https://doi.org/10.1007/978-981-15-1956-7_14

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-1956-7_14

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-1955-0

  • Online ISBN: 978-981-15-1956-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics