A comparative study of Chinese named entity recognition with different segment representations

Pan, Jun; Zhang, Chaohua; Wang, Haijun; Wu, Zongda

doi:10.1007/s10489-022-03274-0

A comparative study of Chinese named entity recognition with different segment representations

Published: 06 February 2022

Volume 52, pages 12457–12469, (2022)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Jun Pan¹,
Chaohua Zhang¹,
Haijun Wang² &
…
Zongda Wu³

903 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

Named entity recognition (NER) is a fundamental but crucial task in the field of natural language processing and has been widely studied. Nevertheless, little attention has been given to the segment representation (SR) schemes used to map multi-token entities into categories in Chinese NER. To address this issue, in this paper, we explore and compare the impact of using different SR schemes on Chinese NER. Our experiments are conducted on four benchmark Chinese NER datasets extended with labels to include seven well-known SR schemes: IO, IOB2, IOE2, IOBES, BI, IE, and BIES. Moreover, all seven SR schemes are investigated via two sets of classifiers: machine learning-based and neural network-based classifiers. The experimental results demonstrate that the proper selection of the best SR scheme is a complicated problem that depends on various factors, such as corpus size, corpus distribution, and the chosen classifier. We also provide a comparative analysis of the time consumption of each classifier in different SR schemes and discuss the impacts of using different SR schemes on NER in Chinese and other languages.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Information extraction from electronic medical documents: state of the art and future research directions

Article 08 November 2022

Pre-trained models for natural language processing: A survey

Article 15 September 2020

Foundation and large language models: fundamentals, challenges, opportunities, and social impacts

Article 27 November 2023

References

Goyal A, Gupta V, Kumar M (2018) Recent named entity recognition and classification schemes: a systematic review. Comput Sci Rev 29:21–43. https://doi.org/10.1016/j.cosrev.2018.06.001
Article Google Scholar
Liu J, Gao L, Guo S et al (2021) A hybrid deep-learning approach for complex biochemical named entity recognition. Knowl-Based Syst 221:106958. https://doi.org/10.1016/j.knosys.2021.106958
Article Google Scholar
Li J, Sun A, Han J et al (2020) A survey on deep learning for named entity recognition. IEEE Trans Knowl Data Eng 99:1. https://doi.org/10.1109/TKDE.2020.2981314
Article Google Scholar
Al-Moslmi T, Ocaña MG, Opdahl AL, Veres C (2020) Named entity extraction for knowledge graphs: a literature overview. IEEE Access 8:32862–32881. https://doi.org/10.1109/ACCESS.2020.2973928
Article Google Scholar
Diefenbach D, Lopez V, Singh K, Maret P (2018) Core techniques of question answering systems over knowledge bases: a survey. Knowl Inf Syst 55(3):529–569. https://doi.org/10.1007/s10115-017-1100-y
Article Google Scholar
Syachrul RMMAK, Bijaksana MA, Huda AF (2019) Person entity recognition for the Indonesian Qur’an translation with the approach hidden Markov model-viterbi. Proc Comp Sci 157:214–220. https://doi.org/10.1016/j.procs.2019.08.160
Article Google Scholar
Muhammad M, Rohaim M, Hamouda A, Abdel-Mageid S (2020) A comparison between conditional random field and structured support vector machine for Arabic named entity recognition. J Comput Sci 16(1):117–125. https://doi.org/10.1186/1758-2946-7-S1-S8
Article Google Scholar
Lin JCW, Shao Y, Zhang J, Yun U (2020) Enhanced sequence labeling based on latent variable conditional random fields. NEUROCOMPUTING 403:431–440. https://doi.org/10.1016/j.neucom.2020.04.102
Article Google Scholar
Sarıgül M, Ozyildirim BM, Avci M (2020) Differential convolutional neural network. Neural Netw 116:279–287. https://doi.org/10.1016/j.neunet.2019.04.025
Article Google Scholar
Sherstinsky A (2020) Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D 404:132306. https://doi.org/10.1016/j.physd.2019.132306
Article MathSciNet MATH Google Scholar
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. In: In: proceedings of the 31st international conference on neural information processing systems, NIPS’17. Curran Associates Inc, Red Hook, pp 6000–6010 https://dl.acm.org/doi/10.5555/3295222.3295349
Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444. https://doi.org/10.1038/nature14539
Article Google Scholar
Lin JCW, Shao Y, Djenouri Y, Yun U (2021) ASRNN: a recurrent neural network with an attention model for sequence labeling. Knowl-Based Syst 212:106548. https://doi.org/10.1016/j.knosys.2020.106548
Article Google Scholar
Liu Y, Che W, Qin B, Liu T (2020) Exploring segment representations for neural semi-markov conditional random fields. IEEE/ACM Trans Audio Speech Language Proc 20:813–824. https://doi.org/10.1109/TASLP.2020.2964960
Article Google Scholar
Alshammari N, Alanazi S (2020) The impact of using different annotation representations on named entity recognition. Egypt Inform J 22(3):295–302. https://doi.org/10.1016/j.eij.2020.10.004
Article Google Scholar
Qun N, Yan H, Qiu XP, Huang X (2020) Chinese word segmentation via BiLSTM+ semi-CRF with relay node. J Comput Sci 35(5):1115–1126. https://doi.org/10.1007/s11390-020-9576-4
Article Google Scholar
Cho HC, Okazaki N, Miwa M, Jet T (2013) Named entity recognition with multiple segment representations. Inf Process Manag 49(4):954–965. https://doi.org/10.1016/j.ipm.2013.03.002
Article Google Scholar
Konkol M, Konopík M (2015) Segment representations in named entity recognition. In: International conference on text, speech, and dialogue. Springer, Cham, pp 61–70. https://doi.org/10.1007/978-3-319-24033-6_7
Chapter Google Scholar
Luo L, Yang Z, Yang P et al (2018) An attention-based BiLSTM-CRF approach to document-level chemical named entity recognition. Bioinformatics 34(8):1381–1388. https://doi.org/10.1093/bioinformatics/btx761
Article Google Scholar
Devlin J, Chang M, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. In: In: proceedings of the 2019 conference of the north American chapter of the Association for Computational Linguistics: human language technologies, 1st edn. Long and Short Papers, Minneapolis, pp 4171–4186. https://doi.org/10.18653/v1/N19-1423
Chapter Google Scholar
Zhu Q, Li X, Conesa A, Pereira C (2018) GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text. BIOINFORMATICS 34(9):1547–1554. https://doi.org/10.1093/bioinformatics/btx815
Article Google Scholar
Catelli R, Gargiulo F, Casola V, Pietro GD, Esposito M (2020) Cross lingual named entity recognition for clinical de-identification applied to a COVID-19 Italian data set. Appl Soft Comput 97:106779. https://doi.org/10.1016/j.asoc.2020.106779
Article Google Scholar
Shibuya T, Hovy E (2020) Nested named entity recognition via second-best sequence learning and decoding. TACL 8:605–620. https://doi.org/10.1162/tacl_a_00334
Article Google Scholar
Ghaddar A, Langlais P, Rashid A, Rezagholizadeh M (2021) Context-aware adversarial training for name regularity bias in named entity recognition. TACL 9:586–604. https://doi.org/10.1162/tacl_a_00386
Article Google Scholar
Ratinov L, Dan R (2009) Design challenges and misconceptions in named entity recognition. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning, pp. 147–155. https://dl.acm.org/doi/10.5555/1596374.1596399
Tkachenko A, Petmanson T, Laur S (2013) Named entity recognition in estonian. In: In: proceedings of the 4th biennial international workshop on Balto–Slavic natural language processing. Association for Computational Linguistics, Sofia, pp 78–83
Google Scholar
Yang J, Liang S, Zhang Y (2018) Design challenges and misconceptions in neural sequence labeling. In: In: proceedings of the 27th international conference on computational linguistics. Association for Computational Linguistics, Santa Fe, pp 3879–3889
Google Scholar
Mozharova V, Loukachevitch N (2016) Two-stage approach in Russian named entity recognition. In: In: 2016 international FRUCT conference on intelligence, social media and web (ISMW FRUCT). IEEE, St. Petersburg, pp 1–6. https://doi.org/10.1109/fruct.2016.7584769
Chapter Google Scholar
Keretna S, Lim CP, Creighton D, Shaban KB (2015) Enhancing medical named entity recognition with an extended segment representation technique. Comput Methods Prog Biomed 119(2):88–100. https://doi.org/10.1016/j.cmpb.2015.02.007
Article Google Scholar
He Z, Liu J, Dang K et al (2020) Leveraging maximum entropy and correlation on latent factors for learning representations. Neural Netw 131:312–323. https://doi.org/10.1016/j.neunet.2020.07.027
Article MATH Google Scholar
Shashirekha HL, Nayel HA (2016) A comparative study of segment representation for biomedical named entity recognition. In: In: 2016 international conference on advances in computing, communications and informatics (ICACCI). IEEE, Jaipur, pp 1046–1052. https://doi.org/10.1109/icacci.2016.7732182
Chapter Google Scholar
Malik MK, Sarwar SM (2016) Named entity recognition system for postpositional languages: Urdu as a case study. IJACSA 7(10):141–147. https://doi.org/10.14569/IJACSA.2016.071019
Article Google Scholar
Reimers N, Gurevych I (2017) Optimal hyperparameters for deep lstm-networks for sequence labeling tasks. arXiv: 1707.06799
Patil N, Patil A, Pawar BV (2020) Named entity recognition using conditional random fields. Proc Comp Sci 167:1181–1188. https://doi.org/10.1016/j.procs.2020.03.431
Article Google Scholar
Levow GA (2006) The third international Chinese language processing bakeoff: word segmentation and named entity recognition. In: In: proceedings of the fifth SIGHAN workshop on Chinese language processing. Association for Computational Linguistics, Sydney, pp 108–117
Google Scholar
Weischedel R, Palmer M, Marcus M et al (2011) Ontonotes release 4.0. LDC2011T03, Philadelphia, Penn.: Linguistic Data Consortium. https://doi.org/10.35111/gfjf-7r50
Zhang Y, Yang J (2018) Chinese NER Using Lattice LSTM. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Melbourne, Australia, pp 1554–1564. Association for Computational Linguistics https://doi.org/10.18653/v1/P18-1144
Peng N, Dredze M (2015) Named entity recognition for Chinese social media with jointly trained Embeddings. In: In: proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, pp 548–554. https://doi.org/10.18653/v1/D15-1064
Chapter Google Scholar
Che W, Wang M, Manning CD, Liu T (2013) Named entity recognition with bilingual constraints. In: In: proceedings of the 2013 conference of the north American chapter of the Association for Computational Linguistics: human language technologies. Association for Computational Linguistics, Atlanta, pp 52–62
Google Scholar
Akkasi A, Varoğlu E, Dimililer N (2018) Balanced undersampling: a novel sentence-based undersampling method to improve recognition of named entities in chemical and biomedical text. Appl Intell 48(8):1965–1978. https://doi.org/10.1007/s10489-017-0920-5
Article Google Scholar
Liang Y, He F, Zeng X (2020) 3D mesh simplification with feature preservation based on whale optimization algorithm and differential evolution. Integr Comput-Aid E 27(4):417–435. https://doi.org/10.3233/ICA-200641
Article Google Scholar
Chen Y, He F, Li H, Zhang D, Wu Y (2020) A full migration BBO algorithm with enhanced population quality bounds for multimodal biomedical image registration. Appl Soft Comput 93:106335. https://doi.org/10.1016/j.asoc.2020.106335
Article Google Scholar
Zhang S, He F (2020) DRCDN: learning deep residual convolutional dehazing networks. Vis Comput 36(9):1797–1808. https://doi.org/10.1007/s00371-019-01774-8
Article Google Scholar
Yang Y, He F, Han S, Liang Y, Cheng Y (2021) A novel attribute-based encryption approach with integrity verification for CAD assembly models. ENGINEERING-PRC 7(6):787–797. https://doi.org/10.1016/j.eng.2021.03.011
Article Google Scholar

Download references

Acknowledgments

This work is supported by the Zhejiang Public Welfare Technology Application Research Project of China (grant: LGN21F020003), National Natural Science Foundation of China (grant: 12001489), the key project of Humanities and Social Sciences in Colleges and Universities of Zhejiang Province (No 2021GH017), Humanities and Social Sciences Project of the Ministry of Education of China (No 21YJA870011) and Zhejiang Youth Project of Zhejiang Philosophy and Social Sciences Planning (No 22ZJQN45YB).

Author information

Authors and Affiliations

Laboratory of Artificial Intelligence, School of Science, Zhejiang University of Science and Technology, Hangzhou, 310023, Zhejiang, China
Jun Pan & Chaohua Zhang
School of Electronic and Information Engineering, Taizhou University, Taizhou, 318000, Zhejiang, China
Haijun Wang
Department of Computer Science and Engineering, Shaoxing University, Shaoxing, 312000, Zhejiang, China
Zongda Wu

Authors

Jun Pan
View author publications
You can also search for this author in PubMed Google Scholar
Chaohua Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Haijun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zongda Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zongda Wu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pan, J., Zhang, C., Wang, H. et al. A comparative study of Chinese named entity recognition with different segment representations. Appl Intell 52, 12457–12469 (2022). https://doi.org/10.1007/s10489-022-03274-0

Download citation

Accepted: 18 January 2022
Published: 06 February 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s10489-022-03274-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A comparative study of Chinese named entity recognition with different segment representations

Abstract

Access this article

Similar content being viewed by others

Information extraction from electronic medical documents: state of the art and future research directions

Pre-trained models for natural language processing: A survey

Foundation and large language models: fundamentals, challenges, opportunities, and social impacts

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A comparative study of Chinese named entity recognition with different segment representations

Abstract

Access this article

Similar content being viewed by others

Information extraction from electronic medical documents: state of the art and future research directions

Pre-trained models for natural language processing: A survey

Foundation and large language models: fundamentals, challenges, opportunities, and social impacts

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation