A High-Precision Generality Method for Chinese Nested Named Entity Recognition

Ji, Xiayan; Chen, Lina; Gao, Hong; Shen, Fangyao; Guo, Hongjie

doi:10.1007/978-3-031-71470-2_24


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14999)

Included in the following conference series:

International Conference on Wireless Artificial Intelligent Computing Systems and Applications

Abstract

Chinese Nested Named Entity Recognition (CNNER) faces several challenges: the diversity of the Chinese language, the mixing of Chinese and English characters and symbols in text, long sentences that contain multiple entities, and the uneven distribution of named entity classes in real Chinese scenarios. To address these challenges, we propose CPMFA (Character Pair-based method with Multi-feature representation and Attention mechanism). CPMFA predicts predefined relations between character pairs and identifies nested named entities from these relations. First, the method combines the pre-trained language model LERT (Linguistically-motivated Bidirectional Encoder Representation from Transformer) with a BiLSTM (Bidirectional Long Short-Term Memory) network to generate comprehensive and accurate character embeddings. Second, it incorporates multi-feature representation to capture complex semantic information and introduces the Pyramid Squeeze Attention (PSA) module to emphasize key features. Finally, the PolyLoss function is integrated into model training to address the imbalanced distribution of entity classes. We validate the efficacy and adaptability of our method on the DiaKG, Yidu-S4K, and Weibo datasets, where CPMFA achieves F1 scores of 83.79%, 72.03%, and 70.39%, respectively. The experimental results show that CPMFA performs strongly in both the general-knowledge and Chinese medical domains.
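To make the role of PolyLoss concrete, the following is a minimal sketch of the Poly-1 form of the loss, which adds a term epsilon * (1 - p_t) to the standard cross-entropy, where p_t is the predicted probability of the true class. The abstract does not state which PolyLoss variant or coefficient CPMFA uses, so the Poly-1 choice, the function names, and the default `epsilon=1.0` are assumptions made for illustration only.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def poly1_cross_entropy(logits, target, epsilon=1.0):
    """Poly-1 loss for a single example: CE + epsilon * (1 - p_t).

    `epsilon` is a hypothetical default; the abstract does not report
    the value used by the authors.
    """
    p_t = softmax(logits)[target]      # probability assigned to the true class
    cross_entropy = -math.log(p_t)     # standard cross-entropy term
    return cross_entropy + epsilon * (1.0 - p_t)


# With uniform logits over four relation classes, p_t = 0.25.
uniform_loss = poly1_cross_entropy([0.0, 0.0, 0.0, 0.0], target=0)

# Setting epsilon to 0 recovers plain cross-entropy.
plain_ce = poly1_cross_entropy([0.0, 0.0, 0.0, 0.0], target=0, epsilon=0.0)
```

With a positive epsilon, examples whose true class receives low probability are penalized more heavily than under cross-entropy alone, which is one way such a loss can help with under-represented entity classes.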

This study was funded by the Key Project of the Regional Innovation and Development Joint Fund of the National Natural Science Foundation of China (Grant No. U22A2025).

Author information

Authors and Affiliations

College of Physics and Electronic Information Engineering, Zhejiang Normal University, Jinhua, China
Xiayan Ji
School of Computer Science and Technology, Zhejiang Normal University, Jinhua, China
Lina Chen, Hong Gao, Fangyao Shen & Hongjie Guo

Corresponding author

Correspondence to Lina Chen.

Editor information

Editors and Affiliations

Georgia State University, Atlanta, GA, USA
Zhipeng Cai
Old Dominion University, Norfolk, VA, USA
Daniel Takabi
Beijing University of Posts and Telecommunications, Beijing, China
Shaoyong Guo
Shandong University, Qingdao, China
Yifei Zou

About this paper

Cite this paper

Ji, X., Chen, L., Gao, H., Shen, F., Guo, H. (2025). A High-Precision Generality Method for Chinese Nested Named Entity Recognition. In: Cai, Z., Takabi, D., Guo, S., Zou, Y. (eds) Wireless Artificial Intelligent Computing Systems and Applications. WASA 2024. Lecture Notes in Computer Science, vol 14999. Springer, Cham. https://doi.org/10.1007/978-3-031-71470-2_24

DOI: https://doi.org/10.1007/978-3-031-71470-2_24
Published: 13 November 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-71469-6
Online ISBN: 978-3-031-71470-2
eBook Packages: Computer Science (R0)
