Abstract
Chinese Named Entity Recognition (CNNER) faces numerous challenges, including the diversity of the Chinese language, the complex representation of mixed Chinese and English characters and symbols in texts, the complexity of the Chinese language itself with long sentences containing multiple entities, and the uneven distribution of named entity classes in actual Chinese scenarios. To address these challenges, we propose a method called CPMFA (Character Pair-based method with Multi-feature representation and Attention mechanism). The CPMFA method predicts predetermined relations between character pairs, facilitating the identification of nested named entities based on these relations. Firstly, the method leverages the pre-trained language model LERT (Linguistically-motivated Bidirectional Encoder Representation from Transformer) and BiLSTM (Bidirectional Long Short-Term Memory) to generate comprehensive and accurate character embeddings. Secondly, it incorporates multi-feature representation to capture complex semantic information and introduces the Pyramid Squeeze Attention (PSA) module to emphasize key features. Finally, the PolyLoss function is integrated into the model training process to tackle the challenge of an imbalanced distribution of entity classes. We employed the DiaKG, Yidu-S4K and Weibo datasets to validate and evaluate the efficacy and adaptability of our method. The F1 obtained by the CPMFA on these three datasets is 83.79%, 72.03%, and 70.39%, in that order. The experimental results illustrate the outstanding performance of the proposed CPMFA method in both general knowledge and Chinese medical domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cai, Z., Xiong, Z., Xu, H., Wang, P., Li, W., Pan, Y.: Generative adversarial networks: a survey toward private and secure applications. ACM Comput. Surv. (CSUR) 54(6), 1–38 (2021)
Chang, D., et al.: DiaKG: an annotated diabetes dataset for medical knowledge graph construction. In: Qin, B., Jin, Z., Wang, H., Pan, J., Liu, Y., An, B. (eds.) CCKS 2021. CCIS, vol. 1466, pp. 308–314. Springer, Singapore (2021). https://doi.org/10.1007/978-981-16-6471-7_26
Cui, S., Joe, I.: A multi-head adjacent attention-based pyramid layered model for nested named entity recognition. Neural Comput. Appl. 35(3), 2561–2574 (2023)
De, S., Bermudez-Edo, M., Xu, H., Cai, Z.: Deep generative models in the industrial internet of things: a survey. IEEE Trans. Industr. Inf. 18(9), 5728–5737 (2022)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Han, X., et al.: Overview of the CCKS 2019 knowledge graph evaluation track: entity, relation, event and QA. arXiv preprint arXiv:2003.03875 (2020)
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015)
Islam, T., Zinat, S.M., Sukhi, S., Mridha, M.F.: A comprehensive study on attention-based NER. In: Khanna, A., Gupta, D., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds.) International Conference on Innovative Computing and Communications: Proceedings of ICICC 2021, Volume 2, pp. 665–681. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-2597-8_57
Ji, X., Chen, L., Shen, F., Guo, H., Gao, H.: CPMFA: a character pair-based method for Chinese nested named entity recognition. In: Yang, X., et al. (eds.) Advanced Data Mining and Applications, pp. 200–212. Springer Nature Switzerland, Cham (2023). https://doi.org/10.1007/978-3-031-46661-8_14
Leng, Z., et al.: Polyloss: a polynomial expansion perspective of classification loss functions. arXiv preprint arXiv:2204.12511 (2022)
Li, F., Lin, Z., Zhang, M., Ji, D.: A span-based model for joint overlapped and discontinuous named entity recognition. In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 4814–4828 (2021)
Li, H., Xu, H., Qian, L., Zhou, G.: Multi-layer joint learning of Chinese nested named entity recognition based on self-attention mechanism. In: Zhu, X., Zhang, M., Hong, Yu., He, R. (eds.) Natural Language Processing and Chinese Computing, pp. 144–155. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-60457-8_12
Li, J., Sun, A., Han, J., Li, C.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. 34(1), 50–70 (2020)
Li, J., et al.: Unified named entity recognition as word-word relation classification. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 36, pp. 10965–10973 (2022)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Qiu, O., et al.: Chinese engineering geological named entity recognition by fusing multi-features and data enhancement using deep learning. Expert Syst. Appl. 238, 121925 (2024)
Rodríguez, A.J.C., Castro, D.C., García, S.H.: Noun-based attention mechanism for fine-grained named entity recognition. Expert Syst. Appl. 193, 116406 (2022)
Shiyuan, Y., Shuming, G., Ruiyang, H., Jianpeng, Z., Nan, H.: Layered regional exhaustive model for chinese nested named entity recognition. Comput. Technol. Dev. 32(09), 161–166+179 (2022)
Straková, J., Straka, M., Hajič, J.: Neural architectures for nested NER through linearization. arXiv preprint arXiv:1908.06926 (2019)
Su, J.: Efficient globalpointer: Fewer parameters, more effects (2022). https://spaces.ac.cn/archives/8877
Su, J., et al.: Global pointer: novel efficient span-based approach for named entity recognition. arXiv preprint arXiv:2208.03054 (2022)
Wei, Z., Su, J., Wang, Y., Tian, Y., Chang, Y.: A novel cascade binary tagging framework for relational triple extraction. arXiv preprint arXiv:1909.03227 (2019)
Xu, H., Li, Y., Balogun, O., Wu, S., Wang, Y., Cai, Z.: Security risks concerns of generative AI in the IOT. IEEE Internet Things Mag. 7(3), 62–67 (2024)
Xu, Y., Huang, H., Feng, C., Hu, Y.: A supervised multi-head self-attention network for nested named entity recognition. In: Proceedings of the AAAI conference on artificial intelligence, vol. 35, pp. 14185–14193 (2021)
Yu, Y., et al.: Chinese mineral named entity recognition based on BERT model. Expert Syst. Appl. 206, 117727 (2022)
Zhang, H., Zu, K., Lu, J., Zou, Y., Meng, D.: EPSANet: an efficient pyramid squeeze attention block on convolutional neural network. In: Proceedings of the Asian Conference on Computer Vision, pp. 1161–1177 (2022)
The Key Project of the Regional Innovation and Development Joint Fund of the National Natural Science Foundation of China provided funding for this study (Grant No.U22A2025).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ji, X., Chen, L., Gao, H., Shen, F., Guo, H. (2025). A High-Precision Generality Method for Chinese Nested Named Entity Recognition. In: Cai, Z., Takabi, D., Guo, S., Zou, Y. (eds) Wireless Artificial Intelligent Computing Systems and Applications. WASA 2024. Lecture Notes in Computer Science, vol 14999. Springer, Cham. https://doi.org/10.1007/978-3-031-71470-2_24
Download citation
DOI: https://doi.org/10.1007/978-3-031-71470-2_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-71469-6
Online ISBN: 978-3-031-71470-2
eBook Packages: Computer ScienceComputer Science (R0)