Abstract
Named Entity Recognition is the research foundation of many Natural Language Processing sub-tasks. Named Entity Recognition for Chinese social media is to identify entity nouns such as person names, place names, and organization names in Chinese Social Media corpus. Due to the non-standardization of Chinese Social Media texts and the small size of the corpus, the accuracy of entity recognition will be affected. In this review, aiming at the above issues, we first introduce the historical development and research background of Chinese named entity recognition. Then, we investigate the latest improvement methods of Chinese named entity recognition for social media, and divide these improvement methods into methods to improve model recognition performance with external knowledge and methods to enhance internal knowledge to improve model performance. Finally, we summarize the challenges Chinese named entity recognition in social media based on deep learning, and propose the future development direction for these challenges.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30, 3–26 (2007)
Tran, P., Ta, V., Truong, Q., Duong, Q., Nguyen, T., Phan, X.: Named entity recognition for vietnamese spoken texts and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds.) Intelligent Information and Database Systems. ACIIDS 2016. LNCS, vol. 9621, pp. 170–180. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_17
Yang, J., Zhang, Y., Dong, F.: Neural reranking for named entity recognition RANLP. In: Advances in Natural Language Processing Meet Deep Learning, pp. 84–92 (2017)
Wang, Y., Sun, Y., Ma, Z., Gao, L., Xu, Y., Sun, T.: Application of pre-training models in named entity recognition. In: 2020 12th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), vol. 1, pp. 23–26. IEEE (2020)
Klinger, R., Friedrich, C.: User’s choice of precision and. named entity recognition. In: Proceedings of the International Conference RANLP-2009, pp. 92–96 (2009)
Yoo, S., Jeong, O.: EP-Bot: empathetic chatbot using auto-growing knowledge graph. Comput. Mater. Cont. 67(3), 2807–2817 (2021)
He, Q., Wu, L., Yin, Cai, Y., H: Knowledge-graph augmented word representations for named entity recognition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 19–26 (2020)
Lossio-Ventura, J., et al.: Towards an obesity-cancer knowledge base: Biomedical entity identification and relation detection. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 81–88. IEEE (2016)
Loster, M. Knowledge base construction with machine learning methods. Universität Potsdam (2021)
He, Z., Li, W.H.: Named entity recognition and disambiguation. General Information (2013)
Adak, C., Chaudhuri, B., Blumenstein, M.: Named entity recognition from unstructured handwritten document. In: Images Document Analysis Systems, pp. 75–80. IEEE (2016)
Dandapat, S., Way, A.: Improved named entity recognition using machine translation- based cross-lingual. Information Computacion Y Sistemas 20, 495–504 (2016)
Li, Z., Qu, D., Xie, C., Li, Y.: Language model pre-training method in machine translation based on named entity recognition. Int. J. Artif. Intell. Tools 29(7n08), 2040021 (2020)
Al-Besher, A., Kumar, K., Sangeetha, M., Butsa, T.: Bert for conversational question answer- ing systems using semantic similarity estimation. Comput. Mater. Cont. 70(3), 4763–4780 (2022)
Wang, Z., Guan, H.: 2020 Research on named entity recognition of doctor-patient question answering community based on bilstm-crf model. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 41–44. IEEE (2020)
Lamurias, A., Couto, F.: Biomedical question answering using bidirectional transformers and named entity recognition. In: Proceedings of the 18th BioNLP Workshop and Shared Task, pp. 23–27 (2019)
Rau, L.: Extracting company names from text. In: Proceedings of the Seventh IEEE Conference on Artificial Intelligence Application, pp. 29–32. IEEE (1991)
Bikel, D., Schwarta, R., Weischedel, R.: An algorithm that learns what’s in a. name. Mach. Learn. 34, 211–242 (1999)
Chinchor, N., Robinson, P.: MUC-7 named entity task definition. In: Proceedings of the 7th Conference on Message Understanding, vol. 29, pp. 1–21 (1997)
Wu, Y., Lin, Y.J., Q: Description of the NCU Chinese word segmentation and named entity recognition system for SIGHAN Bakeoff. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 209–221 (2006)
Sang, E.F.T.K., DeMeulder, F.: Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL, pp. 142–147 (2003)
Mao, X., Dong, Y., He, S., Wang, H., Bao, S.: Chinese word segmentation and named entity recognition based on conditional random fields. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing, pp. 90–93 (2008)
Li, L., Mao, T., Huang, D., Yang, Y.: Hybrid models for Chinese named entity recognition. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 72–78 (2006)
Liu, X., Zhang, S., Wei, F., Zhou, M.: Recognizing named entities in tweets. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, pp. 59–67 (2011)
Ling, W., Xiang, G., Dyer, C., Alan, B., Isabel, T.: Microblogs as parallel corpora. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 76–86 (2013)
Peng, N., Dredze, M.: Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 48–54 (2015)
Cheng, J., Yang, Y., Tang, X., Xiong, N., Zhang, Y., Lei, F.: Generative adversarial net- works: a literature review. KSII Trans. Internet Inf. Syst. 14(12), 4625–4647 (2020)
Lei, F., Cheng, J., Yang, Y., Tang, X., Sheng, V., Huang, C.: Improving heterogeneous network knowledge transfer based on the principle of generative adversarial. Electronics 10(13), 1525 (2021)
Tang, X., Tu, W., Li, K., Cheng, J.: DFFNet: an IoT-perceptive dual feature fusion network for general real-time semantic segmentation. Inf. Sci. 565, 326–343 (2021)
Cheng, J., Peng, X., Tang, W., Tu, W., Xu: MIFNet: a lightweight multiscale information fusion network. Int. J. Intell. Syst. 1–26 (2021)
Li, T., Hu, Y., Ju, A., Hu, Z.: Adversarial active learning for named entity recognition in cybersecurity. Comput. Mater. Cont. 66(1), 407–420 (2021)
Zhao, S., Hu, M., Cai, Z., Zhang, Z., Zhou, T., Liu, F.: Enhancing Chinese character representation with lattice-aligned attention. IEEE Trans. Neural Netw. Learn. Syst. (2021). https://doi.org/10.1109/TNNLS.2021.3114378
Cheng, J., Liu, J., Xu, X.: A review of Chinese named entity recognition. KSII Trans. Internet Inf. Syst. (TIIS) 15(6), 2012–2030 (2021)
He, J., Wang, H.: Chinese named entity recognition and word segmentation based on character. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing, pp. 28–32 (2008)
He, Z., Li, W.H., S: The task 2 of CIPS-SIGHAN 2012 named entity recognition and disambiguation. In. Chinese Bakeoff Proceedings of the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing, pp. 108–122 (2012)
Li, J., Sun, A., Han, J., Li, C.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. (2020). https://doi.org/10.1109/TKDE.2020.2981314
Peng, N., Dredze, M.: Improving named entity recognition for Chinese social media with word segmentation representation learning. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 49–55 (2016)
He, H., Sun, X.: Score driven max margin neural network for named entity recognition in Chinese social media. In: Proceedings of the 15th Conference of the European, chap. 2, pp. 713–731 (2017)
He, H., Sun, X.: A unified model for cross-domain and semi-supervised named entity recognition in Chinese social media. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017)
Wang, B., Chai, Y., Xing, S.: Attention-based recurrent neural model for named entity recognition in. Chinese social media. In: Proceedings of the 2019 2nd International Conference on Algorithms, Computing and Artificial Intelligence, pp. 91–96 (2019)
Nie, Y., Tian, Y., Wan, X., Song, Y., Dai, B.: Named entity recognition for social media texts with semantic augmentation. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 83–91 (2020)
Gong, Z., Chen, P., Zhou, J.: integrating boundary assembling into a DNN framework for named entity recognition in Chinese social media text. arXiv:2002.11910 (2020)
Dong, C., Wu, H., Zhang, J., Zong, C.: Multichannel LSTM-CRF for named entity recognition in Chinese social media. In: Sun, M., Wang, X., Chang, B., Xiong, D. (eds.) NLP-NABD CCL 2017. LNCS, vol. 10565, pp. 197–208. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69005-6_17
Acknowledgments
This work was supported by National Natural Science Foundation of China (Grant No.62162024 and No. 62162022), Key Projects in Hainan Province (Grant No. ZDYF2021GXJS003 and No. ZDYF2020040), the Major science and technology project of Hainan Province(Grant No.ZDKJ2020012) and Graduate Innovation Project (Grant No.Qhys2021–187).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, J., Cheng, J., Wang, Z., Lou, C., Shen, C., Sheng, V.S. (2022). A Survey of Deep Learning for Named Entity Recognition in Chinese Social Media. In: Sun, X., Zhang, X., Xia, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2022. Lecture Notes in Computer Science, vol 13338. Springer, Cham. https://doi.org/10.1007/978-3-031-06794-5_46
Download citation
DOI: https://doi.org/10.1007/978-3-031-06794-5_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06793-8
Online ISBN: 978-3-031-06794-5
eBook Packages: Computer ScienceComputer Science (R0)