A Survey of Deep Learning for Named Entity Recognition in Chinese Social Media

Liu, Jingxin; Cheng, Jieren; Wang, Ziyan; Lou, Congqiang; Shen, Chenli; Sheng, Victor S.

doi:10.1007/978-3-031-06794-5_46

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13338)

Included in the following conference series:

International Conference on Adaptive and Intelligent Systems

Abstract

Named Entity Recognition (NER) is the foundation of many Natural Language Processing sub-tasks. NER for Chinese social media aims to identify entities such as person names, place names, and organization names in Chinese social media corpora. Because Chinese social media text is non-standard and the available corpora are small, recognition accuracy suffers. In this review, we first introduce the historical development and research background of Chinese named entity recognition. We then survey recent methods for improving Chinese named entity recognition on social media, dividing them into methods that improve recognition performance with external knowledge and methods that enhance internal knowledge. Finally, we summarize the challenges facing deep-learning-based Chinese named entity recognition in social media and propose future research directions to address them.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant Nos. 62162024 and 62162022), Key Projects in Hainan Province (Grant Nos. ZDYF2021GXJS003 and ZDYF2020040), the Major Science and Technology Project of Hainan Province (Grant No. ZDKJ2020012), and the Graduate Innovation Project (Grant No. Qhys2021–187).

Author information

Authors and Affiliations

School of Computer Science and Technology, Hainan University, Haikou, 570228, China
Jingxin Liu, Jieren Cheng, Ziyan Wang, Congqiang Lou & Chenli Shen
Hainan Blockchain Technology Engineering Research Center, Hainan University, Haikou, 570228, China
Jieren Cheng
Department of Computer Science, Texas Tech University, Lubbock, TX, 79409, USA
Victor S. Sheng

Corresponding author

Correspondence to Jieren Cheng.

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Xingming Sun
Nanjing University of Information Science and Technology, Nanjing, China
Xiaorui Zhang
Jinan University, Guangzhou, China
Zhihua Xia
Purdue University, West Lafayette, IN, USA
Elisa Bertino

About this paper

Cite this paper

Liu, J., Cheng, J., Wang, Z., Lou, C., Shen, C., Sheng, V.S. (2022). A Survey of Deep Learning for Named Entity Recognition in Chinese Social Media. In: Sun, X., Zhang, X., Xia, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2022. Lecture Notes in Computer Science, vol 13338. Springer, Cham. https://doi.org/10.1007/978-3-031-06794-5_46

DOI: https://doi.org/10.1007/978-3-031-06794-5_46
Published: 04 July 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06793-8
Online ISBN: 978-3-031-06794-5
eBook Packages: Computer Science, Computer Science (R0)
