Skip to main content

A Survey of Deep Learning for Named Entity Recognition in Chinese Social Media

  • Conference paper
  • First Online:
Artificial Intelligence and Security (ICAIS 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13338))

Included in the following conference series:

Abstract

Named Entity Recognition is the research foundation of many Natural Language Processing sub-tasks. Named Entity Recognition for Chinese social media is to identify entity nouns such as person names, place names, and organization names in Chinese Social Media corpus. Due to the non-standardization of Chinese Social Media texts and the small size of the corpus, the accuracy of entity recognition will be affected. In this review, aiming at the above issues, we first introduce the historical development and research background of Chinese named entity recognition. Then, we investigate the latest improvement methods of Chinese named entity recognition for social media, and divide these improvement methods into methods to improve model recognition performance with external knowledge and methods to enhance internal knowledge to improve model performance. Finally, we summarize the challenges Chinese named entity recognition in social media based on deep learning, and propose the future development direction for these challenges.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30, 3–26 (2007)

    Article  Google Scholar 

  2. Tran, P., Ta, V., Truong, Q., Duong, Q., Nguyen, T., Phan, X.: Named entity recognition for vietnamese spoken texts and its application in smart mobile voice interaction. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds.) Intelligent Information and Database Systems. ACIIDS 2016. LNCS, vol. 9621, pp. 170–180. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49381-6_17

  3. Yang, J., Zhang, Y., Dong, F.: Neural reranking for named entity recognition RANLP. In: Advances in Natural Language Processing Meet Deep Learning, pp. 84–92 (2017)

    Google Scholar 

  4. Wang, Y., Sun, Y., Ma, Z., Gao, L., Xu, Y., Sun, T.: Application of pre-training models in named entity recognition. In: 2020 12th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), vol. 1, pp. 23–26. IEEE (2020)

    Google Scholar 

  5. Klinger, R., Friedrich, C.: User’s choice of precision and. named entity recognition. In: Proceedings of the International Conference RANLP-2009, pp. 92–96 (2009)

    Google Scholar 

  6. Yoo, S., Jeong, O.: EP-Bot: empathetic chatbot using auto-growing knowledge graph. Comput. Mater. Cont. 67(3), 2807–2817 (2021)

    Google Scholar 

  7. He, Q., Wu, L., Yin, Cai, Y., H: Knowledge-graph augmented word representations for named entity recognition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 19–26 (2020)

    Google Scholar 

  8. Lossio-Ventura, J., et al.: Towards an obesity-cancer knowledge base: Biomedical entity identification and relation detection. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 81–88. IEEE (2016)

    Google Scholar 

  9. Loster, M. Knowledge base construction with machine learning methods. Universität Potsdam (2021)

    Google Scholar 

  10. He, Z., Li, W.H.: Named entity recognition and disambiguation. General Information (2013)

    Google Scholar 

  11. Adak, C., Chaudhuri, B., Blumenstein, M.: Named entity recognition from unstructured handwritten document. In: Images Document Analysis Systems, pp. 75–80. IEEE (2016)

    Google Scholar 

  12. Dandapat, S., Way, A.: Improved named entity recognition using machine translation- based cross-lingual. Information Computacion Y Sistemas 20, 495–504 (2016)

    Google Scholar 

  13. Li, Z., Qu, D., Xie, C., Li, Y.: Language model pre-training method in machine translation based on named entity recognition. Int. J. Artif. Intell. Tools 29(7n08), 2040021 (2020)

    Google Scholar 

  14. Al-Besher, A., Kumar, K., Sangeetha, M., Butsa, T.: Bert for conversational question answer- ing systems using semantic similarity estimation. Comput. Mater. Cont. 70(3), 4763–4780 (2022)

    Google Scholar 

  15. Wang, Z., Guan, H.: 2020 Research on named entity recognition of doctor-patient question answering community based on bilstm-crf model. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 41–44. IEEE (2020)

    Google Scholar 

  16. Lamurias, A., Couto, F.: Biomedical question answering using bidirectional transformers and named entity recognition. In: Proceedings of the 18th BioNLP Workshop and Shared Task, pp. 23–27 (2019)

    Google Scholar 

  17. Rau, L.: Extracting company names from text. In: Proceedings of the Seventh IEEE Conference on Artificial Intelligence Application, pp. 29–32. IEEE (1991)

    Google Scholar 

  18. Bikel, D., Schwarta, R., Weischedel, R.: An algorithm that learns what’s in a. name. Mach. Learn. 34, 211–242 (1999)

    Google Scholar 

  19. Chinchor, N., Robinson, P.: MUC-7 named entity task definition. In: Proceedings of the 7th Conference on Message Understanding, vol. 29, pp. 1–21 (1997)

    Google Scholar 

  20. Wu, Y., Lin, Y.J., Q: Description of the NCU Chinese word segmentation and named entity recognition system for SIGHAN Bakeoff. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 209–221 (2006)

    Google Scholar 

  21. Sang, E.F.T.K., DeMeulder, F.: Introduction to the CoNLL-2003 shared task: language-independent named entity recognition. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL, pp. 142–147 (2003)

    Google Scholar 

  22. Mao, X., Dong, Y., He, S., Wang, H., Bao, S.: Chinese word segmentation and named entity recognition based on conditional random fields. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing, pp. 90–93 (2008)

    Google Scholar 

  23. Li, L., Mao, T., Huang, D., Yang, Y.: Hybrid models for Chinese named entity recognition. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 72–78 (2006)

    Google Scholar 

  24. Liu, X., Zhang, S., Wei, F., Zhou, M.: Recognizing named entities in tweets. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, pp. 59–67 (2011)

    Google Scholar 

  25. Ling, W., Xiang, G., Dyer, C., Alan, B., Isabel, T.: Microblogs as parallel corpora. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 76–86 (2013)

    Google Scholar 

  26. Peng, N., Dredze, M.: Named entity recognition for Chinese social media with jointly trained embeddings. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 48–54 (2015)

    Google Scholar 

  27. Cheng, J., Yang, Y., Tang, X., Xiong, N., Zhang, Y., Lei, F.: Generative adversarial net- works: a literature review. KSII Trans. Internet Inf. Syst. 14(12), 4625–4647 (2020)

    Google Scholar 

  28. Lei, F., Cheng, J., Yang, Y., Tang, X., Sheng, V., Huang, C.: Improving heterogeneous network knowledge transfer based on the principle of generative adversarial. Electronics 10(13), 1525 (2021)

    Article  Google Scholar 

  29. Tang, X., Tu, W., Li, K., Cheng, J.: DFFNet: an IoT-perceptive dual feature fusion network for general real-time semantic segmentation. Inf. Sci. 565, 326–343 (2021)

    Article  Google Scholar 

  30. Cheng, J., Peng, X., Tang, W., Tu, W., Xu: MIFNet: a lightweight multiscale information fusion network. Int. J. Intell. Syst. 1–26 (2021)

    Google Scholar 

  31. Li, T., Hu, Y., Ju, A., Hu, Z.: Adversarial active learning for named entity recognition in cybersecurity. Comput. Mater. Cont. 66(1), 407–420 (2021)

    Google Scholar 

  32. Zhao, S., Hu, M., Cai, Z., Zhang, Z., Zhou, T., Liu, F.: Enhancing Chinese character representation with lattice-aligned attention. IEEE Trans. Neural Netw. Learn. Syst. (2021). https://doi.org/10.1109/TNNLS.2021.3114378

    Article  Google Scholar 

  33. Cheng, J., Liu, J., Xu, X.: A review of Chinese named entity recognition. KSII Trans. Internet Inf. Syst. (TIIS) 15(6), 2012–2030 (2021)

    Google Scholar 

  34. He, J., Wang, H.: Chinese named entity recognition and word segmentation based on character. In: Proceedings of the Sixth SIGHAN Workshop on Chinese Language Processing, pp. 28–32 (2008)

    Google Scholar 

  35. He, Z., Li, W.H., S: The task 2 of CIPS-SIGHAN 2012 named entity recognition and disambiguation. In. Chinese Bakeoff Proceedings of the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing, pp. 108–122 (2012)

    Google Scholar 

  36. Li, J., Sun, A., Han, J., Li, C.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. (2020). https://doi.org/10.1109/TKDE.2020.2981314

    Article  Google Scholar 

  37. Peng, N., Dredze, M.: Improving named entity recognition for Chinese social media with word segmentation representation learning. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 49–55 (2016)

    Google Scholar 

  38. He, H., Sun, X.: Score driven max margin neural network for named entity recognition in Chinese social media. In: Proceedings of the 15th Conference of the European, chap. 2, pp. 713–731 (2017)

    Google Scholar 

  39. He, H., Sun, X.: A unified model for cross-domain and semi-supervised named entity recognition in Chinese social media. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017)

    Google Scholar 

  40. Wang, B., Chai, Y., Xing, S.: Attention-based recurrent neural model for named entity recognition in. Chinese social media. In: Proceedings of the 2019 2nd International Conference on Algorithms, Computing and Artificial Intelligence, pp. 91–96 (2019)

    Google Scholar 

  41. Nie, Y., Tian, Y., Wan, X., Song, Y., Dai, B.: Named entity recognition for social media texts with semantic augmentation. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 83–91 (2020)

    Google Scholar 

  42. Gong, Z., Chen, P., Zhou, J.: integrating boundary assembling into a DNN framework for named entity recognition in Chinese social media text. arXiv:2002.11910 (2020)

  43. Dong, C., Wu, H., Zhang, J., Zong, C.: Multichannel LSTM-CRF for named entity recognition in Chinese social media. In: Sun, M., Wang, X., Chang, B., Xiong, D. (eds.) NLP-NABD CCL 2017. LNCS, vol. 10565, pp. 197–208. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69005-6_17

Download references

Acknowledgments

This work was supported by National Natural Science Foundation of China (Grant No.62162024 and No. 62162022), Key Projects in Hainan Province (Grant No. ZDYF2021GXJS003 and No. ZDYF2020040), the Major science and technology project of Hainan Province(Grant No.ZDKJ2020012) and Graduate Innovation Project (Grant No.Qhys2021–187).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jieren Cheng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, J., Cheng, J., Wang, Z., Lou, C., Shen, C., Sheng, V.S. (2022). A Survey of Deep Learning for Named Entity Recognition in Chinese Social Media. In: Sun, X., Zhang, X., Xia, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2022. Lecture Notes in Computer Science, vol 13338. Springer, Cham. https://doi.org/10.1007/978-3-031-06794-5_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-06794-5_46

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-06793-8

  • Online ISBN: 978-3-031-06794-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics