Skip to main content

Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10656))

Abstract

Traditional methods for named entity recognition (NER) require heavy feature engineering to achieve high performance. We propose a novel neural network architecture for NER that detects word features automatically without feature engineering. Our approach uses word embedding as input, feeds them into a bidirectional long short-term memory (B-LSTM) for modeling the context within a sentence, and outputs the NER results. This study extends the neural network language model through B-LSTM, which outperforms other deep neural network models in NER tasks. Experimental results show that the B-LSTM with word embedding trained on a large corpus achieves the highest F-score of 0.9247, thus outperforming state-of-the-art methods that are based on feature engineering.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Borthwick, A., Sterling, J., Agichtein, E., et al.: NYU: Description of the MENE Named Entity System as Used in MUC-7 (1998)

    Google Scholar 

  2. Chinchor, N.: MUC7 Named Entity Task Definition Message Understanding Conference (1997)

    Google Scholar 

  3. Abbas, A., Ekrem, V., Nazife, D.: ChemTok: a new rule based Tokenizer for chemical named entity recognition. BioMed Res. Int. 2016(5), 1–9 (2016)

    Google Scholar 

  4. Zhu, J., Li, T., Liu, S.: Research on Tibetan name recognition technology under CRF. J. Nanjing Univ. 3494, 234–250 (2016)

    Google Scholar 

  5. Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: International Conference, pp. 160–167. DBLP (2008)

    Google Scholar 

  6. Mikolov, T.: Statistical Language Models Based on Neural Networks (2012)

    Google Scholar 

  7. Mikolov, T., Karafiit, M., Burget, L., et al.: Recurrent neural network based language model. In: INTERSPEECH Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 2010, pp. 1045–1048. DBLP (2010)

    Google Scholar 

  8. Ahmadi, F., Moradi, H.: A hybrid method for Persian named entity recognition. In: Information and Knowledge Technology, pp. 1–7. IEEE (2015)

    Google Scholar 

  9. Skenduli, M.P., Biba, M.: A named entity recognition approach for Albanian (2013)

    Google Scholar 

  10. Ekbal, A., Saha, S., Singh, D.: Ensemble based active annotation for named entity recognition. In: International Conference on Advances in Computing, Communications and Informatics, pp. 973–978. IEEE (2013)

    Google Scholar 

  11. Bam, S.B., Shahi, T.B.: Named entity recognition for Nepali text using support vector machines. Intell. Inf. Manag. 06(2), 21–29 (2014)

    Google Scholar 

  12. Manamini, S.A.P.M., Ahamed, A.F., Rajapakshe, R.A.E.C., et al.: Ananya - a Named-Entity-Recognition (NER) system for Sinhala language. In: Moratuwa Engineering Research Conference, pp. 30–35. IEEE (2016)

    Google Scholar 

  13. Aryoyudanta, B., Adji, T.B., Hidayah, I.: Semi-supervised learning approach for Indonesian Named Entity Recognition (NER) using co-training algorithm. In: International Seminar on Intelligent Technology and ITS Applications, pp. 7–12. IEEE (2017)

    Google Scholar 

  14. He, H., Sun, X.: F-Score Driven Max Margin Neural Network for Named Entity Recognition in Chinese Social Media (2016)

    Google Scholar 

  15. Huang, E.H., Socher, R., Manning, C.D., et al.: Improving word representations via global context and multiple word prototypes. In: Meeting of the Association for Computational Linguistics: Long Papers. Association for Computational Linguistics, pp. 873–882 (2012)

    Google Scholar 

  16. Goller, C., Kuchler, A.: Learning task-dependent distributed representations by backpropagation through structure. In: IEEE International Conference on Neural Networks, vol. 1, pp. 347–352. IEEE (1996)

    Google Scholar 

  17. Wang, R., Panju, M., Gohari, M.: Classification-based RNN machine translation using GRUs (2017)

    Google Scholar 

  18. Lample, G., Ballesteros, M., Subramanian, S., et al.: Neural Architectures for Named Entity Recognition, pp. 260–270 (2016)

    Google Scholar 

  19. Gers, F.A., Schmidhuber, J., Cummins, F.: Learning to forget: continual prediction with LSTM. In: Ninth International Conference on Artificial Neural Networks, ICANN 1999, p. 2451 (2002)

    Google Scholar 

  20. Graves, A., Mohamed, A.R., Hinton, G.: Speech recognition with deep recurrent neural networks 38(2003), 6645–6649 (2013)

    Google Scholar 

  21. Liu, Y., Burkhart, C., Hearne, J., et al.: Enhancing sumerian lemmatization by unsupervised named-entity recognition. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1446–1451 (2015)

    Google Scholar 

  22. Collobert, R., Weston, J., Karlen, M., et al.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)

    MATH  Google Scholar 

  23. Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)

    Google Scholar 

  24. https://github.com/fxsjy/jieba

  25. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014)

    Google Scholar 

  26. Forney, G.D.J.: The viterbi algorithm. Proc. IEEE 61(5), 268–278 (1973)

    Article  MathSciNet  Google Scholar 

  27. Jing, X.: Research on named entity recognition based on word vector and conditional random field. Wirel. Internet Technol. (1), 111–112 (2017)

    Google Scholar 

  28. Wang, G., Cai, Y., Ge, F.: Using hybrid neural network to address Chinese named entity recognition. In: IEEE, International Conference on Cloud Computing and Intelligence Systems, pp. 433–438. IEEE (2015)

    Google Scholar 

  29. Feng, Y.-T., Zhang, H.-J., Hao, W.-N., Chen, G.J.: Named entity recognition based on deep belief net. Comput. Sci. 43(4), 224–230 (2016)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the project of National Natural Science Foundation of China (No. 61471169).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan Tian .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ouyang, L., Tian, Y., Tang, H., Zhang, B. (2017). Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features. In: Wang, G., Atiquzzaman, M., Yan, Z., Choo, KK. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2017. Lecture Notes in Computer Science(), vol 10656. Springer, Cham. https://doi.org/10.1007/978-3-319-72389-1_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-72389-1_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-72388-4

  • Online ISBN: 978-3-319-72389-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics