Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features

Ouyang, Liubo; Tian, Yuan; Tang, Hui; Zhang, Boyun

doi:10.1007/978-3-319-72389-1_22

Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features

Liubo Ouyang¹⁷,
Yuan Tian¹⁷,
Hui Tang¹⁷ &
…
Boyun Zhang¹⁸

Conference paper
First Online: 07 December 2017

1953 Accesses
8 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10656))

Abstract

Traditional methods for named entity recognition (NER) require heavy feature engineering to achieve high performance. We propose a novel neural network architecture for NER that detects word features automatically without feature engineering. Our approach uses word embedding as input, feeds them into a bidirectional long short-term memory (B-LSTM) for modeling the context within a sentence, and outputs the NER results. This study extends the neural network language model through B-LSTM, which outperforms other deep neural network models in NER tasks. Experimental results show that the B-LSTM with word embedding trained on a large corpus achieves the highest F-score of 0.9247, thus outperforming state-of-the-art methods that are based on feature engineering.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Borthwick, A., Sterling, J., Agichtein, E., et al.: NYU: Description of the MENE Named Entity System as Used in MUC-7 (1998)
Google Scholar
Chinchor, N.: MUC7 Named Entity Task Definition Message Understanding Conference (1997)
Google Scholar
Abbas, A., Ekrem, V., Nazife, D.: ChemTok: a new rule based Tokenizer for chemical named entity recognition. BioMed Res. Int. 2016(5), 1–9 (2016)
Google Scholar
Zhu, J., Li, T., Liu, S.: Research on Tibetan name recognition technology under CRF. J. Nanjing Univ. 3494, 234–250 (2016)
Google Scholar
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: International Conference, pp. 160–167. DBLP (2008)
Google Scholar
Mikolov, T.: Statistical Language Models Based on Neural Networks (2012)
Google Scholar
Mikolov, T., Karafiit, M., Burget, L., et al.: Recurrent neural network based language model. In: INTERSPEECH Conference of the International Speech Communication Association, Makuhari, Chiba, Japan, September 2010, pp. 1045–1048. DBLP (2010)
Google Scholar
Ahmadi, F., Moradi, H.: A hybrid method for Persian named entity recognition. In: Information and Knowledge Technology, pp. 1–7. IEEE (2015)
Google Scholar
Skenduli, M.P., Biba, M.: A named entity recognition approach for Albanian (2013)
Google Scholar
Ekbal, A., Saha, S., Singh, D.: Ensemble based active annotation for named entity recognition. In: International Conference on Advances in Computing, Communications and Informatics, pp. 973–978. IEEE (2013)
Google Scholar
Bam, S.B., Shahi, T.B.: Named entity recognition for Nepali text using support vector machines. Intell. Inf. Manag. 06(2), 21–29 (2014)
Google Scholar
Manamini, S.A.P.M., Ahamed, A.F., Rajapakshe, R.A.E.C., et al.: Ananya - a Named-Entity-Recognition (NER) system for Sinhala language. In: Moratuwa Engineering Research Conference, pp. 30–35. IEEE (2016)
Google Scholar
Aryoyudanta, B., Adji, T.B., Hidayah, I.: Semi-supervised learning approach for Indonesian Named Entity Recognition (NER) using co-training algorithm. In: International Seminar on Intelligent Technology and ITS Applications, pp. 7–12. IEEE (2017)
Google Scholar
He, H., Sun, X.: F-Score Driven Max Margin Neural Network for Named Entity Recognition in Chinese Social Media (2016)
Google Scholar
Huang, E.H., Socher, R., Manning, C.D., et al.: Improving word representations via global context and multiple word prototypes. In: Meeting of the Association for Computational Linguistics: Long Papers. Association for Computational Linguistics, pp. 873–882 (2012)
Google Scholar
Goller, C., Kuchler, A.: Learning task-dependent distributed representations by backpropagation through structure. In: IEEE International Conference on Neural Networks, vol. 1, pp. 347–352. IEEE (1996)
Google Scholar
Wang, R., Panju, M., Gohari, M.: Classification-based RNN machine translation using GRUs (2017)
Google Scholar
Lample, G., Ballesteros, M., Subramanian, S., et al.: Neural Architectures for Named Entity Recognition, pp. 260–270 (2016)
Google Scholar
Gers, F.A., Schmidhuber, J., Cummins, F.: Learning to forget: continual prediction with LSTM. In: Ninth International Conference on Artificial Neural Networks, ICANN 1999, p. 2451 (2002)
Google Scholar
Graves, A., Mohamed, A.R., Hinton, G.: Speech recognition with deep recurrent neural networks 38(2003), 6645–6649 (2013)
Google Scholar
Liu, Y., Burkhart, C., Hearne, J., et al.: Enhancing sumerian lemmatization by unsupervised named-entity recognition. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1446–1451 (2015)
Google Scholar
Collobert, R., Weston, J., Karlen, M., et al.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)
MATH Google Scholar
Mikolov, T., Sutskever, I., Chen, K., et al.: Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013)
Google Scholar
https://github.com/fxsjy/jieba
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. Comput. Sci. (2014)
Google Scholar
Forney, G.D.J.: The viterbi algorithm. Proc. IEEE 61(5), 268–278 (1973)
Article MathSciNet Google Scholar
Jing, X.: Research on named entity recognition based on word vector and conditional random field. Wirel. Internet Technol. (1), 111–112 (2017)
Google Scholar
Wang, G., Cai, Y., Ge, F.: Using hybrid neural network to address Chinese named entity recognition. In: IEEE, International Conference on Cloud Computing and Intelligence Systems, pp. 433–438. IEEE (2015)
Google Scholar
Feng, Y.-T., Zhang, H.-J., Hao, W.-N., Chen, G.J.: Named entity recognition based on deep belief net. Comput. Sci. 43(4), 224–230 (2016)
Google Scholar

Download references

Acknowledgments

This work was supported by the project of National Natural Science Foundation of China (No. 61471169).

Author information

Authors and Affiliations

School of Information Science and Engineering, Hunan University, Changsha, 410082, China
Liubo Ouyang, Yuan Tian & Hui Tang
Department of Information Technology, Hunan Police Academy, Changsha, 410138, China
Boyun Zhang

Authors

Liubo Ouyang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Tian
View author publications
You can also search for this author in PubMed Google Scholar
Hui Tang
View author publications
You can also search for this author in PubMed Google Scholar
Boyun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuan Tian .

Editor information

Editors and Affiliations

Guangzhou University , Guangzhou, China
Guojun Wang
Edith Kinney Gaylord Presidential Professor, University of Oklahoma, Norman, Oklahoma, USA
Mohammed Atiquzzaman
Aalto University, Espoo, Finland
Zheng Yan
University of Texas at San Antonio, San Antonio, Texas, USA
Kim-Kwang Raymond Choo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ouyang, L., Tian, Y., Tang, H., Zhang, B. (2017). Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features. In: Wang, G., Atiquzzaman, M., Yan, Z., Choo, KK. (eds) Security, Privacy, and Anonymity in Computation, Communication, and Storage. SpaCCS 2017. Lecture Notes in Computer Science(), vol 10656. Springer, Cham. https://doi.org/10.1007/978-3-319-72389-1_22

Download citation

DOI: https://doi.org/10.1007/978-3-319-72389-1_22
Published: 07 December 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-72388-4
Online ISBN: 978-3-319-72389-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics