ABSTRACT
Chinese characters are composed of radicals, and their radicals have the distinction between "shaped parts" (representing semantics) and "sound parts" (representing speech). As a hieroglyph, many radicals of Chinese characters have certain semantic information, which can effectively improve the performance of Chinese named entity recognition. In the Chinese named entity recognition, many related studies use Bi-LSTM to extract the semantic features from radicals. However, the LSTM-based model cannot effectively extract the semantic information of radicals due to ambiguity in partitioning the granularity of radicals and weak dependency between Chinese radicals. Therefore, this paper presents a radical neural network method RCBC (Radical CNN-BiLSTM-CRF). The experimental results on SIGHAN 2006 Bakeoff MSRA dataset and Peking University's People's Daily dataset in 1998 indicate that this model can effectively extract the semantic information of Chinese radicals and improve the performanceof Chinese named entity recognition compared with the traditional model.
- Duan, H., & Zheng, Y. (2011). A study on features of the CRFs-based Chinese Named Entity Recognition. International Journal of Advanced Intelligence, 3(2), 287--294.Google Scholar
- Han, A. L. F., Wong, D. F., & Chao, L. S. (2013, June). Chinese named entity recognition with conditional random fields in the light of Chinese characteristics. In Intelligent Information Systems Symposium (pp. 57--68). Springer, Berlin, Heidelberg.Google Scholar
- Li, L., Mao, T., Huang, D., & Yang, Y. (2006). Hybrid models for Chinese named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing (pp. 72--78).Google Scholar
- Liu, L., Shang, J., Ren, X., Xu, F.F., Gui, H., Peng, J., & Han, J. (2018, April). Empower sequence labeling with task-aware neural language model. In Thirty-Second AAAI Conference on Artificial Intelligence.Google Scholar
- Peters, M. E., Ammar, W., Bhagavatula, C., & Power, R. (2017). Semi-supervised sequence tagging with bidirectional language models. arXiv preprint arXiv:1705.00108.Google Scholar
- Dong, C., Zhang, J., Zong, C., Hattori, M., & Di, H. (2016). Character-based LSTM-CRF with radical-level features for Chinese named entity recognition. In Natural Language Understanding and Intelligent Applications (pp. 239--250). Springer, Cham.Google ScholarCross Ref
- Wu Jinxing, Nasun-urtu, Yang Zhenxin. Recognition method of Mongolian person names based on conditional random fields [J]. Application Research of Computers, 2016,33(07): 2014--2017.Google Scholar
- Bai Bing, Hou Xia, Shi Song. Named entity recognition method based on CRF and BI-LSTM [J]. Journal of Beijing Information Science & Technology University,2018,33(06): 27--33.Google Scholar
- Santos, C. D., & Zadrozny, B. (2014). Learning character-level representations for part-of-speech tagging. In Proceedings of the 31st International Conference on Machine Learning (ICML-14) (pp. 1818--1826).Google ScholarDigital Library
- Labeau, M., Löser, K., & Allauzen, A. (2015). Non-lexical neural architecture for fine-grained POS tagging. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp. 232--237).Google ScholarCross Ref
- Chiu, J. P., & Nichols, E. (2016). Named entity recognition with bidirectional LSTM-CNNs. Transactions of the Association for Computational Linguistics, 4, 357--370..Google ScholarCross Ref
- Cao Chunping, Guan Pengju. Clinical text named entity recognition based on e-cnn and blstm-crf [J/OL]. Application Research of Computers, 1--5[2019-02-13]. https://doi.org/10.19734/j.issn.1001-3695.2018.09.0606.Google Scholar
- Cao, S., Lu, W., Zhou, J., & Li, X. (2018, April). cw2vec: Learning chinese word embeddings with stroke n-gram information. In Thirty-Second AAAI Conference on Artificial Intelligence.Google Scholar
Index Terms
- A Radical-Based Method for Chinese Named Entity Recognition
Recommendations
Attention in Character-Based BiLSTM-CRF for Chinese Named Entity Recognition
ICMAI '19: Proceedings of the 2019 4th International Conference on Mathematics and Artificial IntelligenceNamed Entity Recognition (NER) is a crucial step in natural language processing (NLP). Recently, some researches work on enhancing the word representations by character-level extensions in English and have achieve excellent performance. The same method ...
Chinese Clinical Named Entity Recognition Based on Stroke-Level and Radical-Level Features
Smart Computing and CommunicationAbstractClinical Named Entity Recognition (CNER) is an important step for mining clini-cal text. Aiming at the problem of insufficient representation of potential Chinese features, we propose the Chinese clinical named entity recognition model based on ...
Single character Chinese named entity recognition
SIGHAN '03: Proceedings of the second SIGHAN workshop on Chinese language processing - Volume 17Single character named entity (SCNE) is a name entity (NE) composed of one Chinese character, such as "[Abstract contained text which could not be captured.]" (zhong1, China) and "[Abstract contained text which could not be captured.]" (e2, Russia). ...
Comments