Named Entity Recognition of Zhuang Language Based on the Feature of Initial Letter in Word

Authors:
Weiquan Zhang, School of Electronic Engineering, Guangxi Normal University, China
Suqin Tang, Faculty of Education, Guangxi Normal University, China
Danni He, School of Electronic Engineering, Guangxi Normal University, China
Tinghui Li, School of Electronic Engineering, Guangxi Normal University, China
Changchun Pan, School of Mathematics and Computer Science, Guangxi Science & Technology Normal University, China

ICIAI '22: Proceedings of the 2022 6th International Conference on Innovation in Artificial Intelligence
March 2022, Pages 44–49
https://doi.org/10.1145/3529466.3529478

Published: 04 June 2022

ABSTRACT

Named entity recognition is a fundamental task for intelligent information processing and knowledge representation learning of the Zhuang language. Because Zhuang lacks a corpus labeled for named entities, this paper proposes a BiLSTM-CNN-CRF network model that incorporates the case (uppercase or lowercase) of each word's initial letter. First, word2vec is trained on unlabeled Zhuang text to obtain word vectors. Then a convolutional neural network extracts the character-level features of Zhuang words, yielding a character feature vector. These two vectors are concatenated with a randomly initialized initial-case feature vector, and the result is fed into a BiLSTM-CNN-CRF model for training, giving an end-to-end named entity recognition model for the Zhuang language. Experimental results show that, without relying on hand-crafted features or external dictionaries, the proposed method outperforms the comparison models, achieving an F1 score of 80.37% on the named entity recognition task and enabling automated named entity recognition for the Zhuang language.
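
The per-token input described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three case categories, the vector sizes, the initialization range, and the helper names (`initial_case`, `make_case_table`, `token_input`) are assumptions made for illustration, and placeholder lists stand in for the word2vec vector and the CNN character feature vector. The sketch only shows how an initial-letter case feature might be looked up and concatenated with the other two vectors before being passed to a sequence model.

```python
import random

# Assumed case categories; the abstract only states that the feature
# reflects the case of a word's initial letter.
CASE_CATEGORIES = ("upper", "lower", "other")


def initial_case(word):
    """Classify a word by the case of its first character."""
    if word and word[0].isupper():
        return "upper"
    if word and word[0].islower():
        return "lower"
    return "other"  # digits, punctuation, empty strings


def make_case_table(dim, seed=0):
    """Create one randomly initialised vector per case category."""
    rng = random.Random(seed)
    return {c: [rng.uniform(-0.25, 0.25) for _ in range(dim)]
            for c in CASE_CATEGORIES}


def token_input(word_vec, char_vec, word, case_table):
    """Concatenate word, character and initial-case vectors for one token."""
    return list(word_vec) + list(char_vec) + list(case_table[initial_case(word)])


# Placeholder vectors stand in for word2vec output and CNN character features.
case_table = make_case_table(dim=2)
features = token_input([0.1] * 4, [0.2] * 3, "Gvangjsih", case_table)
print(initial_case("Gvangjsih"), len(features))  # prints: upper 9
```

In a trainable model the case table would typically be a learned embedding updated during training rather than a fixed random table; the fixed table here only keeps the sketch dependency-free.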

Published in

ICIAI '22: Proceedings of the 2022 6th International Conference on Innovation in Artificial Intelligence
March 2022
240 pages
ISBN: 9781450395502
DOI: 10.1145/3529466

Copyright © 2022 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Conditional random field
Convolutional neural network
Long short-term memory network
Named entity recognition
Zhuang language
Qualifiers
- research-article
- Research
- Refereed limited