Abstract
NER is challenging because of the semantic ambiguities in academic literature, especially for non-Latin languages. Besides, recognizing Chinese named entities needs to consider word boundary information, as words contained in Chinese texts are not separated with spaces. Leveraging word boundary information could help to determine entity boundaries and thus improve entity recognition performance. In this paper, we propose to combine word boundary information and semantic information for named entity recognition based on multi-task adversarial learning. We learn common shared boundary information of entities from multiple kinds of tasks, including Chinese word segmentation (CWS), part-of-speech (POS) tagging and entity recognition, with adversarial learning. We learn task-specific semantic information of words from these tasks, and combine the learned boundary information with the semantic information to improve entity recognition, with multi-task learning. We conduct extensive experiments to demonstrate that our model achieves considerable performance improvements.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Allen, J.F.: Natural language processing. In: Encyclopedia of Computer Science, pp. 1218–1222 (2003)
Bunescu, R.C., Mooney, R.J.: A shortest path dependency kernel for relation extraction. In: HLT-EMNLP, pp. 724–731 (2005)
Cheng, D., Niu, Z., Zhang, Y.: Contagious chain risk rating for networked-guarantee loans. In: KDD, pp. 2715–2723 (2020)
Cheng, D., Wang, X., Zhang, Y., Zhang, L.: Graph neural network for fraud detection via spatial-temporal attention. TKDE (2020)
Fan, M., et al.: Fusing global domain information and local semantic information to classify financial documents. In: CIKM, pp. 2413–2420 (2020)
Ganin, Y., Lempitsky, V.: Unsupervised domain adaptation by backpropagation. In: ICML, pp. 1180–1189 (2014)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: NAACL, pp. 260–270 (2016)
Liang, X., Cheng, D., Yang, F., Luo, Y., Qian, W., Zhou, A.: F-HMTC: detecting financial events for investment decisions based on neural hierarchical multi-label text classification. In: IJCAI-20, pp. 4490–4496 (2020)
Peng, N., Dredze, M.: Improving named entity recognition for Chinese social media with word segmentation representation learning. In: ACL, pp. 149–155 (2016)
Ye, Z., Ling, Z.H.: Hybrid semi-Markov CRF for neural sequence labeling. In: ACL, pp. 235–240 (2018)
Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. In: ACL, pp. 1554–1564 (2018)
Acknowledgments
This work was supported by National Key R&D Program of China (2018YFC0831904), the National Natural Science Foundation of China (U1711262, 62072185), and the Joint Research Program of SeekData Inc. and ECNU.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhu, P., Cheng, D., Yang, F., Luo, Y., Qian, W., Zhou, A. (2021). ZH-NER: Chinese Named Entity Recognition with Adversarial Multi-task Learning and Self-Attentions. In: Jensen, C.S., et al. Database Systems for Advanced Applications. DASFAA 2021. Lecture Notes in Computer Science(), vol 12682. Springer, Cham. https://doi.org/10.1007/978-3-030-73197-7_40
Download citation
DOI: https://doi.org/10.1007/978-3-030-73197-7_40
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73196-0
Online ISBN: 978-3-030-73197-7
eBook Packages: Computer ScienceComputer Science (R0)