A Weak Supervision Approach with Adversarial Training for Named Entity Recognition

Shao, Jianxuan; Bu, Chenyang; Ji, Shengwei; Wu, Xindong

doi:10.1007/978-3-030-89363-7_2

Jianxuan Shao^12,13,
Chenyang Bu^12,13,
Shengwei Ji^12,13 &
…
Xindong Wu^12,13,14

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13032))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

1496 Accesses

Abstract

Named entity recognition (NER) is a basic task of natural language processing (NLP), whose purpose is to identify named entities such as the names of persons, places, and organizations in the corpus. Utilizing neural networks for feature extraction, followed by conditional random field (CRF) layer decoding, is effective for the NER task. However, achieving reliable results using neural networks generally requires a large amount of labeled data and the acquisition of high-quality labeled data is costly. To obtain a better NER effect without labeled data, we propose a weak supervision approach with adversarial training (WSAT). WSAT obtains supervised information and domain knowledge through labeling functions, including external knowledge bases, heuristic functions, and generic entity recognition tools. The labeled results are aggregated through the linked hidden Markov model (linked HMM), and adversarial training strategies are added when using the aggregated results for training. We evaluate WSAT on two real-world datasets. When compared to rival algorithms, the F1 values are improved by approximately 2% and 1% on the MSRA and Resume NER datasets, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lample, G., Ballesteros, M., Subramanian, S., et al.: Neural architectures for named entity recognition. arXiv preprint arXiv:1603.01360 (2016)
Maxwell, J.C.: A Treatise on Electricity and Magnetism, 3rd edn, vol. 2, pp. 68–73. Clarendon, Oxford (1892)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., et al.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Yan, H., Deng, B., Li, X., et al.: Tener: Adapting transformer encoder for named entity recognition. arXiv preprint arXiv:1911.04474 (2019)
Chiu, J.P.C., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Article Google Scholar
Li, J., Sun, A., Han, J., et al.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. (2020)
Google Scholar
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015)
Strubell, E., Verga, P., Belanger, D., et al.: Fast and accurate entity recognition with iterated dilated convolutions. arXiv preprint arXiv:1702.02098 (2017)
Lin, B.Y., Xu, F.F., Luo, Z., et al.: Multi-channel BiLSTM-CRF model for emerging named entity recognition in social media. In: Proceedings of the 3rd Workshop on Noisy User-generated Text, pp. 160–165 (2017)
Google Scholar
Li, J., Bu, C., Li, P., et al.: A coarse-to-fine collective entity linking method for heterogeneous information networks. Knowl.-Based Syst. 228(2), 107286 (2021)
Google Scholar
The website of HanLP. https://hanlp.hankcs.com/docs/references.html. Accessed 21 May 2020
Geng, Z., Yan, H., Qiu, X., et al.: fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP. arXiv preprint arXiv:2009.08633 (2020)
Mintz, M., Bills, S., Snow, R., et al.: Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 1003–1011 (2009)
Google Scholar
Jiang, Y., Wu, G., Bu, C., et al.: Chinese entity relation extraction based on syntactic features. In: 2018 IEEE International Conference on Big Knowledge (ICBK). IEEE (2018)
Google Scholar
Shen, Y., Yun, H., Lipton, Z.C., et al.: Deep active learning for named entity recognition. arXiv preprint arXiv:1707.05928 (2017)
Shang, J., Liu, L., Ren, X., et al.: Learning named entity tagger using domain-specific dictionary. arXiv preprint arXiv:1809.03599 (2018)
Fries, J., Wu, S., Ratner, A., et al.: Swellshark: A generative model for biomedical named entity recognition without labeled data. arXiv preprint arXiv:1704.06360 (2017)
Yang, Y., Chen, W., Li, Z., et al.: Distantly supervised NER with partial annotation learning and reinforcement learning. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2159–2169 (2018)
Google Scholar
Jie, Z., Xie, P., Lu, W., et al.: Better modeling of incomplete annotations for named entity recognition. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1 (Long and Short Papers), pp. 729–734 (2019)
Google Scholar
Liu, A., Du, J., Stoyanov, V.: Knowledge-augmented language model and its application to unsupervised named-entity recognition. arXiv preprint arXiv:1904.04458 (2019)
Lison, P., Hubin, A., Barnes, J., et al.: Named entity recognition without labelled data: A weak supervision approach. arXiv preprint arXiv:2004.14723 (2020)
Luo, Y., Zhao, H., Zhan, J.: Named Entity Recognition Only from Word Embeddings. arXiv preprint arXiv:1909.00164 (2019)
Safranchik, E., Luo, S., Bach, S.: Weakly supervised sequence tagging from noisy rules. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no.04, pp. 5570–5578 (2020)
Google Scholar
Madry, A., Makelov, A., Schmidt, L., et al.: Towards deep learning models resistant to adversarial attacks. arXiv preprint arXiv:1706.06083 (2017)
Miyato, T., Dai, A.M., Goodfellow, I.: Adversarial training methods for semi-supervised text classification. arXiv preprint arXiv:1605.07725 (2016)
Levow, G.A.: The third international Chinese language processing bakeoff: word segmentation and named entity recognition. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pp. 108–117 (2006)
Google Scholar
Zhang, Y., Yang, J.: Chinese NER using lattice LSTM. arXiv preprint arXiv:1805.02023 (2018)
Ratner, A., Bach, S.H., Ehrenberg, H., et al.: Snorkel: rapid training data creation with weak supervision. Proc. VLDB Endow. 11(3) (2017)
Google Scholar
Li, X., Yan, H., Qiu, X., et al.: FLAT: Chinese NER using flat-lattice transformer. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020)
Google Scholar

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (2016YFB1000901), the National Natural Science Foundation of China (61806065), and the Fundamental Research Funds for the Central Universities (JZ2020HGQA0186).

Author information

Authors and Affiliations

Key Laboratory of Knowledge Engineering with Big Data (Hefei University of Technology), Ministry of Education, Hefei, China
Jianxuan Shao, Chenyang Bu, Shengwei Ji & Xindong Wu
School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China
Jianxuan Shao, Chenyang Bu, Shengwei Ji & Xindong Wu
Mininglamp Academy of Sciences, Mininglamp Technology, Beijing, China
Xindong Wu

Authors

Jianxuan Shao
View author publications
You can also search for this author in PubMed Google Scholar
Chenyang Bu
View author publications
You can also search for this author in PubMed Google Scholar
Shengwei Ji
View author publications
You can also search for this author in PubMed Google Scholar
Xindong Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chenyang Bu .

Editor information

Editors and Affiliations

MIMOS Berhad, Kuala Lumpur, Malaysia
Duc Nghia Pham
Sirindhorn International Institute of Science and Technology, Thammasat University, Mueang Pathum Thani, Thailand
Thanaruk Theeramunkong
Data61, CSIRO, Brisbane, QLD, Australia
Guido Governatori
Department of Philosophy, Tsinghua University, Beijing, China
Fenrong Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shao, J., Bu, C., Ji, S., Wu, X. (2021). A Weak Supervision Approach with Adversarial Training for Named Entity Recognition. In: Pham, D.N., Theeramunkong, T., Governatori, G., Liu, F. (eds) PRICAI 2021: Trends in Artificial Intelligence. PRICAI 2021. Lecture Notes in Computer Science(), vol 13032. Springer, Cham. https://doi.org/10.1007/978-3-030-89363-7_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-89363-7_2
Published: 01 November 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89362-0
Online ISBN: 978-3-030-89363-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics