A Semi-supervised Joint Entity and Relation Extraction Model Based on Tagging Scheme and Information Gain

Zhao, Yonglin; Sun, Xudong; Wang, Shuxin; He, Jianwei; Wei, Yanzhi; Fu, Xianghua

doi:10.1007/978-3-030-60239-0_35

Yonglin Zhao^9,10,
Xudong Sun⁹,
Shuxin Wang¹⁰,
Jianwei He^9,10,
Yanzhi Wei^9,10 &
…
Xianghua Fu¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12453))

Included in the following conference series:

International Conference on Algorithms and Architectures for Parallel Processing

2082 Accesses

Abstract

Joint entity and relation extraction, completing the entity recognition and relation extraction simultaneously, can better integrate the information between two tasks and reduce the errors of each task. The methods based on tagging scheme treat the joint extraction as a sequence labeling task and have achieved outstanding results. However, those tag-based methods are insufficient in making full use of the information between entities and relations. It maybe the reason why they show a relatively poor precision. In this paper, we propose a novel semi-supervised approach that combines the tagging scheme with an information gain module. The information gain module is a combination of distant supervision and attention mechanism, which is used for obtaining prior information of candidate entities and relations. We believe that it is important for the joint extraction results to make much of the links among them. Our tagging scheme adds distant supervision to tag entity words and relational words in the given sentences, and combines with attention mechanism to improve their weight. It effectively helps our bias objective function improve model performance. Experiments on public dataset show that our methods are better than most of the existing pipelined and joint learning methods.

Y. Zhao and X. Fu—This research is supported by the Scientific Research Platforms and Projects in Universities in Guangdong Province under Grants 2019KTSCX204.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Joint extraction of entities and relations using multi-label tagging and relational alignment

Article 17 January 2022

Enhancing Joint Entity and Relation Extraction with Language Modeling and Hierarchical Attention

A joint extraction model of entities and relations based on relation decomposition

Article 24 January 2022

Notes

1.
Stanford CoreNLP can be downloaded at: https://stanfordnlp.github.io/CoreNLP/.
2.
Word2vec can be downloaded at: https://code.g.oogle.com/archive/p/word2vec/.
3.
Google distance supervised API can be downloaded at: https://developers.google.com/knowledge-graph.
4.
NYT dataset can be downloaded at: http://iesl.cs.umass.edu/riedel/ecml/.

References

Alzaidy, R., Caragea, C., Giles, C.L.: BI-LSTM-CRF sequence labeling for keyphrase extraction from scholarly documents. In: The World Wide Web Conference, ser. WWW 2019, pp. 2551–2557. Association for Computing Machinery, New York (2019)
Google Scholar
Bollacker, K., Cook, R., Tufts, P.: Freebase: a shared database of structured general human knowledge, pp. 1962–1963 (January 2007)
Google Scholar
Christopoulou, F., Miwa, M., Ananiadou, S.: A walk-based model on entity graphs for relation extraction (February 2019)
Google Scholar
Gormley, M.R., Yu, M., Dredze, M.: Improved relation extraction with feature-rich compositional embedding models. arXiv preprint arXiv:1505.02419 (2015)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Hoffmann, R., Zhang, C., Ling, X., Zettlemoyer, L., Weld, D.S.: Knowledge-based weak supervision for information extraction of overlapping relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1. Association for Computational Linguistics, pp. 541–550 (2011)
Google Scholar
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:1508.01991 (2015)
Katiyar, A., Cardie, C.: Investigating LSTMs for joint extraction of opinion entities and relations. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Long Papers), vol. 1, pp. 919–929 (2016)
Google Scholar
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. arXiv preprint arXiv:1603.01360 (2016)
Li, P., Mao, K.: Knowledge-oriented convolutional neural network for causal relation extraction from natural language texts. Expert Syst. Appl. 115, 512–523 (2019)
Article Google Scholar
Li, Q., Ji, H.: Incremental joint extraction of entity mentions and relations. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Long Papers), vol. 1, pp. 402–412 (2014)
Google Scholar
Lin, Z., et al.: A structured self-attentive sentence embedding. arXiv preprint arXiv:1703.03130 (2017)
Luo, G., Huang, X., Lin, C.-Y., Nie, Z.: Joint entity recognition and disambiguation. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 879–888 (2015)
Google Scholar
Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J.R., Bethard, S., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Mintz, M., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, vol. 2. Association for Computational Linguistics, pp. 1003–1011 (2009)
Google Scholar
Miwa, M., Sasaki, Y.: Modeling joint entity and relation extraction with table representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1858–1869 (2014)
Google Scholar
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Article Google Scholar
Passos, A., Kumar, V., McCallum, A.: Lexicon infused phrase embeddings for named entity resolution. arXiv preprint arXiv:1404.5367 (2014)
Ren, X., et al.: CoType: joint extraction of typed entities and relations with knowledge bases. In: Proceedings of the 26th International Conference on World Wide Web, pp. 1015–1024 (2017)
Google Scholar
Rink, B., Harabagiu, S.: UTD: classifying semantic relations by combining lexical and semantic resources. In: Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 256–259. Association for Computational Linguistics (2010)
Google Scholar
Santos, C.N.D., Xiang, B., Zhou, B.: Classifying relations by ranking with convolutional neural networks. arXiv preprint arXiv:1504.06580 (2015)
Takanobu, R., Zhang, T., Liu, J., Huang, M.: A hierarchical framework for relation extraction with reinforcement learning. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 7072–7079 (2019)
Google Scholar
Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., Mei, Q.: Line: Large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077 (2015)
Google Scholar
Vaswani, A., Bisk, Y., Sagae, K., Musa, R.: Supertagging with LSTMs. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 232–237 (2016)
Google Scholar
Yadav, V., Bethard, S.: A survey on recent advances in named entity recognition from deep learning models. arXiv preprint arXiv:1910.11470 (2019)
Yang, B., Cardie, C.: Joint inference for fine-grained opinion extraction. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Long Papers), vol. 1, pp. 1640–1649 (2013)
Google Scholar
Zeng, D., Liu, K., Chen, Y., Zhao, J.: Distant supervision for relation extraction via piecewise convolutional neural networks. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1753–1762 (2015)
Google Scholar
Zhai, F., Potdar, S., Xiang, B., Zhou, B.: Neural models for sequence chunking. In: Thirty-First AAAI Conference on Artificial Intelligence (2017)
Google Scholar
Zheng, S., Wang, F., Bao, H., Hao, Y., Zhou, P., Xu, B.: Joint extraction of entities and relations based on a novel tagging scheme. arXiv preprint arXiv:1706.05075 (2017)
Zheng, S., Xu, J., Zhou, P., Bao, H., Qi, Z., Xu, B.: A neural network framework for relation extraction: learning entity semantic and relation pattern. Knowl.-Based Syst. 114, 12–23 (2016)
Article Google Scholar
Zou, F., Shen, L., Jie, Z., Zhang, W., Liu, W.: A sufficient condition for convergences of Adam and RMSProp. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 11127–11135 (2019)
Google Scholar
Qin, P., Xu, W., Wang, W.Y.: Robust distant supervision relation extraction via deep reinforcement learning. arXiv preprint arXiv:1805.09927 (2018)

Download references

Author information

Authors and Affiliations

College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, 518061, China
Yonglin Zhao, Xudong Sun, Jianwei He & Yanzhi Wei
College of Big Data and Internet, Shenzhen University of Technology, Shenzhen, 60590, China
Yonglin Zhao, Shuxin Wang, Jianwei He, Yanzhi Wei & Xianghua Fu

Authors

Yonglin Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Xudong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Shuxin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jianwei He
View author publications
You can also search for this author in PubMed Google Scholar
Yanzhi Wei
View author publications
You can also search for this author in PubMed Google Scholar
Xianghua Fu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yonglin Zhao or Xianghua Fu .

Editor information

Editors and Affiliations

Columbia University, New York, NY, USA
Meikang Qiu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhao, Y., Sun, X., Wang, S., He, J., Wei, Y., Fu, X. (2020). A Semi-supervised Joint Entity and Relation Extraction Model Based on Tagging Scheme and Information Gain. In: Qiu, M. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2020. Lecture Notes in Computer Science(), vol 12453. Springer, Cham. https://doi.org/10.1007/978-3-030-60239-0_35

Download citation

DOI: https://doi.org/10.1007/978-3-030-60239-0_35
Published: 29 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60238-3
Online ISBN: 978-3-030-60239-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics