DOI: 10.1145/3383972.3384044
research-article

DSREFC: Improving Distantly-supervised Neural Relation Extraction Using Feature Combination

Published: 26 May 2020

Abstract

Distantly supervised relation extraction automatically aligns the expected entity pairs and obtains a large amount of annotated data, which saves considerable labor cost. However, automatically acquired annotations introduce noisy data, which degrades the performance of the relation extraction task. To solve this problem, we propose the relation extraction model DSREFC, which integrates semantic and syntactic features into the representation and uses an attention mechanism to obtain the bag representation. The DSREFC model has three characteristics: 1) BERT+Bi-LSTM is used as the text representation extractor to capture the semantic information of the text. 2) A GCN is applied to the text to extract syntactic information, and the Bi-LSTM output is combined with the GCN output to obtain a distributed representation of each token. 3) A two-step attention mechanism reduces the influence of noisy data by assigning it lower weights: attention over the tokens yields the sentence representation, and attention over the sentences in a bag yields the bag representation. Experiments show that the DSREFC model, which combines semantic and syntactic features, can significantly improve relation extraction performance.
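The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The token vectors stand in for BERT+Bi-LSTM outputs, the graph convolution is a parameter-free neighbour average rather than a trained GCN, and the attention scores are plain dot products with a query vector. All function names (`gcn_layer`, `attend`, `encode_sentence`, `encode_bag`) and the toy values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(vectors, query):
    """Return the attention-weighted sum of `vectors` and the weights.

    Weights are softmax(vector . query); a low score means a low weight,
    which is how noisy items are down-weighted in this sketch.
    """
    weights = softmax([dot(v, query) for v in vectors])
    dim = len(vectors[0])
    pooled = [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]
    return pooled, weights


def gcn_layer(features, edges):
    """Simplified graph convolution (assumption: no learned weights).

    Each token is replaced by the mean of itself and its dependency
    neighbours, followed by ReLU.
    """
    n, dim = len(features), len(features[0])
    neighbours = [{i} for i in range(n)]  # self-loops
    for head, dep in edges:               # treat edges as undirected
        neighbours[head].add(dep)
        neighbours[dep].add(head)
    out = []
    for i in range(n):
        nbrs = neighbours[i]
        mean = [sum(features[j][d] for j in nbrs) / len(nbrs) for d in range(dim)]
        out.append([max(0.0, x) for x in mean])
    return out


def encode_sentence(seq_feats, edges, token_query):
    """Concatenate sequence and graph features, then pool tokens by attention."""
    syn_feats = gcn_layer(seq_feats, edges)
    combined = [s + g for s, g in zip(seq_feats, syn_feats)]  # per-token concat
    sent_vec, _ = attend(combined, token_query)
    return sent_vec


def encode_bag(sentences, token_query, sentence_query):
    """Pool the sentence vectors of one bag by a second attention step."""
    sent_vecs = [encode_sentence(f, e, token_query) for f, e in sentences]
    return attend(sent_vecs, sentence_query)


# Toy bag: (token features, dependency edges) per sentence. Values are made up.
bag = [
    ([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [(0, 1), (1, 2)]),
    ([[0.0, 0.0], [0.0, 0.0]], [(0, 1)]),
]
token_query = [1.0, 0.0, 1.0, 0.0]
sentence_query = [1.0, 1.0, 1.0, 1.0]
bag_vec, weights = encode_bag(bag, token_query, sentence_query)
print(bag_vec, weights)
```

In this toy bag the second sentence carries all-zero features, so it scores lower than the first under the sentence-level query and receives the smaller attention weight. A trained model would learn the queries and the GCN weights instead of fixing them by hand.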


    Published In

    ICMLC '20: Proceedings of the 2020 12th International Conference on Machine Learning and Computing
    February 2020
    607 pages
    ISBN:9781450376426
    DOI:10.1145/3383972

    In-Cooperation

    • Shenzhen University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Distant Supervisory
    2. Feature Combination
    3. GCN Network
    4. Relation Extraction

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMLC 2020
