ABSTRACT
The state-of-the-art for Relation Extraction, defined as the detection of existing relations between a pair of entities in a sentence, relies on neural networks that require a large number of training examples to perform well. To address that cost, Distant Supervision has become the preferred choice for collecting labeled sentences. However, Distant Supervision has many limitations and often introduces noise into the training set. Recent work has shown an alternative way of training neural methods for relation extraction, namely to provide a small number of annotated sentences and explanations for why those sentences express the relation. Training classifiers with this approach results in accuracy comparable to Distant Supervision, but requires humans to annotate the sentences and provide the explanations. In this paper, we show a way to generate synthetic explanations from a small number of relational trigger words, for each relation, whose resulting explanations achieve comparable accuracy to human produced ones. We validate the method on five relation extraction tasks with different entity types (person-person, person-location, etc.). Furthermore, experiments on two public datasets demonstrate the effectiveness of our generated synthetic explanations, with 6% improvement in accuracy on relation extraction and 19% improvement in F1-score on generating labeled training sentences compared to the next best methods.
- The lemur project. http://lemurproject.org/.Google Scholar
- E. Agichtein and L. Gravano. Snowball: Extracting relations from large plain-text collections. In Proceedings of the Fifth ACM Conference on Digital Libraries, DL '00, pages 85–94. ACM, 2000.Google ScholarDigital Library
- C. Alt, M. Hu¨bner, and L. Hennig. Improving relation extraction by pre-trained language representations. arXiv preprint arXiv:1906.03088, 2019.Google Scholar
- M. Banko, M. J. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. Open information extraction from the web. In IJCAI, volume 7, pages 2670–2676, 2007.Google Scholar
- D. T. Bollegala, Y. Matsuo, and M. Ishizuka. Relational duality: Unsupervised extraction of semantic relations between entities on the web. In Proceedings of the 19th international conference on World wide web, pages 151–160. ACM, 2010.Google Scholar
- S. Brin. Extracting patterns and relations from the world wide web. In International Workshop on The World Wide Web and Databases, pages 172–183. Springer, 1998.Google Scholar
- R. C. Bunescu and R. J. Mooney. A shortest path dependency kernel for relation extraction. In Proceedings of the conference on human language technology and empirical methods in natural language processing, pages 724–731. Association for Computational Linguis-tics, 2005.Google Scholar
- J. Ellis, X. Li, K. Griffitt, S. M. Strassel, and J. Wright. Linguistic resources for 2013 knowledge base population evaluations. In TAC, 2012.Google Scholar
- J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by gibbs sampling. In Proceedings of the 43rd annual meeting on association for computational linguistics, pages 363–370. Association for Computational Linguistics, 2005.Google Scholar
- E. Gabrilovich, M. Ringgaard, and A. Subramanya. Facc1: Freebase annotation of clueweb corpora, version 1 (release date 2013-06-26, format version 1, correction level 0). Note: http://lemurproject.org/clueweb09/FACC1/Cited by, 5, 2013Google Scholar
- M. R. Gormley, M. Yu, and M. Dredze. Improved relation extraction with feature-rich compositional embedding models.Google Scholar
- Z. Guo and D. Barbosa. Robust named entity disambiguation with random walks. Semantic Web, 9(4):459– 479, 2018.Google Scholar
- B. Hancock. Babble labble. https://github.com/ HazyResearch/babble, 2018.Google Scholar
- B. Hancock, P. Varma, S. Wang, M. Bringmann, P. Liang, and C. R´e. Training classifiers with natural language explanations. arXiv preprint arXiv:1805.03818, 2018.Google Scholar
- G. E. Hinton Learning distributed representations of concepts. In Proceedings of the eighth annual conference of the cognitive science society, volume 1, page 12. Amherst, MA, 1986.Google Scholar
- R. Hoffmann, C. Zhang, X. Ling, L. Zettlemoyer, and D. S. Weld. Knowledge-based weak supervision for information extraction of overlapping relations. In Proceedings of the 49th Annual Meeting of the Associa- tion for Computational Linguistics: Human Language Technologies-Volume 1, pages 541–550. Association for Computational Linguistics, 2011.Google Scholar
- A. Joulin, E. Grave, P. Bojanowski, and T. Mikolov. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759, 2016.Google Scholar
- X. Ling and D. S. Weld. Fine-grained entity recognition. In AAAI, volume 12, pages 94–100, 2012.Google Scholar
- M. Mintz, S. Bills, R. Snow, and D. Jurafsky. Distant supervision for relation extraction without labeled data. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2-Volume 2, pages 1003–1011. Association for Computational Linguistics, 2009.Google Scholar
- B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 701–710. ACM, 2014.Google Scholar
- X. Ren, Z. Wu, W. He, M. Qu, C. R. Voss, H. Ji, T. F. Abdelzaher, and J. Han. Cotype: Joint extraction of typed entities and relations with knowledge bases. In Proceedings of the 26th International Conference on World Wide Web, pages 1015–1024. International World Wide Web Conferences Steering Committee, 2017.Google ScholarDigital Library
- E. Riloff, R. Jones, Learning dictionaries for information extraction by multi-level bootstrapping. In AAAI/IAAI, pages 474–479, 1999.Google Scholar
- B. Rozenfeld and R. Feldman. Self-supervised relation extraction from the web. Knowledge and Information Systems, 17(1):17–33, 2008.Google Scholar
- S. Shimaoka, P. Stenetorp, K. Inui, and S. Riedel. Neural architectures for fine-grained entity type classification. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, volume 1, pages 1271–1280, 2017.Google ScholarCross Ref
- Y. Shinyama and S. Sekine. Preemptive information extraction using unrestricted relation discovery. In Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pages 304–311. Association for Computational Linguistics, 2006.Google Scholar
- J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei. Line: Large-scale information network embedding. In Proceedings of the 24th international conference on world wide web, pages 1067–1077. International World Wide Web Conferences Steering Committee, 2015.Google ScholarDigital Library
- C. Zhang. Deepdive: a data management system for automatic knowledge base construction. University of Wisconsin-Madison, Madison, Wisconsin, 2015Google Scholar
Index Terms
- Relation Extraction with Synthetic Explanations and Neural Network
Recommendations
Clustering-Augmented Multi-instance Learning for Neural Relation Extraction
Advances in Information RetrievalAbstractDespite its efficiency in generating training data, distant supervision for sentential relation extraction assigns labels to instances in a context-agnostic manner—a process that may introduce false labels and confuse sentential model learning. In ...
Distantly Supervised Neural Network Model for Relation Extraction
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big DataAbstractFor the task of relation extraction, distant supervision is an efficient approach to generate labeled data by aligning knowledge base (KB) with free texts. Albeit easy to scale to thousands of different relations, this procedure suffers from ...
A Relation-Oriented Method for Joint Entity and Relation Extraction Based on Neural Network
EITCE '21: Proceedings of the 2021 5th International Conference on Electronic Information Technology and Computer EngineeringEntity and relation extraction is a basic task of information extraction in natural language processing. At present, Entity and relation extraction based on artificial intelligence has been widely studied, but most methods adopt the idea of identifying ...
Comments