Abstract
Distant supervised relation extraction has been widely used to identify new relation facts from free text. However, relying on a single-node categorization model to identify relation facts for thousands of relations simultaneously inevitably accompanies with serious false categorization problem. Also to the best of our knowledge, no previous efforts has yet considered to update the categorization model with the new identified relation facts, which wastes the chance to further improsve the extraction precision and recall. In this paper, we novelly propose a multi-level distant supervision model for relation extraction, which divides the original categorization task into a number of sub-tasks in multiple levels of a constructed tree-like categorization structure. With the tree-like structure, an unlabelled relation instance would be categorized step by step along a path from the root node to a leaf node. Beyond that, we propose to do bootstrapped distant supervision to update the distant supervision model with new learned relation facts iteratively to further improve the extraction precision and recall. Experimental results conducted on two real datasets prove that our approach outperforms state-of-the-art approaches by reaching more than 10% better extraction quality.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Mintz, M., Bills, S., Snow, R., Dan, J.: Distant supervision for relation extraction without labeled data. In: ACL-IJCNLP, pp. 1003–1011 (2009)
Savenkov, D., Agichtein, E.: When a knowledge base is not enough: question answering over knowledge bases with external text data. In: SIGIR, pp. 235–244 (2016)
Ji, G., Liu, K., He, S., Zhao, J.: Distant supervision for relation extraction with sentence-level attention and entity descriptions. In: AAAI, pp. 3060–3066 (2017)
Han, X., Sun, L.: Global distant supervision for relation extraction. In: AAAI (2016)
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: ACL, pp. 423–429 (2004)
Bunescu, R.C., Mooney, R.J.: A shortest path dependency kernel for relation extraction. In: HLT-EMNLP, pp. 724–731 (2005)
Zhou, G., Zhang, M., Ji, D.H., Zhu, Q.: Tree kernel-based relation extraction with context-sensitive structured parse tree information. In: EMNLP-CoNLL, pp. 728–736 (2007)
Zhou, G., Qian, L., Fan, J.: Tree kernel-based semantic relation extraction with rich syntactic and semantic information. Inf. Sci. 180(8), 1313–1325 (2010)
Zhou, G., Su, J., Zhang, J., Zhang, M.: Exploring various knowledge in relation extraction. In: ACL, pp. 419–444 (2002)
Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J., et al.: Relation classification via convolutional deep neural network. In: COLING, pp. 2335–2344 (2014)
Santos, C.N.D., Xiang, B., Zhou, B.: Classifying relations by ranking with convolutional neural networks. Comput. Sci. 86(86), 132–137 (2015)
Agichtein, E., Gravano, L.: Snowball: extracting relations from large plain-text collections. In: ACMDL, pp. 85–94 (2000)
Etzioni, O., et al.: Web-scale information extraction in knowltAll. J. Korean Med. Sci. 27(2), 146–52 (2004)
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI, pp. 1306–1313 (2010)
Wu, W., Li, H., Wang, H., Zhu, K.Q.: Probase: a probabilistic taxonomy for text understanding. In: SIGMOD, pp. 481–492 (2012)
Riedel, S., Yao, L., Mccallum, A.: Modeling relations and their mentions without labeled text. In: ECML-PKDD, pp. 148–163 (2010)
Hoffmann, R., Zhang, C., Ling, X., Zettlemoyer, L., Weld, D.S.: Knowledge-based weak supervision for information extraction of overlapping relations. In: ACL-HLT, pp. 541–550 (2011)
Surdeanu, M., Tibshirani, J., Nallapati, R., Manning, C.D.: Multi-instance multi-label learning for relation extraction. In: EMNLP-CoNLL, pp. 455–465 (2012)
Zeng, D., Liu, K., Chen, Y., Zhao, J.: Distant supervision for relation extraction via piecewise convolutional neural networks. In: EMNLP, pp. 1753–1762 (2015)
Srivastava, N., Salakhutdinov, R.: Discriminative transfer learning with tree-based priors. In: Advances in Neural Information Processing Systems, pp. 2094–2102 (2013)
Bart, E., Porteous, I., Perona, P., Welling, M.: Unsupervised learning of visual taxonomies. In: CVPR, pp. 1–8 (2008)
Yan, Z., Zhang, H., Piramuthu, R., Jagadeesh, V., Decoste, D., Di, W., Yu, Y.: HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition. In: ICCV, pp. 2740–2748 (2016)
Xue, Y., Liao, X., Carin, L., Krishnapuram, B.: Multi-task learning for classification with dirichlet process priors. J. Mach. Learn. Res. 8(1), 35–63 (2007)
Salakhutdinov, R.R., Tenenbaum, J., Torralba, A.: Learning to learn with compound HD models. In: Advances in Neural Information Processing Systems, pp. 2061–2069 (2012)
Socher, R., Huval, B., Manning, C.D., Ng, A.Y.: Semantic compositionality through recursive matrix-vector spaces. In: EMNLP-CoNLL, pp. 1201–1211 (2012)
Bengio, Y., Schwenk, H., Sencal, J.S., Morin, F., Gauvain, J.L.: Neural probabilistic language models. J. Mach. Learn. Res. 3(6), 1137–1155 (2006)
Collobert, R., Weston, J., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Lin, Y., Shen, S., Liu, Z., Luan, H., Sun, M.: Neural relation extraction with selective attention over instances. In: ACL, pp. 2124–2133 (2016)
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)
Acknowledgments
This research is partially supported by National Natural Science Foundation of China (Grant No. 61632016, 61402313, 61472263), and the Natural Science Research Project of Jiangsu Higher Education Institution (No. 17KJA520003).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
He, Y. et al. (2018). Bootstrapped Multi-level Distant Supervision for Relation Extraction. In: Hacid, H., Cellary, W., Wang, H., Paik, HY., Zhou, R. (eds) Web Information Systems Engineering – WISE 2018. WISE 2018. Lecture Notes in Computer Science(), vol 11233. Springer, Cham. https://doi.org/10.1007/978-3-030-02922-7_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-02922-7_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02921-0
Online ISBN: 978-3-030-02922-7
eBook Packages: Computer ScienceComputer Science (R0)