Skip to main content

Bootstrapped Multi-level Distant Supervision for Relation Extraction

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11233))

Abstract

Distant supervised relation extraction has been widely used to identify new relation facts from free text. However, relying on a single-node categorization model to identify relation facts for thousands of relations simultaneously inevitably accompanies with serious false categorization problem. Also to the best of our knowledge, no previous efforts has yet considered to update the categorization model with the new identified relation facts, which wastes the chance to further improsve the extraction precision and recall. In this paper, we novelly propose a multi-level distant supervision model for relation extraction, which divides the original categorization task into a number of sub-tasks in multiple levels of a constructed tree-like categorization structure. With the tree-like structure, an unlabelled relation instance would be categorized step by step along a path from the root node to a leaf node. Beyond that, we propose to do bootstrapped distant supervision to update the distant supervision model with new learned relation facts iteratively to further improve the extraction precision and recall. Experimental results conducted on two real datasets prove that our approach outperforms state-of-the-art approaches by reaching more than 10% better extraction quality.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://iesl.cs.umass.edu/riedel/ecml/.

  2. 2.

    http://lemurproject.org/clueweb09/.

  3. 3.

    http://code.google.com/archive/p/word2vec/.

References

  1. Mintz, M., Bills, S., Snow, R., Dan, J.: Distant supervision for relation extraction without labeled data. In: ACL-IJCNLP, pp. 1003–1011 (2009)

    Google Scholar 

  2. Savenkov, D., Agichtein, E.: When a knowledge base is not enough: question answering over knowledge bases with external text data. In: SIGIR, pp. 235–244 (2016)

    Google Scholar 

  3. Ji, G., Liu, K., He, S., Zhao, J.: Distant supervision for relation extraction with sentence-level attention and entity descriptions. In: AAAI, pp. 3060–3066 (2017)

    Google Scholar 

  4. Han, X., Sun, L.: Global distant supervision for relation extraction. In: AAAI (2016)

    Google Scholar 

  5. Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: ACL, pp. 423–429 (2004)

    Google Scholar 

  6. Bunescu, R.C., Mooney, R.J.: A shortest path dependency kernel for relation extraction. In: HLT-EMNLP, pp. 724–731 (2005)

    Google Scholar 

  7. Zhou, G., Zhang, M., Ji, D.H., Zhu, Q.: Tree kernel-based relation extraction with context-sensitive structured parse tree information. In: EMNLP-CoNLL, pp. 728–736 (2007)

    Google Scholar 

  8. Zhou, G., Qian, L., Fan, J.: Tree kernel-based semantic relation extraction with rich syntactic and semantic information. Inf. Sci. 180(8), 1313–1325 (2010)

    Article  MathSciNet  Google Scholar 

  9. Zhou, G., Su, J., Zhang, J., Zhang, M.: Exploring various knowledge in relation extraction. In: ACL, pp. 419–444 (2002)

    Google Scholar 

  10. Zeng, D., Liu, K., Lai, S., Zhou, G., Zhao, J., et al.: Relation classification via convolutional deep neural network. In: COLING, pp. 2335–2344 (2014)

    Google Scholar 

  11. Santos, C.N.D., Xiang, B., Zhou, B.: Classifying relations by ranking with convolutional neural networks. Comput. Sci. 86(86), 132–137 (2015)

    Google Scholar 

  12. Agichtein, E., Gravano, L.: Snowball: extracting relations from large plain-text collections. In: ACMDL, pp. 85–94 (2000)

    Google Scholar 

  13. Etzioni, O., et al.: Web-scale information extraction in knowltAll. J. Korean Med. Sci. 27(2), 146–52 (2004)

    Google Scholar 

  14. Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI, pp. 1306–1313 (2010)

    Google Scholar 

  15. Wu, W., Li, H., Wang, H., Zhu, K.Q.: Probase: a probabilistic taxonomy for text understanding. In: SIGMOD, pp. 481–492 (2012)

    Google Scholar 

  16. Riedel, S., Yao, L., Mccallum, A.: Modeling relations and their mentions without labeled text. In: ECML-PKDD, pp. 148–163 (2010)

    Chapter  Google Scholar 

  17. Hoffmann, R., Zhang, C., Ling, X., Zettlemoyer, L., Weld, D.S.: Knowledge-based weak supervision for information extraction of overlapping relations. In: ACL-HLT, pp. 541–550 (2011)

    Google Scholar 

  18. Surdeanu, M., Tibshirani, J., Nallapati, R., Manning, C.D.: Multi-instance multi-label learning for relation extraction. In: EMNLP-CoNLL, pp. 455–465 (2012)

    Google Scholar 

  19. Zeng, D., Liu, K., Chen, Y., Zhao, J.: Distant supervision for relation extraction via piecewise convolutional neural networks. In: EMNLP, pp. 1753–1762 (2015)

    Google Scholar 

  20. Srivastava, N., Salakhutdinov, R.: Discriminative transfer learning with tree-based priors. In: Advances in Neural Information Processing Systems, pp. 2094–2102 (2013)

    Google Scholar 

  21. Bart, E., Porteous, I., Perona, P., Welling, M.: Unsupervised learning of visual taxonomies. In: CVPR, pp. 1–8 (2008)

    Google Scholar 

  22. Yan, Z., Zhang, H., Piramuthu, R., Jagadeesh, V., Decoste, D., Di, W., Yu, Y.: HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition. In: ICCV, pp. 2740–2748 (2016)

    Google Scholar 

  23. Xue, Y., Liao, X., Carin, L., Krishnapuram, B.: Multi-task learning for classification with dirichlet process priors. J. Mach. Learn. Res. 8(1), 35–63 (2007)

    MathSciNet  MATH  Google Scholar 

  24. Salakhutdinov, R.R., Tenenbaum, J., Torralba, A.: Learning to learn with compound HD models. In: Advances in Neural Information Processing Systems, pp. 2061–2069 (2012)

    Google Scholar 

  25. Socher, R., Huval, B., Manning, C.D., Ng, A.Y.: Semantic compositionality through recursive matrix-vector spaces. In: EMNLP-CoNLL, pp. 1201–1211 (2012)

    Google Scholar 

  26. Bengio, Y., Schwenk, H., Sencal, J.S., Morin, F., Gauvain, J.L.: Neural probabilistic language models. J. Mach. Learn. Res. 3(6), 1137–1155 (2006)

    Google Scholar 

  27. Collobert, R., Weston, J., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(1), 2493–2537 (2011)

    MATH  Google Scholar 

  28. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  29. Lin, Y., Shen, S., Liu, Z., Luan, H., Sun, M.: Neural relation extraction with selective attention over instances. In: ACL, pp. 2124–2133 (2016)

    Google Scholar 

  30. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958 (2014)

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This research is partially supported by National Natural Science Foundation of China (Grant No. 61632016, 61402313, 61472263), and the Natural Science Research Project of Jiangsu Higher Education Institution (No. 17KJA520003).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhixu Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

He, Y. et al. (2018). Bootstrapped Multi-level Distant Supervision for Relation Extraction. In: Hacid, H., Cellary, W., Wang, H., Paik, HY., Zhou, R. (eds) Web Information Systems Engineering – WISE 2018. WISE 2018. Lecture Notes in Computer Science(), vol 11233. Springer, Cham. https://doi.org/10.1007/978-3-030-02922-7_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-02922-7_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-02921-0

  • Online ISBN: 978-3-030-02922-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics