Skip to main content

Probabilistic Matrix Factorization Leveraging Contexts for Unsupervised Relation Extraction

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2011)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6634))

Included in the following conference series:

Abstract

The clustering of the semantic relations between entities extracted from a corpus is one of the main issues in unsupervised relation extraction (URE). Previous methods assume a huge corpus because they have utilized frequently appearing entity pairs in the corpus. In this paper, we present a URE that works well for a small corpus by using word sequences extracted as relations. The feature vectors of the word sequences are extremely sparse. To deal with the sparseness problem, we take the two approaches: dimension reduction and leveraging context in the whole corpus including sentences from which no relations are extracted. The context in this case is captured with feature co-occurrences, which indicate appearances of two features in a single sentence. The approaches are implemented by a probabilistic matrix factorization that jointly factorizes the matrix of the feature vectors and the matrix of the feature co-occurrences. Experimental results show that our method outperforms previously proposed methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open information extraction from the web. In: Proceedings of IJCAI (2007)

    Google Scholar 

  2. Banko, M., Etzioni, O.: The tradeoffs between open and traditional relation. In: Proceedings of ACL 2008: HLT (2008)

    Google Scholar 

  3. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41(6), 391–407 (1990)

    Article  Google Scholar 

  4. Harris, Z.: Distributional structure. Word 10, 146–162 (1954)

    Article  Google Scholar 

  5. Hasegawa, T., Sekine, S., Grishman, R.: Discovering relations among named entities from large corpora. In: Proceedings of ACL (2004)

    Google Scholar 

  6. Ma, H., Yang, H., Lyu, M.R., King, I.: SoRec: social recommendation using probabilistic matrix factorization. In: Proceedings of CIKM (2008)

    Google Scholar 

  7. Nakov, P., Popova, A., Mateev, P.: Weight functions impact on LSA performance. In: Proceedings of Euro Conference RANLP 2001 (2001)

    Google Scholar 

  8. Rosenfeld, B., Feldman, R.: URES: an unsupervised web relation extraction system. In: Proceedings of COLING/ACL (2006)

    Google Scholar 

  9. Salakhutdinov, R., Mnih, A.: Probabilistic matrix factorization. In: Proceedings of NIPS (2008)

    Google Scholar 

  10. Salakhutdinov, R., Mnih, A.: Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. In: Proceedings of ICML (2008)

    Google Scholar 

  11. Shuetze, H.: Dimensions of meaning. In: Proceedings of Supercomputing (1992)

    Google Scholar 

  12. Shuetze, H.: Automatic word sense discrimination. Computational Linguistics 24(1), 97–123 (1998)

    Google Scholar 

  13. Turney, P.D.: Measuring semantic similarity by latent relational analysis. In: Proceedings of IJCAI (2005)

    Google Scholar 

  14. Yates, A., Etzioni, O.: Unsupervised methods for determining object and relation systems on the web. Journal of Artificial Intelligence Research 34, 255–296 (2009)

    MATH  Google Scholar 

  15. Zhu, S., Yu, K., Chi, Y., Gong, Y.: Combining content and link for classification using matrix factorization. In: Proceedings of SIGIR (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Takamatsu, S., Sato, I., Nakagawa, H. (2011). Probabilistic Matrix Factorization Leveraging Contexts for Unsupervised Relation Extraction. In: Huang, J.Z., Cao, L., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2011. Lecture Notes in Computer Science(), vol 6634. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20841-6_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20841-6_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20840-9

  • Online ISBN: 978-3-642-20841-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics