Entity Network Prediction Using Multitype Topic Models

Shiozaki, Hitohiro; Eguchi, Koji; Ohkawa, Takenao

doi:10.1007/978-3-540-68125-0_67

Hitohiro Shiozaki¹,
Koji Eguchi² &
Takenao Ohkawa²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5012))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

2481 Accesses
3 Citations

Abstract

Conveying information about who, what, when and where is a primary purpose of some genres of documents, typically news articles. To handle such information, statistical models that capture dependencies between named entities and topics can serve an important role. Although some relationships between who and where should be mentioned in such a document, no statistical topic models explicitly addressed the textual interactions between a who-entity and a where-entity. This paper presents a statistical model that directly captures dependencies between an arbitrary number of word types, such as who-entities, where-entities and topics, mentioned in each document. We show how this multitype topic model performs better at making predictions on entity networks, in which each vertex represents an entity and each edge weight represents how a pair of entities at the incident vertices is closely related, through our experiments on predictions of who-entities and links between them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Allan, J.: Introduction to Topic Detection and Tracking. In: Topic Detection and Tracking: Event-based Information Organization, ch. 1, Kluwer Academic Publishers, Dordrecht (2002)
Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Retrieval Evaluation. In: Modern Information Retrieval, ch. 3, pp. 73–97. Addison-Wesley, Reading (1999)
Google Scholar
Bikel, D.M., Schwartz, R.L., Weischedel, R.M.: An algorithm that learns what’s in a name. Machine Learning 34, 211–231 (1999)
Article MATH Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
Article MATH Google Scholar
Callan, J.P., Croft, W.B., Harding, S.M.: The INQUERY retrieval system. In: Proceedings of the 3rd International Conference on Database and Expert Systems Applications, Valencia, Spain, pp. 78–83 (1992)
Google Scholar
Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America 101, 5228–5235 (2004)
Article Google Scholar
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, USA, pp. 50–57 (1999)
Google Scholar
Newman, D., Chemudugunta, C., Smyth, P., Steyvers, M.: Statistical entity-topic models. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 680–686 (2006)
Google Scholar
Robertson, S.: On GMAP: and other transformations. In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management, New York, NY, USA, pp. 78–83 (2006)
Google Scholar
Steyvers, M., Griffiths, T.: Probabilistic Topic Models. In: Handbook of Latent Semantic Analysis, ch. 21, Lawrence Erbaum Associates (2007)
Google Scholar
Ueda, N., Saito, K.: Parametric mixture models for multi-labeled text. In: Advances in Neural Information Processing Systems, 15, Cambridge, MA, USA (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Science and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
Hitohiro Shiozaki
Graduate School of Engineering, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
Koji Eguchi & Takenao Ohkawa

Authors

Hitohiro Shiozaki
View author publications
You can also search for this author in PubMed Google Scholar
Koji Eguchi
View author publications
You can also search for this author in PubMed Google Scholar
Takenao Ohkawa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Takashi Washio Einoshin Suzuki Kai Ming Ting Akihiro Inokuchi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shiozaki, H., Eguchi, K., Ohkawa, T. (2008). Entity Network Prediction Using Multitype Topic Models. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2008. Lecture Notes in Computer Science(), vol 5012. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68125-0_67

Download citation

DOI: https://doi.org/10.1007/978-3-540-68125-0_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68124-3
Online ISBN: 978-3-540-68125-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics