skip to main content
10.1145/1553374.1553460acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

Topic-link LDA: joint models of topic and author community

Published:14 June 2009Publication History

ABSTRACT

Given a large-scale linked document collection, such as a collection of blog posts or a research literature archive, there are two fundamental problems that have generated a lot of interest in the research community. One is to identify a set of high-level topics covered by the documents in the collection; the other is to uncover and analyze the social network of the authors of the documents. So far these problems have been viewed as separate problems and considered independently from each other. In this paper we argue that these two problems are in fact inter-dependent and should be addressed together. We develop a Bayesian hierarchical approach that performs topic modeling and author community discovery in one unified framework. The effectiveness of our model is demonstrated on two blog data sets in different domains and one research paper citation data from CiteSeer.

References

  1. Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic models. Proc. of Int. Conf. on Mach. Learn. (ICML'06) (pp. 113--120). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. J. Mach. Learn. Res., 3, 993--1022. Google ScholarGoogle ScholarCross RefCross Ref
  3. Chakrabarti, D., & Faloutsos, C. (2006). Graph mining: Laws, generators, and algorithms. ACM Comput. Surv., 38, 2. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chang, J., & Blei, D. (2009). Relational topic models for document networks. Proc. of Conf. on AI and Statistics (AISTATS'09).Google ScholarGoogle Scholar
  5. Cohn, D., & Hofmann, T. (2001). The missing link - a probabilistic model of document content and hypertext connectivity. Proc. of Conf. on Neural Information Processing Systems (NIPS'01) (pp. 430--436).Google ScholarGoogle Scholar
  6. Dietz, L., Bickel, S., & Scheffer, T. (2007). Unsupervised prediction of citation influences. Proc. of Int. Conf. on Mach. Learn. (ICML'07) (pp. 233--240). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Erosheva, E., Fienberg, S., & Lafferty, J. (2004). Mixed membership models of scientific publications. Proc. Nat. Acad. Sci., 101, 5220--5227.Google ScholarGoogle ScholarCross RefCross Ref
  8. Gibson, D., Kleinberg, J. M., & Raghavan, P. (1998). Inferring web communities from link topology. UK Conference on Hypertext (pp. 225--234). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proc. Nat. Acad. Sci., 101, 5228--5235.Google ScholarGoogle ScholarCross RefCross Ref
  10. Jaakkola, T. (1997). Variational methods for inference and estimation in graphical models. PhD thesis, MIT. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jordan, M. I., Ghahramani, Z., Jaakkola, T., & Saul, L. K. (1999). An introduction to variational methods for graphical models. Machine Learning, 37, 183--233. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Mccallum, A., Corrada-Emmanuel, A., & Wang, X. (2005). Topic and role discovery in social networks. Proc. of Int. Joint Conf. on Articial Intelligence (IJCAI'05) (pp. 786--791). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. McCallum, A., Nigam, K., Rennie, J., & Seymore, K. (2000). Automating the construction of internet portals with machine learning. Information Retrieval Journal, 3, 127--163. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Mei, Q., Cai, D., Zhang, D., & Zhai, C. (2008). Topic modeling with network regularization. Proc. of Int. World Wide Web Conf. (WWW'08) (pp. 101--110). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Nallapati, R., & Cohen, W. (2008). Link-plsa-lda: A new unsupervised model for topics and influence in blogs. Proc. of Int. Conf. on Weblogs and Social Media (ICWSM'08) (pp. 84--92).Google ScholarGoogle Scholar
  16. Rosen-Zvi, M., Griffiths, T., Steyvers, M., & Smyth, P. (2004). The author-topic model for authors and documents. Proc. of Conf. on Uncertainty in Artificial Intelligence (UAI'04) (pp. 487--494). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Xu, Z., Tresp, V., Yu, K., Yu, S., & Kriegel, H.-P. (2005). Dirichlet enhanced relational learning. Proc. of Int. Conf. on Mach. Learn. (pp. 1004--1011). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Yu, K., Chu, W., Yu, S., Tresp, V., & Xu, Z. (2006). Stochastic relational models for discriminative link prediction. Proc. of Conf. on Neural Information Processing Systems (NIPS'06) (pp. 1553--1560).Google ScholarGoogle Scholar

Index Terms

  1. Topic-link LDA: joint models of topic and author community

                      Recommendations

                      Comments

                      Login options

                      Check if you have access through your login credentials or your institution to get full access on this article.

                      Sign in
                      • Published in

                        cover image ACM Other conferences
                        ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning
                        June 2009
                        1331 pages
                        ISBN:9781605585161
                        DOI:10.1145/1553374

                        Copyright © 2009 Copyright 2009 by the author(s)/owner(s).

                        Publisher

                        Association for Computing Machinery

                        New York, NY, United States

                        Publication History

                        • Published: 14 June 2009

                        Permissions

                        Request permissions about this article.

                        Request Permissions

                        Check for updates

                        Qualifiers

                        • research-article

                        Acceptance Rates

                        Overall Acceptance Rate140of548submissions,26%

                      PDF Format

                      View or Download as a PDF file.

                      PDF

                      eReader

                      View online with eReader.

                      eReader