Abstract
Rank-integrated topic models which incorporate link structures into topic modeling through topical ranking have shown promising performance comparing to other link combined topic models. However, existing work on rank-integrated topic modeling treats ranking as document distribution for topic, and therefore can’t integrate topical ranking with LDA model, which is one of the most popular topic models. In this paper, we introduce a new method to integrate topical ranking with topic modeling and propose a general framework for topic modeling of documents with link structures. By interpreting the normalized topical ranking score vectors as topic distributions for documents, we fuse ranking into topic modeling in a general framework. Under this general framework, we construct two rank-integrated PLSA models and two rank-integrated LDA models, and present the corresponding learning algorithms. We apply our models on four real datasets and compare them with baseline topic models and the state-of-the-art link combined topic models in generalization performance, document classification, document clustering and topic interpretability. Experiments show that all rank-integrated topic models perform better than baseline models, and rank-integrated LDA models outperform all the compared models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hofmann, T.: Probabilistic latent semantic analysis. In: Proceedings of the 15th Conference on Uncertainty in Artificial Intelligence, pp. 289–296. Morgan Kaufmann Publishers Inc. (1999)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Sun, Y., Han, J., Gao, J., Yu, Y.: iTopicModel: information network-integrated topic modeling. In: Proceedings of the 9th IEEE International Conference on Data Mining, pp. 493–502. IEEE (2009)
Chang, J., Blei, D.M.: Hierarchical relational models for document networks. Ann. Appl. Stat. 4(1), 124–150 (2010)
Mei, Q., Cai, D., Zhang, D., Zhai, C.: Topic modeling with network regularization. In: Proceedings of the 17th International Conference on World Wide Web, pp. 101–110. ACM (2008)
Duan, D., Li, Y., Li, R., Zhang, R., Wen, A.: RankTopic: ranking based topic modeling. In: Proceedings of the 12th IEEE International Conference on Data Mining, pp. 211–220. IEEE (2012)
Duan, D., Li, Y., Li, R., Zhang, R., Gu, X., Wen, K.: LIMTopic: a framework of incorporating link based importance into topic modeling. IEEE Trans. Knowl. Data Eng. 26(10), 2493–2506 (2014)
Blei, D.M., Lafferty, J.D.: Dynamic topic models. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 113–120. ACM (2006)
Guo, W., Wu, S., Wang, L., Tan, T.: Social-relational topic model for social networks. In: Proceedings of the 24th ACM International Conference on Information and Knowledge Management, pp. 1731–1734. ACM (2015)
Yao, L., Zhang, Y., Wei, B., Jin, Z., Zhang, R., Zhang, Y., Chen, Q.: Incorporating knowledge graph embeddings into topic modeling. In: AAAI, pp. 3119–3126 (2017)
He, J., Hu, Z., Berg-Kirkpatrick, T., Huang, Y., Xing, E.P.: Efficient correlated topic modeling with topic embedding. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 225–233. ACM (2017)
Li, C., Duan, Y., Wang, H., Zhang, Z., Sun, A., Ma, Z.: Enhancing topic modeling for short texts with auxiliary word embeddings. ACM Trans. Inf. Syst. 36(2), 11:1–11:30 (2017)
Liu, C., Jin, T., Hoi, S.C., Zhao, P., Sun, J.: Collaborative topic regression for online recommender systems: an online and Bayesian approach. Mach. Learn. 106(5), 651–670 (2017)
Nie, L., Davison, B.D., Qi, X.: Topical link analysis for web search. In: Proceedings of the 29th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 91–98. ACM (2006)
Yao, W., He, J., Wang, H., Zhang, Y., Cao, J.: Collaborative topic ranking: leveraging item meta-data for sparsity reduction. In: AAAI, pp. 374–380 (2015)
Ding, W., Ishwar, P., Saligrama, V.: A topic modeling approach to ranking. In: Artificial Intelligence and Statistics, pp. 214–222 (2015)
Acknowledgments
This work is supported by the National Key Research and Development Program of China under grants 2016QY01W0202 and 2016YFB08-00402, National Natural Science Foundation of China under grants 61572221, U1401258, 61433006 and 61502185, Major Projects of the National Social Science Foundation under grant 16ZDA092, Science and Technology Support Program of Hubei Province under grant 2015AAA013, Science and Technology Program of Guangdong Province under grant 2014B010111007, and Guangxi High Level Innovation Team in Higher Education Institutions - Innovation Team of ASEAN Digital Cloud Big Data Security and Mining Technology.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Zhang, Z., Li, R., Li, Y., Gu, X. (2018). Rank-Integrated Topic Modeling: A General Framework. In: Cai, Y., Ishikawa, Y., Xu, J. (eds) Web and Big Data. APWeb-WAIM 2018. Lecture Notes in Computer Science(), vol 10987. Springer, Cham. https://doi.org/10.1007/978-3-319-96890-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-96890-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96889-6
Online ISBN: 978-3-319-96890-2
eBook Packages: Computer ScienceComputer Science (R0)