An empirical study on user-topic rating based collaborative filtering methods

World Wide Web

Abstract

User-based collaborative filtering (CF) has been applied successfully in recommender systems for years. Its main idea is to discover communities of users who share similar interests, so the measurement of user similarity is its foundation. However, existing user-based CF methods suffer from data sparsity: the user-item matrix is often too sparse to yield good recommendations. One way to alleviate this problem is to bring new data sources into user-based CF. Thanks to the rapid development of social annotation systems, we turn to tags as such a source. User-topic rating based CF extracts topics from tags using a topic model and then computes the similarity between users by measuring their preferences for those topics. In this paper, we compare three user-topic rating based CF methods, which use PLSA, Hierarchical Clustering and LDA respectively. All three methods calculate user-topic preferences from the users' item ratings and the topic weights. We conduct the experiments on the MovieLens dataset. The experimental results show that the LDA based and Hierarchical Clustering based user-topic rating CF methods outperform traditional user-based CF in recommendation accuracy, while the PLSA based method performs worse than traditional user-based CF.
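As an illustration of the idea, the following is a minimal Python sketch of how user-topic preferences and user similarity might be computed. The abstract does not give the formulas, so the weighted-average preference and the cosine similarity used here are assumptions, as are all names (`user_topic_preferences`, `cosine`) and the toy data. In practice the item-topic weights would come from PLSA, Hierarchical Clustering or LDA applied to tags.

```python
import math


def user_topic_preferences(ratings, item_topics, n_topics):
    """Assumed formula: a user's preference for topic k is the average of
    the user's item ratings, weighted by each item's weight on topic k.

    ratings:     {user: {item: rating}}
    item_topics: {item: [weight for topic 0, weight for topic 1, ...]}
    """
    prefs = {}
    for user, rated in ratings.items():
        num = [0.0] * n_topics  # sum of weight * rating per topic
        den = [0.0] * n_topics  # sum of weights per topic
        for item, rating in rated.items():
            weights = item_topics.get(item, [0.0] * n_topics)
            for k, w in enumerate(weights):
                num[k] += w * rating
                den[k] += w
        prefs[user] = [n / d if d > 0 else 0.0 for n, d in zip(num, den)]
    return prefs


def cosine(u, v):
    """Cosine similarity between two topic-preference vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


# Hypothetical toy data: two topics, four movies, three users.
item_topics = {
    "m1": [1.0, 0.0],
    "m2": [0.0, 1.0],
    "m3": [1.0, 0.0],
    "m4": [0.0, 1.0],
}
ratings = {
    "alice": {"m1": 5, "m2": 1},
    "bob": {"m3": 5, "m4": 1},
    "carol": {"m1": 1, "m2": 5},
}

prefs = user_topic_preferences(ratings, item_topics, n_topics=2)
sim_alice_bob = cosine(prefs["alice"], prefs["bob"])
sim_alice_carol = cosine(prefs["alice"], prefs["carol"])
```

In this toy data alice and bob have rated no item in common, so a similarity computed over co-rated items would be undefined, yet their topic-level preferences coincide. This is the sparsity problem that the topic-level representation is meant to ease.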

Notes

  1. http://www.amazon.com

  2. http://www.netflix.com

  3. http://www.grouplens.org/datasets/movielens/

Acknowledgments

This work is supported in part by the National Key Research and Development Program of China (2016YFC0800805), the National Basic Research Program of China (973 Program, 2014CB340702), and the National Natural Science Foundation of China (Grant No. 61170067).

Author information

Corresponding author

Correspondence to Zhenyu Chen.

Additional information

This paper is an extension of the paper “Comparing Collaborative Filtering Methods Based on User-Topic Ratings”, which was originally published in the proceedings of SEKE 2013.

About this article

Cite this article

He, T., Chen, Z., Liu, J. et al. An empirical study on user-topic rating based collaborative filtering methods. World Wide Web 20, 815–829 (2017). https://doi.org/10.1007/s11280-016-0412-2

  • DOI: https://doi.org/10.1007/s11280-016-0412-2
