Abstract
Rich side information concerning users and items are valuable for collaborative filtering (CF) algorithms for recommendation. For example, rating score is often associated with a piece of review text, which is capable of providing valuable information to reveal the reasons why a user gives a certain rating. Moreover, the underlying community and group relationship buried in users and items are potentially useful for CF. In this paper, we develop a new model to tackle the CF problem which predicts user’s ratings on previously unrated items by effectively exploiting interactions among review texts as well as the hidden user community and item group information. We call this model CMR (co-clustering collaborative filtering model with review text). Specifically, we employ the co-clustering technique to model the user community and item group, and each community–group pair corresponds to a co-cluster, which is characterized by a rating distribution in exponential family and a topic distribution. We have conducted extensive experiments on 22 real-world datasets, and our proposed model CMR outperforms the state-of-the-art latent factor models. Furthermore, both the user’s preference and item profile are drifting over time. Dynamic modeling the temporal changes in user’s preference and item profiles are desirable for improving a recommendation system. We extend CMR and propose an enhanced model called TCMR to consider time information and exploit the temporal interactions among review texts and co-clusters of user communities and item groups. In this TCMR model, each community–group co-cluster is characterized by an additional beta distribution for time modeling. To evaluate our TCMR model, we have conducted another set of experiments on 22 larger datasets with wider time span. Our proposed model TCMR performs better than CMR and the standard time-aware recommendation model on the rating score prediction tasks. We also investigate the temporal effect on the user–item co-clusters.






Similar content being viewed by others
References
Agarwal D, Chen B-C (2010) flda: matrix factorization through latent dirichlet allocation. In: Proceedings of the 3rd ACM international conference on web search and data mining, pp 91–100
Agarwal D, Merugu S (2007) Predictive discrete latent factor models for large scale dyadic data. In: Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining, pp 26–35
Aizenberg N, Koren Y, Somekh O (2012) Build your own music recommender by modeling internet radio streams. In: Proceedings of the 21st international conference on world wide web, pp 1–10
Almahairi A, Kastner K, Cho K, Courville A (2015) Learning distributed representations from reviews for collaborative filtering. In: Proceedings of the 9th ACM conference on recommender systems, pp 147–154
Baltrunas L, Amatriain X (2009) Towards time-dependant recommendation based on implicit feedback. In: Workshop on context-aware recommender systems
Bao Y, Fang H, Zhang J (2014) Topicmf: Simultaneously exploiting ratings and reviews for recommendation. In: Proceedings of the 28th AAAI conference on artificial intelligence, pp 2–8
Bell R, Koren Y, Volinsky, C (2007) Modeling relationships at multiple scales to improve accuracy of large recommender systems. In: Proceedings of the 13rd ACM SIGKDD international conference on knowledge discovery and data mining, pp 95–104
Beutel A, Murray K, Faloutsos C, Smola A (2014) Cobafi: collaborative bayesian filtering. In: Proceedings of the 23rd international conference on world wide web, pp 97–108
Bishop CM (2006) Pattern recognition and machine learning. Information science and statistics. Springer, New York
Blei D, Ng A, Jordan M (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. Knowl Based Syst 46:109–132
Cai Y, Leung H-F, Li Q, Min H, Tang J, Li J (2014) Typicality-based collaborative filtering recommendation. IEEE Trans Knowl Data Eng 26(3):766–779
Chen C, Li D, Zhao Y, Lv Q, Shang L (2015) Wemarec: accurate and scalable recommendation through weighted and ensemble matrix approximation. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 303–312
Chen T, Han W, Wang H, Zhou Y, Xu B, Zang B (2007) Content recommendation system based on private dynamic user profile. In: IEEE international conference on machine learning and cybernetics, pp 2112–2118
Cheng Y, Church G (2000) Biclustering of expression data. In: Proceedings of the 8th international conference on intelligent systems for molecular biology, vol 8, pp 93–103
Chu W, Park S (2009) Personalized recommendation on dynamic content using predictive bilinear models. In: Proceedings of the 18th international conference on world wide web, pp 691–700
DeCoste D (2006) Collaborative prediction using ensembles of maximum margin matrix factorizations. In: Proceedings of the 23rd international conference on machine learning, pp 249–256
Dhillon I, Mallela S, Modha D (2003) Information-theoretic co-clustering. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, pp 89–98
Diao Q, Qiu M, Wu C-Y, Smola AJ, Jiang J, Wang C (2014) Jointly modeling aspects, ratings and sentiments for movie recommendation (jmars). In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 193–202
Ding Y, Li X (2005) Time weight collaborative filtering. In: Proceedings of the 14th ACM international conference on information and knowledge management, pp 485–492
Gaillard J, Renders J-M (2015) Time-sensitive collaborative filtering through adaptive matrix completion. In: Proceedings of the 37th European conference on information retrieval, pp 327–332
Ganu G, Elhadad Y, Marian A (2009) Beyond the stars: improving rating predictions using review text content. In: Proceedings of the 12th international workshop on the web and databases
Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 6:721–741
George T, Merugu S (2005) A scalable collaborative filtering framework based on co-clustering. In: Proceedings of the 5th IEEE international conference on data mining, pp 625–628
Geuens S (2015) Factorization machines for hybrid recommendation systems based on behavioral, product, and customer data. In: Proceedings of the 9th ACM conference on recommender systems, pp 379–382
Griffiths T, Steyvers M (2004) Finding scientific topics. In: Proceedings of the National academy of Sciences of the United States of America, pp 5228–5235
Guan L, Alam MH, Ryu W, Lee S (2016) A phrase-based model to discover hidden factors and hidden topics in recommender systems. In: IEEE international conference on big data and smart computing (BigComp), pp 337–340
Hartigan J (1972) Direct clustering of a data matrix. J Am Stat Assoc 67(337):123–129
He X, Chen T, Kan M, Chen X (2015) Trirank: review-aware explainable recommendation by modeling aspects. In: Proceedings of the 24th ACM international conference on information and knowledge management, pp 1661–1670
Heckel R, Vlachos M (2016) Interpretable recommendations via overlapping co-clusters. arXiv preprint arXiv:1604.02071
Hoffman M, Bach F, Blei D (2010) Online learning for latent dirichlet allocation. In: Proceedings of advances neural information processing systems, pp 856–864
Hong L, Doumith A, Davison B (2013) Co-factorization machines: modeling user interests and predicting individual decisions in twitter. In: Proceedings of the sixth ACM international conference on web search and data mining, pp 557–566
Koren Y (2008) Factorization meets the neighborhood: a multifaceted collaborative filtering model. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining, pp 426–434
Koren Y (2010) Collaborative filtering with temporal dynamics. Commun ACM 53(4):89–97
Koren Y, Bell R, Volinsky C (2009) Matrix factorization techniques for recommender systems. Computer 42(8):30–37
Lacoste-Julien S, Sha F, Jordan M (2008) Disclda: discriminative learning for dimensionality reduction and classification. In: Proceedings of advances neural information processing systems, vol 83, p 85
Lathia N, Hailes S, Capra L (2009) Temporal collaborative filtering with adaptive neighbourhoods. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 796–797
Loni B, Shi Y, Larson M, Hanjalic A (2014) Cross-domain collaborative filtering with factorization machines. In: 36th European conference on information retrieval, pp 656–661
McAuley J, Leskovec J (2013) From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In: Proceedings of the 22nd international conference on world wide web, pp 897–908
McAuley J, Leskovec J (2013) Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM conference on recommender systems, pp 165–172
McAuley J, Leskovec J, Jurafsky, D (2012) Learning attitudes and attributes from multi-aspect reviews. In: Proceedings of the 12th IEEE international conference on data mining, pp 1020–1025
Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In: Proceedings of KDD cup and workshop, PP 5–8
Qiang R, Liang F, Yang J (2013) Exploiting ranking factorization machines for microblog retrieval. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1783–1788
Rendle S (2012) Factorization machines with libfm. ACM Trans Intell Syst Technol (TIST) 3(3):57
Rennie J, Srebro N (2005) Fast maximum margin matrix factorization for collaborative prediction. In: Proceedings of the 22nd international conference on machine learning, pp 713–719
Salakhutdinov R, Andriy M (2007) Probabilistic matrix factorization. In: Proceedings of advances neural information processing systems, vol 1, pp 1–2
Salakhutdinov R, Andriy M (2008) Bayesian probabilistic matrix factorization using markov chain monte carlo. In: Proceedings of the 33rd international conference on machine learning, pp 880–887
Sarwar B, Karypis G, Konstan J, Riedl J (2002) Incremental singular value decomposition algorithms for highly scalable recommender systems. In: Proceedings of 5th international conference on computer and information science, pp 27–28
Shan H, Banerjee A (2008) Bayesian co-clustering. In: Proceedings of the 8th IEEE international conference on data mining, pp 530–539
Shi Y, Larson M, Hanjalic A (2014) Collaborative filtering beyond the user-item matrix: a survey of the state of the art and future challenges. ACM Comput Surv (CSUR) 47(1):3
Srebro N, Alon N, Jaakkola T (2004) Generalization error bounds for collaborative prediction with low-rank matrices. In: Proceedings of advances neural information processing systems
Srebro N, Jaakkola T (2003) Weighted low-rank approximations. In: Proceedings of the 20th international conference on machine learning, vol 3, pp 720–727
Srebro N, Rennie J, Jaakkola T (2004) Maximum-margin matrix factorization. In: Proceedings of advances neural information processing systems, vol 17, pp 1329–1336
Tan C, Chi E, Huffaker D, Kossinets G, Alexander S (2013) Instant foodie: predicting expert ratings from grassroots. In: Proceedings of the 22nd ACM international conference on information and knowledge management, pp 1127–1136
Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of the 46th annual meeting of the Association for Computational Linguistics, pp 308–316
Titov I, McDonald R (2008) Modeling online reviews with multi-grain topic models. In: Proceedings of the 17th international conference on world wide web, pp 111–120
Wang C, Blei D (2011) Collaborative topic modeling for recommending scientific articles. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 448–456
Wang X, McCallum A (2006) Topics over time: a non-markov continuous-time model of topical trends. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining, pp 424–433
Weimer M, Karatzoglou A, Le Q, Smola A (2007) Maximum margin matrix factorization for collaborative ranking. In: Proceedings of advances neural information processing systems, pp 1593–1600
Weimer M, Karatzoglou A, Smola A (2008) Improving maximum margin matrix factorization. Mach Learn 72(3):263–276
Weston J, Bengio S, Usunier N (2011) Wsabie: Scaling up to large vocabulary image annotation. In: Proceedings of the 22nd international joint conference on artificial intelligence, pp 2764–2770
Xiang L, Yuan Q, Zhao S, Chen L, Zhang X, Yang Q, Sun J (2010) Temporal recommendation on graphs via long-and short-term preference fusion. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 723–732
Xin X, Liu Z, Lin C, Huang H, Wei X, Guo P (2015) Cross-domain collaborative filtering with review text. In: Proceedings of the 24th international joint conference on artificial intelligence, pp 1827–1833
Xu Y, Lam W, Lin T (2014) Collaborative filtering incorporating review text and co-clusters of hidden user communities and item groups. In: Proceedings of the 23rd ACM international conference on information and knowledge management, pp 251–260
Yang S, Long B, Alexander S, Zha H, Zheng Z (2011) Collaborative competitive filtering: learning recommender using context of user choice. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 295–304
Yin H, Cui B, Chen L, Hu Z, Zhou X (2015) Dynamic user modeling in social media systems. ACM Trans Inf Syst (TOIS) 33(3):10
Yu J, Shen Y, Yang Z (2014) Topic-stg: Extending the session-based temporal graph approach for personalized tweet recommendation. In: Proceedings of the companion publication of the 23rd international conference on world wide web companion, pp 413–414
Yu K, Zhu S, Lafferty J, Gong Y (2009) Fast nonparametric matrix factorization for large-scale collaborative filtering. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, pp 211–218
Zhang T, Iyengar VS (2002) Recommender systems using linear classifiers. J Mach Learn Res 2:313–334
Zhang Y, Lai G, Zhang M, Zhang Y, Liu Y, Ma S (2014) Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. In: Proceedings of the 37th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 83–92
Zhang Y, Zhang M, Zhang Y, Lai G, Liu Y, Zhang H, Ma S (2015) Daily-aware personalized recommendation based on feature-level time series analysis. In: Proceedings of the 24th international conference on world wide web, pp 1373–1383
Zimdars A, Chickering DM, Meek C (2001) Using temporal data for making recommendations. In: Proceedings of the 7th conference on uncertainty in artificial intelligence, pp 580–588
Acknowledgements
The work described in this paper is substantially supported by grants from the Research Grant Council of the Hong Kong Special Administrative Region, China (Project Code: 14203414) and the Direct Grant of the Faculty of Engineering, CUHK (Project Code: 4055034). This work is also affiliated with the CUHK MoE-Microsoft Key Laboratory of Human-centric Computing and Interface Technologies.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xu, Y., Yu, Q., Lam, W. et al. Exploiting interactions of review text, hidden user communities and item groups, and time for collaborative filtering. Knowl Inf Syst 52, 221–254 (2017). https://doi.org/10.1007/s10115-016-1005-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-016-1005-1