ABSTRACT
The collaborative filtering approach to recommender systems predicts user preferences for products or services by learning past user-item relationships. In this work, we propose novel algorithms for predicting user ratings of items by integrating complementary models that focus on patterns at different scales. At a local scale, we use a neighborhood-based technique that infers ratings from observed ratings by similar users or of similar items. Unlike previous local approaches, our method is based on a formal model that accounts for interactions within the neighborhood, leading to improved estimation quality. At a higher, regional, scale, we use SVD-like matrix factorization for recovering the major structural patterns in the user-item rating matrix. Unlike previous approaches that require imputations in order to fill in the unknown matrix entries, our new iterative algorithm avoids imputation. Because the models involve estimation of millions, or even billions, of parameters, shrinkage of estimated values to account for sampling variability proves crucial to prevent overfitting. Both the local and the regional approaches, and in particular their combination through a unifying model, compare favorably with other approaches and deliver substantially better results than the commercial Netflix Cinematch recommender system on a large publicly available data set.
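The regional-scale idea above (fitting a low-rank factorization to the observed ratings only, with estimates shrunk toward zero) can be illustrated with a minimal sketch. This is not the iterative algorithm proposed in the paper: it swaps in plain stochastic gradient descent with an L2 penalty, a common stand-in for shrinkage. The function names, the toy `RATINGS` data, and all hyperparameters (`rank`, `lam`, `lr`, `epochs`) are assumptions made for illustration.

```python
import math
import random

# Hypothetical toy data: (user, item, rating) triples for 4 users and 4 items.
# Pairs that do not appear are unknown and are never filled in (no imputation).
RATINGS = [
    (0, 0, 5), (0, 1, 4), (0, 2, 1),
    (1, 0, 4), (1, 1, 5), (1, 3, 1),
    (2, 0, 1), (2, 2, 5), (2, 3, 4),
    (3, 1, 2), (3, 2, 4), (3, 3, 5),
]


def factorize(ratings, n_users, n_items, rank=2, lam=0.05, lr=0.02,
              epochs=1000, seed=0):
    """Fit user and item factors by SGD over the observed ratings only.

    The penalty `lam` pulls every factor toward zero, which plays the
    role of shrinkage against overfitting in this sketch.
    """
    rng = random.Random(seed)
    user_f = [[rng.uniform(-0.1, 0.1) for _ in range(rank)]
              for _ in range(n_users)]
    item_f = [[rng.uniform(-0.1, 0.1) for _ in range(rank)]
              for _ in range(n_items)]
    mean = sum(r for _, _, r in ratings) / len(ratings)

    for _ in range(epochs):
        for u, i, r in ratings:
            pred = mean + sum(a * b for a, b in zip(user_f[u], item_f[i]))
            err = r - pred
            for f in range(rank):
                pu, qi = user_f[u][f], item_f[i][f]
                # Gradient step on squared error plus the L2 penalty.
                user_f[u][f] += lr * (err * qi - lam * pu)
                item_f[i][f] += lr * (err * pu - lam * qi)
    return mean, user_f, item_f


def predict(model, u, i):
    """Predicted rating: global mean plus the user-item factor interaction."""
    mean, user_f, item_f = model
    return mean + sum(a * b for a, b in zip(user_f[u], item_f[i]))


def rmse(model, ratings):
    """Root mean squared error of the model on the given triples."""
    sq = sum((r - predict(model, u, i)) ** 2 for u, i, r in ratings)
    return math.sqrt(sq / len(ratings))


def baseline_rmse(ratings):
    """RMSE of always predicting the global mean rating."""
    mean = sum(r for _, _, r in ratings) / len(ratings)
    return math.sqrt(sum((r - mean) ** 2 for _, _, r in ratings) / len(ratings))


if __name__ == "__main__":
    model = factorize(RATINGS, n_users=4, n_items=4)
    print("baseline RMSE:", baseline_rmse(RATINGS))
    print("factor model RMSE:", rmse(model, RATINGS))
    print("prediction for unseen (user 0, item 3):", predict(model, 0, 3))
```

Because the loop visits only the triples in `RATINGS`, the unknown entries never enter the objective. Raising `lam` shrinks the factors more strongly, trading training fit for robustness when each user or item has few ratings.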
Index Terms
- Modeling relationships at multiple scales to improve accuracy of large recommender systems