skip to main content
10.1145/1281192.1281206acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Modeling relationships at multiple scales to improve accuracy of large recommender systems

Published:12 August 2007Publication History

ABSTRACT

The collaborative filtering approach to recommender systems predicts user preferences for products or services by learning past user-item relationships. In this work, we propose novel algorithms for predicting user ratings of items by integrating complementary models that focus on patterns at different scales. At a local scale, we use a neighborhood-based technique that infers ratings from observed ratings by similar users or of similar items. Unlike previous local approaches, our method is based on a formal model that accounts for interactions within the neighborhood, leading to improved estimation quality. At a higher, regional, scale, we use SVD-like matrix factorization for recovering the major structural patterns in the user-item rating matrix. Unlike previous approaches that require imputations in order to fill in the unknown matrix entries, our new iterative algorithm avoids imputation. Because the models involve estimation of millions, or even billions, of parameters, shrinkage of estimated values to account for sampling variability proves crucial to prevent overfitting. Both the local and the regional approaches, and in particular their combination through a unifying model, compare favorably with other approaches and deliver substantially better results than the commercial Netflix Cinematch recommender system on a large publicly available data set.

References

  1. G. Adomavicius and A. Tuzhilin, "Towards the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions", IEEE Transactions on Knowledge and Data Engineering 17 (2005), 634--749. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Bell and Y. Koren, "Improved Neighborhood-based Collaborative Filtering", submitted, 2007.Google ScholarGoogle Scholar
  3. S. Deerwester, S. Dumais, G. W. Furnas, T. K. Landauer and R. Harshman, "Indexing by Latent Semantic Analysis", Journal of the Society for Information Science 41 (1990), 391--407.Google ScholarGoogle ScholarCross RefCross Ref
  4. D. Goldberg, D. Nichols, B. M. Oki and D. Terry, "Using Collaborative Filtering to Weave an Information Tapestry", Communications of the ACM 35 (1992), 61--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. K. Goldberg, T. Roeder, D. Gupta and C. Perkins, "Eigentaste: A Constant Time Collaborative Filtering Algorithm", Information Retrieval 4 (2001), 133--151. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. H. GolubandC. F. VanLoan, Matrix Computations, Johns Hopkins University Press, 1996.Google ScholarGoogle Scholar
  7. J. L. Herlocker, J. A. Konstan, A. Borchers and John Riedl, "An Algorithmic Framework for Performing Collaborative Filtering", Proc. 22nd ACM SIGIR Conference on Information Retrieval, pp. 230--237, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Kim and B. Yum, "Collaborative Filtering Based on Iterative Principal Component Analysis", Expert Systems with Applications 28 (2005), 823--830. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Konstan, B. Miller, D. Maltz, J. Herlocker, L. Gordon and J. Riedl, "GroupLens: Applying Collaborative Filtering to Usenet News", Communications of the ACM 40 (1997), 77--87, www.grouplens.org. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Netflix prize - www.netflixprize.com.Google ScholarGoogle Scholar
  11. G. Linden, B. Smith and J. York, "Amazon.com Recommendations: Item-to-item Collaborative Filtering", IEEE Internet Computing 7 (2003), 76--80. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Nocedal and S. Wright, Numerical Optimization, Springer (1999).Google ScholarGoogle Scholar
  13. S. Roweis, "EM Algorithms for PCA and SPCA", Advances in Neural Information Processing Systems 10, pp. 626--632, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. B. M. Sarwar, G. Karypis, J. A. Konstan, and J. Riedl, "Application of Dimensionality Reduction in Recommender System - A Case Study", WEBKDD'2000.Google ScholarGoogle Scholar
  15. B. Sarwar, G. Karypis, J. Konstan and J. Riedl, "Item-based Collaborative Filtering Recommendation Algorithms", Proc. 10th International Conference on the World Wide Web, pp. 285--295, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Tibshirani, "Regression Shrinkage and Selection via the Lasso", Journal of the Royal Statistical Society B 58 (1996).Google ScholarGoogle Scholar
  17. J. Wang, A. P. de Vries and M. J. T. Reinders,"Unifying User-based and Item-based Collaborative Filtering Approaches by Similarity Fusion", Proc. 29th ACM SIGIR Conference on Information Retrieval, pp. 501--508, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Modeling relationships at multiple scales to improve accuracy of large recommender systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2007
      1080 pages
      ISBN:9781595936097
      DOI:10.1145/1281192

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 August 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      KDD '07 Paper Acceptance Rate111of573submissions,19%Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader