DOI: 10.1145/1571941.1571979
research-article

Fast nonparametric matrix factorization for large-scale collaborative filtering

Published: 19 July 2009

ABSTRACT

As online user data grows rapidly, it becomes challenging to develop preference learning algorithms that are flexible enough in modeling yet affordable in computation. In this paper we develop nonparametric matrix factorization methods by allowing the latent factors of two low-rank matrix factorization methods, the singular value decomposition (SVD) and probabilistic principal component analysis (pPCA), to be data-driven, with the dimensionality increasing with data size. We show that the formulations of the two nonparametric models are very similar and that their optimizations share similar procedures. Compared to traditional parametric low-rank methods, nonparametric models are appealing for their flexibility in modeling complex data dependencies. However, this modeling advantage comes at a computational price: the models are highly challenging to scale to large problems, which hampers their use in applications such as collaborative filtering. We introduce novel optimization algorithms that are simple to implement and that make learning both nonparametric matrix factorization models highly efficient on large-scale problems. Our experiments on EachMovie and Netflix, the two largest public benchmarks to date, demonstrate that the nonparametric models predict user ratings more accurately than previous state-of-the-art parametric matrix factorization models, and that they train in comparable time or sometimes even faster.
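For orientation, the following is a minimal sketch of the kind of parametric low-rank baseline the abstract contrasts against: a fixed-rank factorization of a partially observed rating matrix, fitted by alternating ridge regressions. It is not the paper's nonparametric algorithm, whose details the abstract does not give. The function names `als_factorize` and `rmse`, the rank, and the regularization weight are all illustrative assumptions.

```python
import numpy as np

def als_factorize(R, mask, rank=2, reg=0.1, iters=30, seed=0):
    """Fit R ~ U @ V.T on observed entries only (mask is a boolean array).

    Hypothetical helper: a fixed-rank (parametric) baseline, not the
    nonparametric method described in the abstract.
    """
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = 0.1 * rng.standard_normal((n_users, rank))
    V = 0.1 * rng.standard_normal((n_items, rank))
    ridge = reg * np.eye(rank)
    for _ in range(iters):
        # Update each user's factors from the items that user has rated.
        for i in range(n_users):
            obs = mask[i]
            if obs.any():
                Vo = V[obs]
                U[i] = np.linalg.solve(Vo.T @ Vo + ridge, Vo.T @ R[i, obs])
        # Update each item's factors from the users who rated that item.
        for j in range(n_items):
            obs = mask[:, j]
            if obs.any():
                Uo = U[obs]
                V[j] = np.linalg.solve(Uo.T @ Uo + ridge, Uo.T @ R[obs, j])
    return U, V

def rmse(R, mask, U, V):
    """Root-mean-square error of U @ V.T on the observed entries."""
    err = (R - U @ V.T)[mask]
    return float(np.sqrt(np.mean(err ** 2)))
```

In this sketch the rank is chosen up front and stays fixed, which is the restriction the nonparametric models lift by letting the latent dimensionality grow with the data.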

Published in

SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
July 2009
896 pages
ISBN: 9781605584836
DOI: 10.1145/1571941

Copyright © 2009 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery, New York, NY, United States

Acceptance Rates

Overall acceptance rate: 792 of 3,983 submissions (20%)