ABSTRACT
Combining correlated information from multiple contexts can significantly improve predictive accuracy in recommender problems. Such information from multiple contexts is often available in the form of several incomplete matrices spanning a set of entities like users, items, features, and so on. Existing methods simultaneously factorize these matrices by sharing a single set of factors for entities across all contexts. We show that such a strategy may introduce significant bias in estimates and propose a new model that ameliorates this issue by positing local, context-specific factors for entities. To avoid over-fitting in contexts with sparse data, the local factors are connected through a shared global model. This sharing of parameters allows information to flow across contexts through multivariate regressions among local factors, instead of enforcing exactly the same factors for an entity, everywhere. Model fitting is done in an EM framework, we show that the E-step can be fitted through a fast multi-resolution Kalman filter algorithm that ensures scalability. Experiments on benchmark and real-world Yahoo! datasets clearly illustrate the usefulness of our approach. Our model significantly improves predictive accuracy, especially in cold-start scenarios.
- D. Agarwal and B.-C. Chen. Regression-based latent factor models. In KDD, 2009. Google ScholarDigital Library
- D. Agarwal and B.-C. Chen, et al. Online models for content optimization. In NIPS, 2008.Google Scholar
- R. Ando and T. Zhang. A high-performance semi-supervised learning method for text chunking. In ACL, 2005. Google ScholarDigital Library
- A. Argyriou, T. Evgeniou, and M. Pontil. Multi-task feature learning. In NIPS, 2007.Google ScholarDigital Library
- A. Argyriou, C. Micchelli, M. Pontil, and Y. Ying. A spectral regularization framework for multi-task structure learning. NIPS, 2008.Google Scholar
- S. Bickel, M. Brückner, and T. Scheffer. Discriminative learning for differing training and test distributions. In ICML, 2007. Google ScholarDigital Library
- D. Blackwell. Conditional expectation and unbiased sequential estimation. Annals of Math. Stat., 1947.Google ScholarCross Ref
- J. Blitzer, K. Crammer, A. Kulesza, F. Pereira, and J. Wortman. Learning bounds for domain adaptation. NIPS, 2008.Google ScholarDigital Library
- J. Blitzer, R. McDonald, and F. Pereira. Domain adaptation with structural correspondence learning. In EMNLP, 2006. Google ScholarDigital Library
- A. Blum and T. Mitchell. Combining labeled and unlabeled data with co-training. In COLT, 1998. Google ScholarDigital Library
- E. Bonilla, K. Chai, and C. Williams. Multi-task gaussian process prediction. NIPS, 2007.Google ScholarDigital Library
- R. Carroll, D. Ruppert, L. Stefanski, and C. Crainiceanu. Measurement Error in Nonlinear Models. Chapman and Hall, 2008.Google Scholar
- K. C. Chou, A. S. Willsky, and R. Nikoukhah. Multiscale systems, Kalman filters, and Ricatti equations. IEEE Trans. on Automatic Control, 1994.Google Scholar
- W. Dai, G. Xue, Q. Yang, and Y. Yu. Co-clustering based classification for out-of-domain documents. In KDD, 2007. Google ScholarDigital Library
- W. Dai, Q. Yang, G. Xue, and Y. Yu. Boosting for transfer learning. In ICML, 2007. Google ScholarDigital Library
- H. Daumé. Cross-task knowledge-constrained self training. In EMNLP, 2008. Google ScholarDigital Library
- H. Daume III and D. Marcu. Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research, 2006. Google ScholarDigital Library
- J. Davis and P. Domingos. Deep transfer via second-order markov logic. In AAAI Workshop on Transfer Learning for Complex Tasks, 2008.Google Scholar
- A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. J. of the Royal Statistical Society, Series B, 1977.Google Scholar
- T. Evgeniou and M. Pontil. Regularized multi-task learning. In KDD, 2004. Google ScholarDigital Library
- J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl. An algorithmic framework for performing collaborative filtering. In SIGIR, 1999. Google ScholarDigital Library
- H. Huang and N. Cressie. Fast spatial prediction of global processes from satellite data. SIAM J. on Scientific Computing, 2000.Google Scholar
- H. Huang and N. Cressie. Multiscale graphical modeling in space: Applications to command and control. In Spatial Statistics Workshop, 2000.Google Scholar
- J. Huang, A. Smola, A. Gretton, K. Borgwardt, and B. Scholkopf. Correcting sample selection bias by unlabeled data. NIPS, 2007.Google ScholarDigital Library
- T. Jaakkola and M. Jordan. Bayesian logistic regression: a variational approach. Statistics and Computing, 2000.Google Scholar
- J. Jiang and C. Zhai. Instance weighting for domain adaptation in NLP. In ACL, 2007.Google Scholar
- H. Kaji and Y. Morimoto. Unsupervised word sense disambiguation using bilingual comparable corpora. In COLIN, 2002. Google ScholarDigital Library
- Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 2009. Google ScholarDigital Library
- N. Kushmerick. Learning to remove internet advertisements. In Int. Conf. Autonomous Agents, 1999. Google ScholarDigital Library
- N. Lawrence and J. Platt. Learning to learn with the informative vector machine. In ICML, 2004. Google ScholarDigital Library
- S. Lee, V. Chatalbashev, D. Vickrey, and D. Koller. Learning a meta-level prior for feature relevance from multiple related tasks. In ICML, 2007. Google ScholarDigital Library
- X. Liao, Y. Xue, and L. Carin. Logistic regression with an auxiliary data source. In ICML, 2005. Google ScholarDigital Library
- X. Ling, W. Dai, G.-R. Xue, Q. Yang, and Y. Yu. Spectral domain-transfer learning. In KDD, 2008. Google ScholarDigital Library
- P. Massa and P. Avesani. Trust metrics in recommender systems. In Computing with Social Trust, 2009.Google ScholarCross Ref
- L. Mihalkova, T. Huynh, and R. Mooney. Mapping and revising markov logic networks for transfer learning. In AAAI, 2007. Google ScholarDigital Library
- L. Mihalkova and R. Mooney. Transfer learning by mapping with minimal target data. In AAAI Workshop on Transfer Learning for Complex Tasks, 2008.Google Scholar
- S. J. Pan and Q. Yang. A survey on transfer learning. Technical Report HKUST-CS08-08, Hong Kong University of Science and Technology, 2008.Google Scholar
- R. Raina, A. Battle, H. Lee, B. Packer, and A. Ng. Self-taught learning: Transfer learning from unlabeled data. In ICML, 2007. Google ScholarDigital Library
- A. Schwaighofer, V. Tresp, and K. Yu. Learning Gaussian process kernels via hierarchical Bayes. NIPS, 2005.Google Scholar
- A. P. Singh and G. J. Gordon. Relational learning via collective matrix factorization. In KDD, 2008. Google ScholarDigital Library
- D. H. Stern, R. Herbrich, and T. Graepel. Matchbox: large scale online bayesian recommendations. In WWW, 2009. Google ScholarDigital Library
- X. Su and T. M. Khoshgoftaar. A survey of collaborative filtering techniques. Adv. in Artif. Intell., 2009. Google ScholarDigital Library
- M. Sugiyama, S. Nakajima, H. Kashima, P. von Bunau, and M. Kawanabe. Direct importance estimation with model selection and its application to covariate shift adaptation. NIPS, 2008.Google ScholarDigital Library
- K. Yu and V. Tresp. Learning to learn and collaborative filtering. In In NIPS Workshop on Inductive Transfer: 10 Years Later, 2005.Google Scholar
- S. Yu, K. Yu, V. Tresp, and H.-P. Kriegel. Collaborative ordinal regression. In ICML, 2006. Google ScholarDigital Library
Index Terms
- Localized factor models for multi-context recommendation
Recommendations
Item recommendation in collaborative tagging systems via heuristic data fusion
Collaborative tagging systems have been popular on the Web. However, information overload results in the increasing need for recommender services from users, and thus item recommendation has been one of the key issues in such systems. In this paper, we ...
User-Specific Feature-Based Similarity Models for Top-n Recommendation of New Items
Survey Paper, Regular Papers and Special Section on Participatory Sensing and Crowd IntelligenceRecommending new items for suitable users is an important yet challenging problem due to the lack of preference history for the new items. Noncollaborative user modeling techniques that rely on the item features can be used to recommend new items. ...
Context Recommendation Using Multi-label Classification
WI-IAT '14: Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Volume 02Context-aware recommender systems (CARS) are extensions of traditional recommenders that also take into account contextual condition of a user to whom a recommendation is made. The recommendation problem is, however, still focused on recommending a set ...
Comments