Elsevier

Neurocomputing

Volume 177, 12 February 2016, Pages 447-453
Neurocomputing

A survey of transfer learning for collaborative recommendation with auxiliary data

https://doi.org/10.1016/j.neucom.2015.11.059Get rights and content

Abstract

Intelligent recommendation technology has been playing an increasingly important role in various industry applications such as e-commerce product promotion and Internet advertisement display. Besides user feedbacks (e.g., numerical ratings) on items as usually exploited by some typical recommendation algorithms, there are often some additional data such as users׳ social circles and other behaviors. Such auxiliary data are usually related to user preferences on items behind numerical ratings. Collaborative recommendation with auxiliary data (CRAD) aims to leverage such additional information so as to improve personalized services. It has received much attention from both researchers and practitioners.

Transfer learning (TL) is proposed to extract and transfer knowledge from some auxiliary data in order to assist the learning task on the target data. In this survey, we consider the CRAD problem from a transfer learning view, especially on how to enable knowledge transfer from some auxiliary data, and discuss the representative transfer learning techniques. Firstly, we give a formal definition of transfer learning for CRAD (TL-CRAD). Secondly, we extend the existing categorization of TL techniques with three knowledge transfer strategies. Thirdly, we propose a novel and generic knowledge transfer framework for TL-CRAD. Fourthly, we describe some representative works of each specific knowledge transfer strategy in detail, which are expected to inspire further works. Finally, we conclude the survey with some summarized discussions and several future directions.

Introduction

Intelligent recommendation technology [1], [4], [18], [31], [45], [48] has been a standard component embedded in many Internet systems such as e-commerce and advertisement systems to provide personalized services. There are two main approaches widely used in personalized recommendation for an active user, i.e., content-based recommendation [3] and collaborative recommendation [14]. Content-based methods promote an item based on the relevance between a candidate item and the active user׳s consumed items, while collaborative recommendation techniques focus on collective intelligence and exploit the community׳s data so as to recommend preferred items from users with similar tastes. However, both methods are limited to users׳ feedbacks of explicit scores or implicit examinations, which may result in a challenging problem, data sparsity, due to the lack of users׳ behaviors.

Fortunately, there are often some additionally available data besides the users׳ feedbacks (e.g., numerical ratings) in a recommender system. There are at least four types of auxiliary data as shown in Table 1, such as content information [52], [56], time contextual information [23], [36], social or information networks [21], [49], [54] and additional feedbacks [19], [29], [39]. These auxiliary data have the potential to help relieve the aforementioned sparsity problem and thus improve the recommendation performance. In this survey, we study on how to exploit different types of auxiliary data in collaborative recommendation, which is coined as collaborative recommendation with auxiliary data (CRAD).

Specifically, we study the CRAD problem from an inductive transfer learning [37] view (instead of unsupervised or transductive transfer learning views [2]), in which we consider the users׳ feedback data as our target data or supervised information, and all the other additional information as our auxiliary data. In particular, we focus on how to enable knowledge transfer from some auxiliary data to the target data in order to address the aforementioned sparsity challenge. We discuss some representative transfer learning techniques, aiming to answer the fundamental question of transfer learning [37], i.e., “how to transfer”. With this focus in our survey, we extend previous categorization of transfer learning techniques in collaborative filtering [38], [43], and answer the above question from two dimensions, including knowledge transfer algorithm styles (i.e., adaptive, collective and integrative knowledge transfer) and knowledge transfer strategies (i.e., prediction rule, regularization and constraint). Then, we propose a novel and generic knowledge transfer framework and describe some representative works in each category to answer the “how to transfer” question in detail, in particular the main idea that may be generalized to other applications. Finally, we conclude the survey with some summarized discussions and several exciting future directions.

Section snippets

Problem definition

We have a target data set and an auxiliary data set. In the target data set, we have some feedbacks from n users and m items, which is usually represented as a rating matrix R=[rui]n×m and an indicator matrix Y{0,1}n×m, where yui=1 means that the feedback rui is observed. In the auxiliary data set, we have some additional data such as content, context, network and feedback information as shown in Table 1. Our goal is to predict the unobserved feedbacks in R by transferring knowledge from the

Adaptive knowledge transfer

Adaptive knowledge transfer aims to adapt the knowledge extracted from an auxiliary data domain to a target data domain. This is a directed knowledge transfer approach similar to traditional domain adaptation methods. In this section, we describe two adaptive knowledge transfer strategies as instantiated from Eq. (1), including (i) transfer via regularization, minΘE(Θ|R)+R(Θ|K), and (ii) transfer via constraint, minΘE(Θ|R),s.t.ΘC(K).

Collective knowledge transfer

Collective knowledge transfer usually jointly learns the shared knowledge and unshared effect of the target data and the auxiliary data simultaneously, which is a bi-directed knowledge transfer approach with richer interactions similar to multi-task learning algorithms. We describe some representative works of collective knowledge transfer via constraint on model parameters, minΘ,KE(Θ|R)+R(Θ)+E(K|A)+R(K),s.t.ΘC(K), which is also an instantiation of Eq. (1). Note that the model parameter Θ and

Integrative knowledge transfer

Integrative knowledge transfer incorporates the raw auxiliary data as known knowledge into the learning task on the target data. It can be considered as an embedded knowledge transfer approach similar to feature engineering, information fusion and data integration methods. Mathematically speaking, we can instantiate the generic framework in Eq. (1), and have (i) transfer via prediction rule, minΘE(Θ|R,A)+R(Θ), (ii) transfer via regularization, minΘE(Θ|R)+R(Θ|A), and (iii) transfer via

Discussions

We summarize some representative works of transfer learning for collaborative recommendation with auxiliary data (TL-CRAD) in Table 2. We can see that integrative knowledge transfer via prediction rule and collective knowledge transfer via constraint have recently received more attention, which are also the state-of-the-art TL-CRAD algorithms w.r.t. recommendation accuracy in corresponding problem settings. The interaction between auxiliary data and target data usually becomes richer from

Acknowledgment

I would like to thank Prof. Qiang Yang for advice and comments, Dr. Bin Li for linguistic improvement and helpful discussions, the editors and reviewers for constructive suggestions, and the support of Natural Science Foundation of Guangdong Province No. 2014A030310268, National Natural Science Foundation of China Nos. 61502307, 61170077, 61472258 and Natural Science Foundation of SZU No. 201436.

Weike Pan received the Ph.D. degree in Computer Science and Engineering from the Hong Kong University of Science and Technology, Kowloon, Hong Kong, China, in 2012. He is currently a Lecturer (research oriented) with the College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China. His research interests include transfer learning, recommender systems, and statistical machine learning.

References (61)

  • Chumki Basu, Haym Hirsh, William Cohen, Recommendation as classification: using social and content-based information in...
  • Stephen Boyd et al.

    Convex Optimization

    (2004)
  • Bin Cao, Nathan Nan Liu, Qiang Yang, Transfer learning for collective link prediction in multiple heterogenous domains,...
  • Sotirios Chatzis, Nonparametric Bayesian multitask collaborative filtering, in: Proceedings of the 22nd ACM...
  • Liang Chen et al.

    Analysis and detection of fake views in online video services

    ACM Trans. Multimed. Comput. Commun. Appl.

    (2015)
  • Liang Chen et al.

    Smart streaming for online video services

    IEEE Trans. Multimed.

    (2015)
  • Tianqi Chen et al.

    Svdfeaturea toolkit for feature-based collaborative filtering

    J. Mach. Learn. Res.

    (2012)
  • Wei Chen, Wynne Hsu, Mong Li Lee, Making recommendations from multiple domains, in: Proceedings of the 19th ACM SIGKDD...
  • Ignacio Fernandez-Tobias, Ivan Cantador, Marius Kaminskas, Francesco Ricci, Cross-domain recommender systems: a survey...
  • Sheng Gao, Hao Luo, Da Chen, Shantao Li, Patrick Gallinari, Jun Guo, Cross-domain recommendation via cluster-level...
  • David Goldberg et al.

    Using collaborative filtering to weave an information tapestry

    Commun. ACM

    (1992)
  • Robert E. Haskell, Transfer of Learning: Cognition, Instruction, and Reasoning, Educational Psychology Series. Academic...
  • Niklas Jakob, Stefan Hagen Weber, Mark Christoph Müller, Iryna Gurevych, Beyond the stars: exploiting free-text user...
  • Mohsen Jamali, Martin Ester, A matrix factorization technique with trust propagation for recommendation in social...
  • Dietmar Jannach et al.

    Recommender Systems: An Introduction

    (2010)
  • Gawesh Jawaheer et al.

    Modeling user preferences in recommender systemsa classification framework for explicit and implicit user feedback

    ACM Trans. Interact. Intell. Syst.

    (2014)
  • Meng Jiang et al.

    Social recommendation with cross-domain transferable knowledge

    IEEE Trans. Knowl. Data Eng.

    (2015)
  • Meng Jiang, Peng Cui, Fei Wang, Qiang Yang, Wenwu Zhu, Shiqiang Yang, Social recommendation across multiple relational...
  • Wolf Kienzle, Kumar Chellapilla, Personalized handwriting recognition via biased regularization, in: Proceedings of the...
  • Yehuda Koren, Collaborative filtering with temporal dynamics, in: Proceedings of the 15th ACM SIGKDD International...
  • Cited by (119)

    • CNNRec: Convolutional Neural Network based recommender systems - A survey

      2024, Engineering Applications of Artificial Intelligence
    • BAR: Behavior-aware recommendation for sequential heterogeneous one-class collaborative filtering

      2022, Information Sciences
      Citation Excerpt :

      Multi-feedback Bayesian personalized ranking (MF-BPR) [31] considers that different feedback reflect users’ preferences from different levels, and proposes a non-uniform sampler which places emphasis on high-level items to sample some positive items, while the negative items are sampled from a lower level than that of the positive items. To tackle the sparsity of the target feedback in OCCF, another way to make use of the dense examination data is to use transfer learning [32], which shares some knowledge from the auxiliary feedback with the task of modeling the target feedback. TJSL [33] introduces a new term that learns a similarity between a candidate item and an examined item in the prediction rule of FISM [15] to alleviate the sparsity of the target feedback.

    View all citing articles on Scopus

    Weike Pan received the Ph.D. degree in Computer Science and Engineering from the Hong Kong University of Science and Technology, Kowloon, Hong Kong, China, in 2012. He is currently a Lecturer (research oriented) with the College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China. His research interests include transfer learning, recommender systems, and statistical machine learning.

    View full text