Sliced Wasserstein based Canonical Correlation Analysis for Cross-Domain Recommendation

doi:10.1016/j.patrec.2021.06.015

Pattern Recognition Letters

Volume 150, October 2021, Pages 33-39

https://doi.org/10.1016/j.patrec.2021.06.015 Get rights and content

Highlights

•
A cross-domain recommendation model based on Sliced Wasserstein autoencoder is proposed.
•
An improved cross-domain transformation loss of orthogonal transformation is proposed.
•
Rigorous experiments were carried out to demonstrate the efficiency of the model.

Abstract

To solve the problem of data sparsity and cold start, the cross-domain recommendation is a promising research direction in the recommender system. The goal of cross-domain recommendation is to transfer learned knowledge from the source domain to the target domain by different means to improve the performance of the recommendation. But most approaches face the distribution misalignment. In this paper, we propose a joint learning cross-domain recommendation model that can extract domain-specific and common features simultaneously, and only use the implicit feedback data of users without additional auxiliary information. To the best of our knowledge, it is the first attempt to combine the sliced Wasserstein distance and canonical correlation analysis for the cross-domain recommendation scenario. Our one intuition is to reduce the reconstruction error caused by the variational inference based autoencoder model by the optimal transportation theory. Another attempt is to improve the correlation between domains by combining the idea of the canonical correlation analysis. With rigorous experiments, we empirically demonstrated that our model can achieve better performance compared with the state-of-the-art methods.

Introduction

In the era of the rapid development of the Internet, the data shows an exponential level of increase. Many users encounter difficulties in accessing information. For this reason, recommender systems are got more attention. E-commerce and news recommendation are the most widely used senior, for example, when a user buys something online or browsers some news, the online recommender systems will recommend items that meet their taste by modeling the users history preference. Therefore, recommender systems are designed to study the potential relationships between users and items in order to provide a more appropriate experience for users.

Collaborative filtering is a basic idea in the recommendation system. It calculates the similarity between users or items to recommend the corresponding items using explicit data like ratings. However, implicit data such as click history are more common in the real world but face with sparsity problems. Because the interaction matrix of user-items is too large, calculating the similarity of items or users will be time-consuming and can not extract meaningful information. Some matrix factorization based methods [14], [22] can obtain the latent factor of user and item by decomposing the user-item ratings or interactions matrix. But MF based method only captures shallow features. Although other deep learning-based methods [7] is proposed to learn high-level features, it can not significantly reduce the sparsity impact. Except for that, the cold start is another main challenge it. For example, it is hard to recommend a new item or user enters when the system lacks user or item preferences. Therefore, cross-domain recommendations are proposed to solve the aforementioned problem. It learns the information of the dense data field and transfers it to the target field to improve the recommendation performance. It is a particular research point of transfer learning that has been applied many times in cross-domain recommendations. Most cross-domain recommendation systems are based on user overlapped scenarios. For example, in a very sparse movie domain, when we recommend movies to users we learn useful knowledge from another data domain such as book that contains the same users. And we use this learned information and origin data knowledge to make better recommendations to those users.

In addition, many models based on autoencoder structure have been proposed to solve the above problems including single and cross-domain. For example, CDAE adds some noise to corrupt input and in order to recovery original input data by the denoising autoencoder structure. However, the DAE-like models [23], [27] only reconstruct the input and output, and only maps the data to a fixed space, which leads to the weak generalization ability of the model. VAE-like models [11], [21] are proposed for recommendation system to tackle the problem. This kind of model employed variational inference that makes up the generation ability of the above model. But it measures the difference between the learned space and the prior based on the K-L divergence, which results in a one-to-many reconstruction error and the intersection between the space, as shown in Fig. 1(a). Therefore, wae-likes model is proposed to solve the above problem, as shown in Fig. 1(b), which can alleviate the reconstruction error caused by VAE.

For the above discussion, we propose an end-to-end cross-domain recommendation model called SWCCA on the basis of the shared user scenario. The model only uses the implicit feedback of users in different domains as inputs and utilizes sliced Wasserstein autoencoder as the main architecture to learn the latent space of different domains. We also studied how to take the correlation of hidden variables into account. At the same time, different priors are applied to different domains to obtain more flexible specific features. The main contributions of this paper are as follows:

•
We propose a new cross-domain recommendation model based on sliced Wasserstein autoencoder with the idea of canonical correlation analysis which can alleviate the reconstruction error in latent factors and can extract the lower dimension information by the sliced projection.

•
We propose an enhanced orthogonal transformation constraint with sliced Wasserstein distance that can capture more common features between different learned features.

•
We designed rigorous experiments to verify the validity of the proposed method and the results show that our method can improve recommendation effects remarkably.

Section snippets

Single domain recommendation

The most commonly used method in single domain recommendations is collaborative filtering [7], [12], which is based on the idea that users with similar historical characteristics are likely to have similar buying characteristics in the future [25]. It usually calculates the similarity between users or items according to the user-item interaction matrix and makes recommendations according to the similarity [9]. Generally, collaborative filtering can be divided into model-based and memory-based

Approach

In this section, we will introduce our basic idea and the loss function of the proposed method. The framework of the proposed SWCCA is as shown in Fig. 2. It contains two major parts prior regularizer and orthogonal transformation module. For each domain’s data, it first through encoders and obtain the latent features. Then we propose to implement an enhanced orthogonal transformation to transfer the different domain knowledge. Finally, we decode the origin and transferred the latent feature to

Experiments

In this section, we will perform rigorous experiments to demonstrate the performance of our approaches. We use several subsets of the Amazon Product with shared users.

Conclusions

In this paper, we proposed a sliced Wasserstein based canonical correlation analysis model to reduce the data sparsity problem and improve recommendation performance. It is the first attempt to apply the sliced Wasserstein autoencoder architecture to the cross-domain recommendation scenario, and we combine the concept of canonical correlation analysis to further understand the correlation between domains. Our experiments show that SWCCA can achieve the best performance and tsne visualization

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This work was supported in the part by the National Natural Science Foundation of China (61702471,61872326), Major Scientific and Technological Innovation Project Shandong (2019JZZY020705) and Fundamental Research Funds for the Central Universities(202042008).

References (28)

D. Wang et al.
A content-based recommender system for computer science publications
Knowl. Based Syst.
(2018)
G. Andrew et al.
Deep canonical correlation analysis
Proceedings of the International Conference on Machine Learning
(2013)
F. R. Bach, M. I. Jordan, A probabilistic interpretation of canonical correlation analysis...
X. Chen, Y. Zhang, I. Tsang, Y. Pan, J. Su, Towards equivalent transformation of user preferences in cross domain...
Z. Gao et al.
A pairwise attentive adversarial spatiotemporal network for cross-domain few-shot action recognition-r2
IEEE Trans. Image Process.
(2020)
Z. Gao et al.
Pairwise two-stream convnets for cross-domain action recognition with small data
IEEE Trans. Neural Netw. Learn. Syst.
(2020)
D.R. Hardoon et al.
Canonical correlation analysis: an overview with application to learning methods
Neural Comput.
(2004)
X. He et al.
Neural collaborative filtering
Proceedings of the 26th International Conference on World Wide Web
(2017)
S. Kolouri et al.
Sliced Wasserstein auto-encoders
Proceedings of the International Conference on Learning Representations
(2018)
Y. Koren et al.
Advances in collaborative filtering
Recommender Systems Handbook
(2015)

P. Li et al.

Ddtcdr: Deep dual transfer cross domain recommendation

Proceedings of the 13th International Conference on Web Search and Data Mining

(2020)

D. Liang et al.

Variational autoencoders for collaborative filtering

Proceedings of the 2018 World Wide Web Conference

(2018)

T. Man et al.

Cross-domain recommendation: an embedding and mapping approach.

Proceedings of the IJCAI

(2017)

J. McAuley et al.

Hidden factors and hidden topics: understanding rating dimensions with review text

Proceedings of the 7th ACM Conference on Recommender Systems

(2013)

Cited by (4)

Spatio-temporal scale information fusion of Functional Near-Infrared Spectroscopy signal for depression detection
2024, Knowledge-Based Systems
Depression, a prevalent and debilitating mental health disorder, necessitates early detection to enable effective intervention. Functional Near-Infrared Spectroscopy (fNIRS) has proven to be a promising technique for studying depressive symptoms due to its affordability, portability, high spatial resolution and non-invasiveness. However, existing fNIRS-based depression detection approaches often overlook signal intrinsic properties and intrinsic correlations among different feature types, as well as local spatial geometric structures of data. To overcome these limitations, we suggest a spatio-temporal scale information fusion framework that involves supervised local low-rank correlation (IFLRC) to detect depression. IFLRC incorporates a low-rank constraint, a sparse $ℓ_{2, 1}$ -norm term, multiple Laplacian regularizations, and a sparse penalty projection matrix to capture global spatial functional information, handle errors to enhance robustness to noise, preserve local spatial geometric structures, exploit correlations among multiple information sets, and enhance discriminative ability. An iterative algorithm, based on the alternating direction method of multipliers (ADMM), has been developed to solve the suggested objective function. The experimental findings validate the efficacy of IFLRC, yielding favorable detection rates of 95.8%, 87.3%, and 97.2% in response to positive, neutral, and negative emotional stimuli, respectively. These findings exceed existing methods, demonstrating the potential of fNIRS and the effectiveness of IFLRC for detecting depression. IFLRC presents notable advancements in feature extraction and fusion techniques for depression detection. Its capacity to capture spatio-temporal scale information, leverage intrinsic correlations, and preserve local geometric structures translates into improved performance. This framework holds promise for early detection and intervention in depressive disorders, offering valuable insights for clinical applications.
Adaptive preference transfer for personalized IoT entity recommendation
2022, Pattern Recognition Letters
Citation Excerpt :
Although they present promising results, when the user feedback information is sparse, it is difficult to accurately represent the feedback feature, resulting in limited recommendation performance. Cross-domain recommendation (CDR) is designed to transfer the feature model from the data-rich source domain to the data-insufficient target domain, which is a promising solution to alleviate the cold-start problem [9,10]. The core task of CDR is to bridge the user’s preferences in the source and target domains [11].
Internet of Things (IoT) recommendation can effectively improve the convenience and intelligence of users in obtaining information about interested physical entities in a huge entity search space. However, existing transfer learning-based recommendation methods mainly focus on Internet resources and services, lack personalized transfer weight consideration for different user preferences, and fail to address the entity rating matrix sparsity problem, resulting in limited IoT entity recommendation performance. Thus, we design an adaptive preference transfer IoT entity (APTE) recommendation method in this paper. Firstly, considering various user preference characteristics, an adaptive dual-domain transfer model for item domain and social domain is designed to meet users’ personalized requirements. Then, a lightweight user feedback embedding method is proposed to mine the explicit and implicit features and to embed the entity rating matrix to solve the problem of sparse user feedback information. Simulation results demonstrate that APTE can effectively improve the recommendation performance compared with state-of-the-art baselines.
Editorial paper for pattern recognition letters VSI on multi-view representation learning and multi-modal information representation
2022, Pattern Recognition Letters
Distributional learning in multi-objective optimization of recommender systems
2023, Journal of Ambient Intelligence and Humanized Computing

View full text

Sliced Wasserstein based Canonical Correlation Analysis for Cross-Domain Recommendation

Highlights

Abstract

Introduction

Section snippets

Single domain recommendation

Approach

Experiments

Conclusions

Declaration of Competing Interest

Acknowledgment

Knowl. Based Syst.

Deep canonical correlation analysis

Proceedings of the International Conference on Machine Learning

A pairwise attentive adversarial spatiotemporal network for cross-domain few-shot action recognition-r2

IEEE Trans. Image Process.

Pairwise two-stream convnets for cross-domain action recognition with small data

IEEE Trans. Neural Netw. Learn. Syst.

Canonical correlation analysis: an overview with application to learning methods

Neural Comput.

Neural collaborative filtering

Proceedings of the 26th International Conference on World Wide Web

Sliced Wasserstein auto-encoders

Proceedings of the International Conference on Learning Representations

Advances in collaborative filtering

Recommender Systems Handbook