Representation learning via Dual-Autoencoder for recommendation

doi:10.1016/j.neunet.2017.03.009

Neural Networks

Volume 90, June 2017, Pages 83-89

https://doi.org/10.1016/j.neunet.2017.03.009

Abstract

Recommendation has attracted a vast amount of attention and research in recent decades. Most previous works employ matrix factorization techniques to learn the latent factors of users and items, and many subsequent works incorporate external information, e.g., users' social relationships and items' attributes, to improve recommendation performance under the matrix factorization framework. However, matrix factorization methods may not make full use of the limited information in rating or check-in matrices, and can therefore achieve unsatisfying results. Recently, deep learning has proven able to learn good representations in natural language processing, image classification, and other domains. Along this line, we propose a new representation learning framework called Recommendation via Dual-Autoencoder (ReDa). In this framework, we simultaneously learn the new hidden representations of users and items using autoencoders, and minimize the deviations of training data by the learnt representations of users and items. Based on this framework, we develop a gradient descent method to learn the hidden representations. Extensive experiments conducted on several real-world data sets demonstrate the effectiveness of our proposed method compared with state-of-the-art matrix factorization based methods.

Introduction

In order to tackle the information overload problem, recommender systems have been proposed to help users find objects of interest by utilizing user–item interaction information and/or content information associated with users and items. Recommender systems have attracted much attention from multiple disciplines, and many techniques have been proposed to build them (Adomavicius and Tuzhilin, 2005, Bell, 2011). They are also widely used by many e-commerce companies, for example for product sales on Amazon and movie rentals from Netflix (Srebro & Jaakkola, 2003).

Traditional recommender systems normally utilize only user–item rating feedback for recommendation. The matrix factorization technique is widely used in such systems: it factorizes a user–item rating matrix into two low-rank user-specific and item-specific matrices, and then uses the factorized matrices to make further predictions (Koren et al., 2009, Srebro and Jaakkola, 2003). In order to comprehensively utilize the rich information available in recommender systems, there has been a surge of hybrid recommendation methods, such as social recommendation (Ma et al., 2008, Ma et al., 2011), location based recommendation (Lian et al., 2014, Liu et al., 2013), and heterogeneous network based recommendation (Shi et al., 2015, Yu et al., 2014). Most of these methods are based on a matrix factorization framework, in which the latent factors of users and items are usually obtained by directly factorizing the user–item rating matrix, and the additional information is usually used as a regularization constraint. Although these methods pay much attention to exploiting additional information, we suspect that they may not make full use of the user–item rating information. In other words, we might be able to obtain better latent factors of users and items by exploiting the rating information more extensively.
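The factorization described above can be illustrated with a minimal sketch. It is not taken from the paper: the toy rating matrix `R`, the function names `factorize` and `masked_rmse`, and all hyperparameters (`k`, `lr`, `reg`, `epochs`) are assumptions chosen for illustration. A zero in `R` is treated as a missing rating, and plain full-batch gradient descent with L2 regularization is used to fit the two low-rank matrices on the observed entries only.

```python
import numpy as np

def factorize(R, mask, k=2, lr=0.01, reg=0.02, epochs=500, seed=0):
    """Fit R ~ P @ Q.T on observed entries with full-batch gradient descent.

    Minimizes 0.5 * ||mask * (R - P Q^T)||^2 + 0.5 * reg * (||P||^2 + ||Q||^2).
    All hyperparameter values are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    m, n = R.shape
    P = rng.normal(scale=0.1, size=(m, k))  # user-specific latent factors
    Q = rng.normal(scale=0.1, size=(n, k))  # item-specific latent factors
    for _ in range(epochs):
        E = mask * (R - P @ Q.T)            # residual on observed ratings only
        grad_P = -E @ Q + reg * P
        grad_Q = -E.T @ P + reg * Q
        P -= lr * grad_P
        Q -= lr * grad_Q
    return P, Q

def masked_rmse(R, mask, P, Q):
    """Root-mean-squared error over the observed entries."""
    E = mask * (R - P @ Q.T)
    return float(np.sqrt(np.sum(E ** 2) / np.sum(mask)))

# Hypothetical 4-user x 5-item rating matrix; 0 marks a missing rating.
R = np.array([[5, 3, 0, 1, 4],
              [4, 0, 0, 1, 5],
              [1, 1, 0, 5, 2],
              [0, 1, 5, 4, 0]], dtype=float)
mask = (R > 0).astype(float)

P, Q = factorize(R, mask)
predictions = P @ Q.T  # predicted ratings, including the missing cells
```

The product `P @ Q.T` fills in every cell of the matrix, which is how the factorized matrices are used to predict unobserved ratings.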

On the other hand, deep learning has shown its power in learning latent feature representations in many domains, such as image/video processing (Alex et al., 2009) and text data (Socher, Huang, Pennington, Ng, & Manning, 2011). Can we use deep learning techniques to learn latent representations for recommendation? Some researchers have pursued this goal. For example, latent factors of music have been extracted from audio signals with a deep convolutional neural network (Oord, Dieleman, & Schrauwen, 2013), and tabular data has been modeled through an adaptation of Restricted Boltzmann Machines (Salakhutdinov, Mnih, & Hinton, 2007). Recently, Wang, Wang, and Yeung (2014) designed a collaborative deep learning method to utilize item content information. In essence, these methods use the representation learning power of deep learning to analyze additional information (e.g., audio signals and text content), not rating information, for better recommendations. They do not directly learn the latent factors of users and items with deep learning. Moreover, the additional information is sometimes hard to acquire and very sparse.

To the best of our knowledge, little effort has focused on employing deep learning for recommendation using only user–item rating information. Motivated by the success of deep learning in learning latent feature representations for image and text data, we design a novel Recommendation framework via Dual-Autoencoders (ReDa), which is illustrated in Fig. 1. As the figure shows, ReDa simultaneously learns the new hidden representations of users and items using autoencoders, and minimizes the deviations of training data by the learnt representations of users and items. Moreover, a gradient descent method is derived to learn the hidden representations. Experiments on four real-world data sets demonstrate the effectiveness of our proposed model.
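One way to picture the idea of two autoencoders coupled through the rating matrix is the sketch below. It is an assumption-laden illustration, not the objective defined in the paper: the single-layer sigmoid encoders and decoders, the weighting parameter `alpha`, the masking of unobserved entries, and every name (`init_params`, `dual_autoencoder_loss`, `Wu_enc`, and so on) are hypothetical. One autoencoder takes the rows of the rating matrix (users), the other takes its columns (items), and a third term measures how well the product of the two hidden representations matches the observed ratings.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_params(m, n, k, seed=0):
    """Random weights for a user autoencoder (n -> k -> n) and an item autoencoder (m -> k -> m)."""
    rng = np.random.default_rng(seed)
    return {
        "Wu_enc": rng.normal(scale=0.1, size=(n, k)),
        "Wu_dec": rng.normal(scale=0.1, size=(k, n)),
        "Wi_enc": rng.normal(scale=0.1, size=(m, k)),
        "Wi_dec": rng.normal(scale=0.1, size=(k, m)),
    }

def dual_autoencoder_loss(R, mask, params, alpha=1.0):
    """Assumed objective: two reconstruction errors plus a rating-fit term.

    R    : (m, n) ratings scaled to [0, 1]
    mask : (m, n) 1.0 where a rating is observed, 0.0 otherwise
    """
    # User side: each row of R is encoded into a k-dimensional hidden vector.
    U = sigmoid(R @ params["Wu_enc"])            # (m, k)
    R_hat_user = sigmoid(U @ params["Wu_dec"])   # (m, n)
    # Item side: each column of R is encoded into a k-dimensional hidden vector.
    V = sigmoid(R.T @ params["Wi_enc"])          # (n, k)
    R_hat_item = sigmoid(V @ params["Wi_dec"])   # (n, m)

    rec_user = np.sum(mask * (R_hat_user - R) ** 2)
    rec_item = np.sum(mask.T * (R_hat_item - R.T) ** 2)
    # Deviation of the observed ratings from the product of the hidden representations.
    fit = np.sum(mask * (U @ V.T - R) ** 2)
    return float(rec_user + rec_item + alpha * fit), U, V

# Hypothetical ratings on a 1-5 scale, rescaled to [0, 1]; 0 marks a missing rating.
raw = np.array([[5, 3, 0, 1],
                [4, 0, 0, 1],
                [1, 1, 0, 5]], dtype=float)
mask = (raw > 0).astype(float)
R = raw / 5.0

params = init_params(m=3, n=4, k=2)
loss, U, V = dual_autoencoder_loss(R, mask, params)
```

In this sketch the three terms share the same hidden representations `U` and `V`, so lowering the loss would shape both autoencoders and the rating fit at once.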

The remainder of this paper is organized as follows. We introduce the notations and preliminary knowledge in Section 2, and then propose the representation learning framework based on autoencoders for recommendation in Section 3. Extensive experiments conducted on several data sets are shown in Section 4, followed by the related work in Section 5 and conclusions in Section 6.

Section snippets

Notations and preliminaries

In this section, we first introduce some frequently used notations as presented in Table 1, and some preliminaries which will be used in our proposed framework.

Representation learning via Dual-Autoencoders

In this section, we first formulate the representation learning framework via dual-autoencoders for recommendation, and then derive the model solution using the gradient descent method.
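As a rough illustration of learning hidden representations by gradient descent, the sketch below trains a single one-hidden-layer autoencoder with hand-derived gradients. It is not the derivation from the paper: the sigmoid activations, the mean squared reconstruction loss, the learning rate, the epoch count, and the names `train_autoencoder`, `W1`, `W2` are all assumptions made for the example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_autoencoder(X, k=2, lr=0.5, epochs=200, seed=0):
    """Full-batch gradient descent on the mean squared reconstruction error.

    X is (n_samples, n_features) with values in [0, 1].
    Returns the hidden representations from the final forward pass
    and the loss recorded at every epoch.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(scale=0.1, size=(d, k)); b1 = np.zeros(k)
    W2 = rng.normal(scale=0.1, size=(k, d)); b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        # Forward pass: encode, then decode.
        H = sigmoid(X @ W1 + b1)          # hidden representation, (n, k)
        X_hat = sigmoid(H @ W2 + b2)      # reconstruction, (n, d)
        err = X_hat - X
        losses.append(float(np.mean(np.sum(err ** 2, axis=1))))

        # Backward pass (chain rule through both sigmoid layers).
        dZ2 = 2.0 * err * X_hat * (1.0 - X_hat) / n
        dW2 = H.T @ dZ2
        db2 = dZ2.sum(axis=0)
        dH = dZ2 @ W2.T
        dZ1 = dH * H * (1.0 - H)
        dW1 = X.T @ dZ1
        db1 = dZ1.sum(axis=0)

        # Gradient descent update.
        W1 -= lr * dW1; b1 -= lr * db1
        W2 -= lr * dW2; b2 -= lr * db2
    return H, losses

# Hypothetical user-by-item ratings rescaled to [0, 1]; each row is one training sample.
X = np.array([[5, 3, 0, 1, 4],
              [4, 0, 0, 1, 5],
              [1, 1, 0, 5, 2],
              [0, 1, 5, 4, 0]], dtype=float) / 5.0

H, losses = train_autoencoder(X)
```

The rows of `H` play the role of learnt hidden representations; in a dual setting the same procedure would be applied to both the user side and the item side.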

Experiments

In this section, we conduct experiments on four real-world data sets to systematically evaluate the effectiveness of our proposed framework for recommendation.

Related work

Recent years have witnessed a boom of research work in recommendation systems. A number of techniques are employed for recommendation, and many sources of data are fused to improve recommendation performance. Traditional recommender systems normally utilize only user–item rating feedback for recommendation. Collaborative filtering is one of the most popular techniques; its basic idea is to find similar objects for recommendation through interaction records. Recently, matrix

Conclusion and future work

To make full use of the user–item rating information and learn better latent representations, we propose in this paper a new representation learning model based on autoencoders for recommendation, in contrast to previous matrix factorization methods. In our proposed framework, we simultaneously learn the latent factors of users and items, and minimize the deviations of training data using the learnt latent factors. Experiments on four data sets validate the superiority of our proposed

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Nos. 61473273, 91546122, 61573335, 61602438), Guangdong provincial science and technology plan projects (No. 2015 B010109005), the Youth Innovation Promotion Association CAS 2017146 and 2015 Microsoft Research Asia Collaborative Research Program.

References (24)

G. Adomavicius et al. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering (2005)
Graves Alex et al. A novel connectionist system for unconstrained handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2009)
Yoshua Bengio. Learning deep architectures for AI. Foundations and Trends in Machine Learning (2009)
Yehuda Koren. Factorization meets the neighborhood: A multifaceted collaborative filtering model
Y. Koren et al.
Yehuda Koren et al. Matrix factorization techniques for recommender systems. Computer (2009)
Lee, J., Bengio, S., Kim, S., Lebanon, G., & Singer, Y. (2014). Local collaborative ranking for recommendation. In...
Daniel D. Lee et al. Algorithms for non-negative matrix factorization
Lian, Defu, Zhao, Cong, Xie, Xing, Sun, Guangzhong, Chen, Enhong, & Rui, Yong (2014). Geomf: joint geographical...
Huizhi Liang et al. A probabilistic rating auto-encoder for personalized recommender systems
Liu, Bin, Fu, Yanjie, Yao, Zijun, & Xiong, Hui (2013). Learning geographical preferences for point-of-interest...
Hao Ma et al. Learning to recommend with social trust ensemble

Cited by (113)

GraphRec-based Korean expert recommendation using author contribution index and the paper abstracts in marine
2024, Engineering Applications of Artificial Intelligence
Expert recommendation systems recommend specialized experts in a particular field to users based on the knowledge of those experts. However, these systems are limited by the number of experts available and the potential for subjective evaluation, which may result in inappropriate recommendations. Furthermore, we explore the evolution from traditional to deep learning-based recommendation systems, emphasizing graph-based recommendation systems. Nonetheless, deep learning-based systems require large amounts of data, and marine expert recommendation training data are scarce. To address these issues, we constructed and utilized marine expert data in this study. The dataset contains abstracts of marine-related papers and information on their authors. Graphs were generated by assessing the similarity among the abstracts, representing them in a graph format indicative of this similarity, and using the author contribution index to depict the relationship between the abstracts and their respective authors. Various similarity methods and abstract embedding techniques were experimentally explored to realize performance optimization. In the experiments, the optimized model achieved a mean absolute error of 0.7556 and a root-mean-squared error of 1.0421. Notably, this study highlights the limitations of traditional evaluation metrics and proposes the averaged mean reciprocal rank as a suitable alternative. This metric facilitates the quantitative evaluation of model performance on newly created data, obviating a comparison model. Finally, applying the newly constructed data to the GraphRec model by using their graphical representation significantly improves the system performance.
An effective knowledge graph entity alignment model based on multiple information
2023, Neural Networks
Entity alignment refers to matching entities with the same realistic meaning in different knowledge graphs. The structure of a knowledge graph provides the global signal for entity alignment. But in the real world, a knowledge graph provides insufficient structural information in general. Moreover, the problem of knowledge graph heterogeneity is common. The semantic and string information can alleviate the problems caused by the sparse and heterogeneous nature of knowledge graphs, yet both of them have not been fully utilized by most existing work. Therefore, we propose an entity alignment model based on multiple information (EAMI), which employs structural, semantic and string information. EAMI learns the structural representation of a knowledge graph by using multi-layer graph convolutional networks. To acquire more accurate entity vector representation, we incorporate the attribute semantic representation into the structural representation. In addition, to further improve entity alignment, we study the entity name string information. There is no training required to calculate the similarity of entity names. Our model is tested on publicly available cross-lingual datasets and cross-resource datasets, and the experimental results demonstrate the effectiveness of our model.
NEFM: Neural embedding based factorization machines for user response prediction
2023, Expert Systems with Applications
Citation Excerpt :
In this paper, to address these shortcomings, we propose a neural embedding factorization machine (NEFM) model, which enhances FM models in modeling higher-order and non-linear feature interactions. As been reported in Bengio (2009) and Zhuang et al. (2017), an auto-encoder is powerful to capture feature representation for input data. Being inspired by Item2Vec (Barkan & Koenigstein, 2016), which draws on the idea behind neural embedding (Mikolov, Chen, Corrado and Dean, 2013; Mikolov, Sutskever, Chen, Corrado and Dean, 2013), and learns item representations via a probabilistic auto-encoder, NEFM can effectively initialize the embedding layer of FM models.
As Factorization Machine (FM) models linearly describe feature interactions, they cannot accurately capture complex non-linear features of data. Furthermore, random initialization in these FM models seriously affects system convergence and performance. Therefore, the random embedding process of FM models may not be sufficient to capture the data information. Although deep neural network (DNN)-based FM models have recently been proposed for advanced feature interactions, they are difficult to train. To address these challenges, we propose a neural embedding factorization machine (NEFM) model, which effectively initializes the embedding layers based on an unsupervised pre-training framework of probabilistic auto-encoder. The NEFM couples the good linearity of FM models in modeling second-order feature interactions with the advantage of DNNs in modeling non-linear feature interactions. Experimental results show the effectiveness of the proposed NEFM. For example, NEFM improves performance by at least 6.99% over non-pre-trained FM models. Compared with FM models pre-trained by DNN-based models, the NEFM model reduces test error by at least 1.02%.
ExpGCN: Review-aware Graph Convolution Network for explainable recommendation
2023, Neural Networks
Existing works in recommender systems have widely explored extracting reviews as explanations beyond user–item interactions, and formulated explanation generation as a ranking task to enhance item recommendation performance. To associate explanations with users and items, graph neural networks (GNN) are usually employed to learn node representations on the heterogeneous user–item–explanation interaction graph. However, modeling heterogeneous graph convolution poses limitations in both message passing styles and computational efficiency, resulting in sub-optimal recommendation performance. To address these limitations, we propose an Explanation-aware Graph Convolution Network (ExpGCN). In particular, the heterogeneous interaction graph is divided into subgraphs according to the edge types in ExpGCN. By aggregating information from distinct subgraphs, ExpGCN is capable of generating node representations for the explanation ranking task and the item recommendation task respectively. Task-oriented graph convolution can not only reduce the complexity of heterogeneous node aggregation, but also alleviate the performance degeneration caused by conflicts between task learning objectives, which has been neglected in current studies. Extensive experiments on four public datasets show that ExpGCN significantly outperforms state-of-the-art baselines with high efficiency, demonstrating the effectiveness of ExpGCN in explainable recommendations.
Latent adversarial regularized autoencoder for high-dimensional probabilistic time series prediction
2022, Neural Networks
Many practical applications require probabilistic prediction of time series to model the distribution on future horizons. With ever-increasing dimensions, much effort has been invested into developing methods that often make an assumption about the independence between time series. Consequently, the probabilistic prediction in high-dimensional environments has become an essential topic with significant challenges. In this paper, we propose a novel probabilistic model called latent adversarial regularized autoencoder, abbreviated as TimeLAR, specifically for high-dimensional multivariate Time Series Prediction (TSP). It integrates the flexibility of Generative Adversarial Networks (GANs) and the capability of autoencoders in extracting higher-level non-linear features. Through flexible autoencoder mapping, TimeLAR learns cross-series relationships and encodes this global information into several latent variables. We design a modified Transformer for these latent variables to capture global temporal patterns and infer latent space prediction distributions, where only one step is required to output multi-step predictions. Furthermore, we employ the GAN to further refine the performance of latent space predictions, by using a discriminator to guide the training of the autoencoder and the Transformer in an adversarial process. Finally, complex distributions of multivariate time series data can be modeled by the non-linear decoder of the autoencoder. The effectiveness of TimeLAR is empirically underpinned by extensive experiments conducted on five real-world high-dimensional time series datasets in the fields of transportation, electricity, and web page views.
Handling data sparsity via item metadata embedding into deep collaborative recommender system
2022, Journal of King Saud University - Computer and Information Sciences
The tremendous growth in information over the last decade has led to information overload when accessing personalized products. The recommender framework that retrieves user preferences from past interactions is known as collaborative filtering (CF). Although CF is a prevalent technique in the recommender environment, it suffers from many problems such as information sparsity, scalability, cold-start, etc. Many investigations have been made to tackle some of these issues with the help of matrix factorization (MF) approaches. However, MF cannot handle the nonlinearity among the data. Deep learning is an advanced learning technique that has shown success in many applications such as image classification, computer vision, natural language processing, etc. Little work has been reported on deep learning techniques in the recommender domain. We propose an efficient deep collaborative recommender system that embeds item metadata to handle the nonlinearity in data and sparsity. The model consists of two stages. In the first stage, a neural network is used to retrieve the data’s nonlinear features through embedding vectors. These vectors are concatenated together and fed as input to the second stage of the model. The output of the model yields a partially observed rating. The input and the parameters are simultaneously optimized and updated to minimize errors. The proposed strategy is evaluated against the benchmark techniques on two well-known datasets. The experimental outcomes demonstrate our approach’s accuracy and efficiency. Moreover, the missing values can also be recovered by propagating the embedding vectors from the input to the output layers.
