A movie recommendation method based on users' positive and negative profiles

doi:10.1016/j.ipm.2021.102531

Information Processing & Management

Volume 58, Issue 3, May 2021, 102531

https://doi.org/10.1016/j.ipm.2021.102531 Get rights and content

Abstract

In the traditional content-based recommendation method, we usually use the movies users watched before or rated to represent their profile. However, there are many movies that users have never seen or rated. For an unrated movie, there are two possibilities: maybe the user likes it or does not like it. In this paper, we first focus on how to identify users' preferences for movies by using a collaborative filtering algorithm to predict the users’ movie ratings. We can then create two movie lists for each user, where one is the movies the user likes (with higher predicting or true ratings), and the other is the movies the user does not like (with lower predicting or true ratings). Based on these two movie lists, we establish a user positive profile and a user negative profile. Therefore, our algorithm will recommend to users movies that are most similar to their positive profile and most different from their negative profile. Finally, our experiments show that our method can improve the MAE index of the traditional collaborative filtering method by 12.54%, the MAPE index by 17.68%, and the F1 index by 10.16%.

Introduction

Movies have become a daily leisure activity for many people. Since the Lumière Brothers released the first film in 1895, the vigorous development and creation of the film industry in various countries has made films an indispensable leisure activity for the general public. However, nowadays, after more than 100 years of film development, with the rapid development of technology, the traditional screen viewing that originally required audiences to enter a theater has gradually transformed into an online streaming platform that can be watched immediately at home. This online streaming media platform combines novel technologies, focuses on the trends of the big data era, and has improved and established various functions. These functions not only allow users to rate movies, but also allow friends to exchange movie experiences. In addition, the platform uses score data from many users to build a recommendation system to predict the preferred recommendations of individual users.

The recommendation system is widely used in online streaming platforms. The most basic and extensive recommendation methods can be classified as the following categories (He, Parra & Verbert, 2016; Lu, Wu, Mao, Wang & Zhang, 2015; Shah, Gaudani & Balani, 2016; Thorat et al., 2015): (1) Collaborative Filtering, (2) Content-based filtering, and (3) Hybrid Recommendation.

Collaborative filtering aims to use the ratings of similar users or items to predict the user's rating on the next item. On the other hand, content-based filtering (Bergamaschi & Po, 2015) is mainly based on information retrieval and information filtering. It uses the user profile and product content as data, and compares the information contained in the product with the user profile. Content-based methods usually use products that users have purchased in the past to build user profiles. The intuition underpinning this method is that what the users buy represents what they like. Therefore, we can use the features extracted from the purchased products (such as the product's theme, attributes or categories) to represent the user's interests and preferences. In addition, user profiles can also reflect users’ hidden interests through the similarity and relevance of product topics. Content-based filtering will extract product characteristics to represent the target user's preference profile. It then compares the user profile with the characteristics of the product, and recommends the most relevant product set to the user.

When the content-based filtering method is applied to movie recommendation, the method creates a profile representing the user's preference based on the movies previously watched by the user, and then generates the top K movie recommendations from the movies that the user has not yet watched. The criteria for selecting these recommended movies are based on the highest similarity between the content of these movies and the user's interest profile. However, the disadvantage of this method is that it ignores information about movies that the user has never watched before. This missing information has two possible meanings: either the user likes them or the user does not like them. If it is the latter case, the negative information can help the recommendation system make more accurate recommendations by deleting those negative movies. Unfortunately, this negative information has not been used to filter out movies that users do not like. Therefore, in the past, movies recommended by CB to users may hide movies that users do not like. In this paper, we address this issue by first predicting the rating of each user for each item based on the CF algorithm. If the predicted rating value is high, the target user may like the item. On the other hand, if the predicted rating value is low, the target user may not like the item. A positive profile is established based on those items with a higher predicted rating, and a negative profile is established based on those items with a lower predicted rating. If an item has a high degree of similarity with the positive profile of the target user, but a low degree of similarity with the negative profile of the target user, then it may be ideal for recommendation to the target user.

When we are faced with an application situation where we have both product user rating information and product content information, the natural way is to use a hybrid method that combines collaborative filtering methods and content-based methods. This paper uses a hybrid approach to recommend movies that users like. However, there is a big difference between our method and the previous research, that is, we not only use the user's positive preference profile, but we also use the user's negative preference profile to keep favorite movies and delete disliked movies. As a result, our method can improve the accuracy of movie recommendation compared with the traditional CB method.

In order to construct our hybrid method, we first apply the CF method to obtain each user's prediction score for all items. These prediction scores are obtained based on the correlation of the users’ scoring behaviors. However, the main disadvantage of obtaining scores through CF is that user characteristics and movie characteristics are not considered when generating predictions. In order to further improve the prediction results, we divide each user's movie set into two groups. Each user's first movie set contains movies for which the user has a higher predicted score. In other words, it represents the user's positive preference profile. Similarly, each user's second movie set contains movies for which the user has a lower predicted score, which represents the user's negative preference profile. Based on these two profiles, we will recommend movies that are similar to the user's positive profile but different from the user's negative profile.

The advantages of our method include the following: (1) we consider the relevance of the users' scoring behavior; (2) we consider user characteristics and movie characteristics; and (3) we use users' negative profiles to further enhance the recommendation results.

The main contributions of this research are:

•
We propose a new hybrid method that combines CF and CB.
•
Our method can build a positive profile and a negative profile for each user.
•
This is the first paper that considers negative user profiles in CB, so we can delete movies negatively related to user preferences from the recommendation set.
•
Our experimental results show that considering negative profiles can improve the traditional CB that only considers positive profiles. Based on this finding, all past studies in CB can try to further improve their recommendation results by using negative profiles.

The rest of this paper is organized as follows. In Section 2, we state our research objective. Section 3 is a review of related work. Section 4 introduces a hybrid movie recommendation method based on the user's positive and negative profiles. Section 5 contains a series of experiments to prove the effectiveness of our recommendation algorithm. Section 6 is the conclusion of this paper. Finally, in Section 7 we discuss the implications of this research and future work.

Section snippets

Research objective

Content-based filtering uses user profiles and product content as data, and compares the information contained in the product with the user profile. Content-based methods usually use products purchased in the past to build user profiles. The intuition underpinning the previous approach is that what users buy represents what they like. However, for a certain product, the user may like or dislike it. If we can discern the relationship between users and unrated products, then this negative

Related work

The recommendation system is an information filtering mechanism, mainly used to reduce the extra cost involved in the process of searching for information. Based on the user's preferences, interests, behaviors, or needs, it recommends information, services, or products that the user potentially needs. The most popular recommendation algorithms can be classified as three main categories: content-based, collaborative filtering and hybrid approaches.

The process of content-based filtering is

Recommendation algorithm

In this section, we will outline our recommendation system. Fig. 1 shows the architecture of the entire system. In the system architecture, we have designed two main modules, namely the Collaborative-Filtering module (CF) and the Profile-Based module (PB). The CF module first collects actual rating data from the MovieLens dataset. Since the entire user-movie rating matrix is very sparse, we apply the user-based collaborative filtering algorithm (Herlocker, Konstan, Borchers & Riedl, 2015) to

Experiments

In this section, we first describe how we collected the dataset, and discuss the measurement metric. Then we conducted a series of experiments. Finally, we discuss the experimental results.

Conclusion

This paper proposes a new approach that is different from traditional recommendations, that is, including positive/negative user profiles to obtain more accurate recommendations. In previous studies, recommendation algorithms usually generated recommendation results based on items that users like, items that similar users like, or items that the user previously purchased. In other words, previous research mainly focused on the positive profile to start the research on recommendation. No

Implications of the study and future work

The results of this study indicate that negative profiles (users’ negative preferences) should be included when designing recommendation algorithms. However, it is not easy to identify the user's negative preferences because almost all datasets only store the user's positive feedback, such as click, purchase or browsing. In this paper, we apply the CF algorithm to obtain the user's predicted score for all items, so we can divide the user's preferences into a positive profile and a negative

CRediT authorship contribution statement

Yen-Liang Chen: Conceptualization, Methodology, Supervision, Validation, Writing - original draft, Writing - review & editing. Yi-Hsin Yeh: Data curation, Software, Validation. Man-Rong Ma: Conceptualization, Methodology.

References (35)

R. Alghamdi et al.
A survey of topic modeling in text mining
International Journal of Advanced Computer Science and Applications
(2015)
S. Bansal et al.
User tweets based genre prediction and movie recommendation using LSI and SVD
Beel, J. (2019). And the winner is MovieLens - On the popularity of recommender system datasets....
S. Bergamaschi et al.
Comparing LDA and LSA topic models for content-based movie recommendation systems
K. Bougiatiotis et al.
Enhanced movie content similarity based on textual, auditory and visual information
Expert Systems with Applications
(2018)
B. Chikhaoui et al.
An improved hybrid recommender system by combining predictions
T. Ha et al.
Item-network-based collaborative filtering: A personalized recommendation method based on a user's item network
Information Processing & Management
(2017)
F.M. Harper et al.
The movielens datasets: History and context
ACM Transactions on Interactive Intelligent Systems
(2015)
C. He et al.
Interactive recommender systems: A survey of the state of the art and future research challenges and opportunities
Expert Systems with Applications
(2016)
J. Herlocker et al.
An algorithmic framework for performing collaborative filtering
ACM SIGIR Forum
(2015)

T.C.K. Huang et al.

A novel recommendation model with google similarity

Decision Support Systems

(2016)

K. Indira et al.

Efficient machine learning model for movie recommender systems using multi-cloud environment

Mobile Networks and Applications

(2019)

F.O. Isinkaye et al.

Recommendation systems: Principles methods and evaluation

Egyptian Informatics Journal

(2015)

H. Jelodar et al.

Latent Dirichlet allocation (LDA) and topic modeling: Models, applications, a survey

Multimedia Tools and Applications

(2019)

C.Q. Jiang et al.

Hybrid collaborative filtering for high-involvement products: A solution to opinion sparsity and dynamics

Decision Support Systems

(2015)

Y. Koren et al.

Matrix factorization techniques for recommender systems

Computer

(2009)

J. Lu et al.

Recommender system application developments: A survey

Decision Support Systems

(2015)

Cited by (30)

Multi-objective optimization and integrated indicator-driven two-stage project recommendation in time-dependent software ecosystem
2024, Information and Software Technology
Time-dependent software ecosystem is a complex system, where there are many projects and developers. Recommending projects to developers in a time-dependent software ecosystem can improve their quality and development speeds. However, the time-dependence of projects and developers results in an increased difficulty of project recommendation.
To better recommend projects to developers in a time-dependent software ecosystem, we propose a method of multi-objective optimization and integrated indicator-driven two-stage project recommendation, which is fulfilled according to the change of developer communities and their projects.
According to the change of developer communities and their projects, a method of multi-objective optimization and integrated indicator-driven two-stage project recommendation is fulfilled. In the first stage, a constrained multi-objective optimization model for project recommendation to developer communities is established, and an improved NSGA-II algorithm is adopted to solve this model, with the purpose of obtaining the recommended projects to a developer community. For the second stage, an integrated indicator for project recommendation to developers is built to determine the developers of a project.
The proposed method is applied to project recommendation for nine time-dependent software ecosystems in GitHub, and compared with six state-of-the-art ones. The experimental results show that our method has significant advantages in recommendation accuracy and efficiency.
According to the experimental results, we conclude that the proposed method can timely and accurately recommend projects to developers in a time-dependent software ecosystem, which reduces the difficulty of solving the problem of project recommendation.
Neural TV program recommendation based on dynamic long-short term interest
2023, Applied Soft Computing
TV program recommendation can help user find interested programs and improve user experience. The heterogeneous information of programs is important for alleviating the problem of data sparsity. In addition, the existing TV program recommendation methods are lacking in dynamics. This paper proposes a neural TV program recommendation based on dynamic long-short term interest (NPR-DLSTI), which mainly includes two modules: program and user encoder. In the program encoder module, we use convolutional neural network and attention mechanism to learn the heterogeneous information of the program and realize program representation. In the user encoder module, we use gated recurrent unit and personalized attention to learn the dynamic change law of users’ interests. Experiments on real data sets show that our method can effectively improve the effectiveness and dynamics of TV program recommendation than other existing models.
Negative Can Be Positive: Signed Graph Neural Networks for Recommendation
2023, Information Processing and Management
Most of the existing GNN-based recommender system models focus on learning users’ personalized preferences from these (explicit/implicit) positive feedback to achieve personalized recommendations. However, in the real-world recommender system, the users’ feedback behavior also includes negative feedback behavior (e.g., click dislike button), which also reflects users’ personalized preferences. How to utilize negative feedback is a challenging research problem. In this paper, we first qualitatively and quantitatively analyze the three kinds of negative feedback that widely existed in real-world recommender systems and investigate the role of negative feedback in recommender systems. We found that it is different from what we expected — not all negative items are ranked low, and some negative items are even ranked high in the overall items. Then, we propose a novel Signed Graph Neural Network Recommendation model (SiGRec) to encode the users’ negative feedback behavior. Our SiGRec can learn positive and negative embeddings of users and items via positive and negative graph neural network encoders, respectively. Besides, we also define a new Sign Cosine (SiC) loss function to adaptively mine the information of negative feedback for different types of negative feedback. Extensive experiments on four datasets demonstrate the proposed model outperforms several existing models. Specifically, on the Zhihu dataset, SiGRec outperforms the unsigned GNN model (i.e., LightGCN), 27.58% 29.81%, and 31.21% in P@20, R@20, and nDCG@20, respectively. We hope our work can open the door to further exploring the negative feedback in recommendations.
Will they take this offer? A machine learning price elasticity model for predicting upselling acceptance of premium airline seating
2023, Information and Management
Employing customer information from one of the world's largest airline companies, we develop a price elasticity model (PREM) using machine learning to identify customers likely to purchase an upgrade offer from economy to premium class and predict a customer's acceptable price range. A simulation of 64.3 million flight bookings and 14.1 million email offers over three years mirroring actual data indicates that PREM implementation results in approximately 1.12 million (7.94%) fewer non-relevant customer email messages, a predicted increase of 72,200 (37.2%) offers accepted, and an estimated $72.2 million (37.2%) of increased revenue. Our results illustrate the potential of automated pricing information and targeting marketing messages for upselling acceptance. We also identified three customer segments: (1) Never Upgrades are those who never take the upgrade offer, (2) Upgrade Lovers are those who generally upgrade, and (3) Upgrade Lover Lookalikes have no historical record but fit the profile of those that tend to upgrade. We discuss the implications for airline companies and related travel and tourism industries.
Selection bias mitigation in recommender system using uninteresting items based on temporal visibility
2023, Expert Systems with Applications
Citation Excerpt :
Secondly, the key is how to identify the items in the missing data that the user is not interested in. Unrated items can be caused by the following two reasons (Chen et al., 2021): The user has not seen the item and does not know its existence so that there is no rating.
Most collaborative filtering recommendation algorithms rely too much on the user's historical rating data. However, selection bias is common in explicit feedback data, which makes the learning of user preferences face more challenges. We verify the influence of selection bias on topN recommendation, and propose a data filling strategy using uninteresting items based on temporal visibility to alleviate the selection bias in the data. Specifically, our method includes a weighted matrix factorization model to learn users' pre-use preferences for unrated items. According to the experience of items that users have seen but not interacted show negative preferences, we combine user activity, item popularity and temporal rating information to carry out non-uniform weighting to evaluate the confidence of unrated items as a negative example. Then the items with low pre-use preferences are taken as uninteresting items and filled in a low value to restore the user's real rating distribution. Experiments on two real world datasets show that our algorithm can effectively alleviate the selection bias and improve the recommendation accuracy.
Ensemble transfer learning-based multimodal sentiment analysis using weighted convolutional neural networks
2022, Information Processing and Management
Citation Excerpt :
The text2vec model and CNN network were used to extract the features of the texts in this paper, and The SVM was performed for sentiment analysis. Some related studies in the field of scene segmentation (Chen et al., 2020), movie genre classification (Yadav & Vishwakarma, 2020), video activity recognition (Mliki, Bouhlel, & Hammami, 2020) and movie recommendation (Chen, Yeh, & Ma, 2021) were also carried out in this area. Reinforcement learning is one of the important categories of machine learning, which has been utilized in various areas of NLP, including sentiment analysis.
Huge amounts of multimodal content and comments in a mixture form of text, image, and emoji are continuously shared by users on various social networks. Most of the comments of the users in these networks have emotional aspects, which make the multimodal sentiment analysis (MSA) an important and attractive research topics in this area. In this paper, an ensemble transfer learning method is exploited to propose a hybrid MSA model based on weighted convolutional neural networks. The extended Dempster–Shafer (Yager) theory is also utilized in the proposed method of this paper to fuse the outputs of text and image classifiers to determine the final polarity at the decision level. The pre-trained VGG16 network is firstly used to extract visual features and fine-tune on the MVSA-Multiple and T4SA datasets for image sentiment classification. The Mask-RCNN model is then exploited to determine the objects in the images and convert them to text. The BERT model receives the output of this step along with the textual descriptions of the images for extracting the text features and embedding the words. The output of the BERT model is then imported into a weighted convolutional neural network ensemble (WCNNE). The texts are classified by several weak learners using the AdaBoost that is an ensemble learning technique in which, classifiers are trained sequentially. The combined use of several weak classifiers results in a strong classification. The WCNNE improves the performance and increases the accuracy of the results. As a fusing phase at the decision level, the outputs of the VGG16 and the WCNNE models will be finally merged using the extended Dempster-Shafer theory to obtain the correct sentiment label. The results of the experiments on the MVSA-Multiple and T4SA datasets show that the proposed model is better than the other compared methods and achieved an appropriate accuracy of 0.9348 on MVSA and 0.9689 on the T4SA datasets. Moreover, the proposed model reduces training time due to the use of transfer learning and the proposed AdaBoostCNN achieves better results compared to the single CNN.

View all citing articles on Scopus

View full text

A movie recommendation method based on users' positive and negative profiles

Abstract

Introduction

Section snippets

Research objective

Related work

Recommendation algorithm

Experiments

Conclusion

Implications of the study and future work

CRediT authorship contribution statement

A survey of topic modeling in text mining

International Journal of Advanced Computer Science and Applications

User tweets based genre prediction and movie recommendation using LSI and SVD

Comparing LDA and LSA topic models for content-based movie recommendation systems

Enhanced movie content similarity based on textual, auditory and visual information

Expert Systems with Applications

An improved hybrid recommender system by combining predictions

Item-network-based collaborative filtering: A personalized recommendation method based on a user's item network

Information Processing & Management

The movielens datasets: History and context

ACM Transactions on Interactive Intelligent Systems

Interactive recommender systems: A survey of the state of the art and future research challenges and opportunities

Expert Systems with Applications

An algorithmic framework for performing collaborative filtering

ACM SIGIR Forum

A novel recommendation model with google similarity

Decision Support Systems

Efficient machine learning model for movie recommender systems using multi-cloud environment

Mobile Networks and Applications

Recommendation systems: Principles methods and evaluation

Egyptian Informatics Journal

Latent Dirichlet allocation (LDA) and topic modeling: Models, applications, a survey

Multimedia Tools and Applications

Hybrid collaborative filtering for high-involvement products: A solution to opinion sparsity and dynamics

Decision Support Systems

Matrix factorization techniques for recommender systems

Computer

Recommender system application developments: A survey

Decision Support Systems