A parameter-free KNN for rating prediction
Introduction
The purpose of recommender systems is to predict users’ preferences over a wide range of items, and then offer them the items most likely to interest them. Since users are often overwhelmed by the huge amount of items provided by electronic retailers, recommendation is a salient feature for all types of e-commerce [1], [2]. That is why recommender systems attract a lot of attention due to their great commercial value [3], [4], [5], [6].
Collaborative filtering is a widely used category of recommender systems. It is based on the analysis of existing relationships between users and items to identify centers of interest concerning users [7], [8], [9], [10]. Based on these identifications, recommendations are made. Among the most popular collaborative filtering algorithms are the ones based on the nearest neighbors (KNN) method. They automate the common principle of word-of-mouth where one relies on the opinions of people sharing the same interests to assess the value of an item. These algorithms are recognized as easy to use with explainable recommendations, in addition to be efficient and stable [8], [11].
In the literature, KNN methods are well studied and many approaches have been proposed to predict the rating that a user could have on an item [9], [12], [13]. The idea behind these methods is that the rating of a user on an item is probably close to the one of another user , if both users and have similar ratings on other items.
In their basic functioning, the number of neighbors to consider is the only used parameter. However, finding an optimal value of the parameter remains a challenge. An inappropriate value of can negatively affect the quality of the recommendations. Therefore, the parameter is usually set to a tuned value by Grid Search.
Some authors have already addressed the problem of dynamically finding an appropriate value of in order to optimize the quality of recommendations [14], [15], [16]. In their approaches, they introduce additional parameters which make it possible to determine an appropriate value of but must be also calibrated in turn.
In this paper, we propose a parameter-free KNN method for rating prediction that we named freeKNN. It is able to dynamically select an appropriate number of neighbors to take into account depending on the user and the item to be rated. The experiments that we conducted on four public datasets demonstrate its effectiveness.
The remainder of this paper is organized as follows. In Section 2, we present related works on dynamic search for an appropriate value of in order to optimize the quality of recommendations. In Section 3, we detail the freeKNN method which does not use any parameter to make predictions. We explain how it dynamically seeks an appropriate number of neighbors to consider. Section 4 presents the results of our evaluation which demonstrate the effectiveness of freeKNN. Finally we conclude this paper in Section 5.
Section snippets
Related works
Recommender systems (RS) seek to suggest high-interest items to users, whether in terms of predicting the rating that the user would give to an item or estimating the likelihood of interaction1 between a user and an item.
In this context, the task of recommendation can be reduced to finding the items for which a user would give the highest ratings and recommending them to him. Formally, the task can be defined as follows: where is the set of
freeKNN
We named our method freeKNN. It can predict the rating that a user would give to an item without setting a number of neighbors to be considered beforehand.
The operation of the freeKNN method is based on the notion of Top-neighbors which we define as the smallest sublist of neighbors of whose global opinion is representative from that of all the neighbors.
For the sake of simplicity, we present in Section 3.1 some important definitions to understand how freeKNN works before explaining its
Experiment
For the validation of our method, we compared it to those presented in our state of the art in Section 2. In addition to these methods, we have also included the results of the standard KNN method with the parameter optimized on the best number of neighbors determined from an exhaustive search.
Conclusion
In the context of collaborative filtering and rating prediction, we addressed the problem of improving KNN methods. We focused on optimizing the number of neighbors to take into account while recommending an item for a user. We have proposed a method for dynamic selection of an appropriate number of neighbors.
Our method does not require any parameters unlike those of related works. It estimates, for a given user and item, the importance of the possible rating values that have been assigned in
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Junior Fopa Medjeu holds a master’s degree in computer science specializing in Business Intelligence. His research theme as part of his master’s internship focused on recommender systems.
He did his humanities in Cameroon and benefited from an intra-African mobility grant which brought him to Cheikh Anta Diop University in Dakar, Senegal.
References (29)
An entropy-based neighbor selection approach for collaborative filtering
Knowl.-Based Syst.
(2014)- et al.
A novel kNN algorithm with data-driven k parameter computation
Pattern Recognit. Lett.
(2018) - et al.
Recommender systems in e-commerce
- et al.
The million dollar programming prize
IEEE Spectr.
(2009) - et al.
The value of personalised recommender systems to e-business: a case study
- et al.
A case study on the effectiveness of recommendations in the mobile internet
- et al.
Amazon.com recommendations: item-to-item collaborative filtering
IEEE Internet Comput.
(2003) - et al.
Recommender systems and their impact on sales diversity
- et al.
A survey of collaborative filtering algorithms for social recommender systems
- et al.
A survey of collaborative filtering techniques
Adv. in Artif. Intell.
(2009)
Matrix factorization techniques for recommender systems
Computer
Yahoo! music recommendations: modeling music ratings with temporal dynamics and item taxonomy
Cited by (10)
A data-driven approach for collaborative optimization of large-scale electric vehicles considering energy consumption uncertainty
2023, Electric Power Systems ResearchSimilarity matrix enhanced collaborative filtering for e-government recommendation
2023, Data and Knowledge EngineeringEnhancing E-commerce Recommendation Accuracy Using KNN and Hybrid Approaches: An Empirical Study
2024, Lecture Notes in Networks and Systems
Junior Fopa Medjeu holds a master’s degree in computer science specializing in Business Intelligence. His research theme as part of his master’s internship focused on recommender systems.
He did his humanities in Cameroon and benefited from an intra-African mobility grant which brought him to Cheikh Anta Diop University in Dakar, Senegal.
Modou Gueye is an Assistant Professor at Cheikh Anta Diop University. He holds a Ph.D. degree from Telecom Paris, a leading French engineering school specialized in computer science.
He mainly works in designing scalable, but accurate too, recommender systems.
Samba Ndiaye is a Professor at Cheikh Anta Diop University. He supervised many doctorates on diverse fields of Data mining.
His research interests are in large scale data management and mining, recommender systems and web information extraction.
Hubert Naacke is an Assistant Professor at Pierre and Marie Curi University, also known as Paris 6, a public research university in Paris, France.
He is the author and co-author of several publications in international conferences and journals, national conferences, and book chapters. A part of his research interests are in recommender systems.