ABSTRACT
This paper studies correlation-based item-item similarity measures for recommendation systems. While current research on recommender systems is directed toward deep learning-based approaches, nearest neighbor methods have been still used extensively in commercial recommender systems due to their simplicity. A crucial step in item-based nearest neighbor methods is to compute similarities between items, which are generally estimated through correlation measures like Pearson. The purpose of this paper is to re-investigate the effectiveness of correlation-based nearest neighbor methods on several benchmark datasets that have been used for recommendation evaluation in recent years. This paper also provides a more effective estimation method for correlation measures than the classical Pearson correlation coefficient and shows that this leads to significant improvements in recommendation performance.
- Deepak Agarwal, Liang Zhang, and Rahul Mazumder. 2011. Modeling item-item similarities for personalized recommendations on Yahoo! front page. The Annals of applied statistics (2011), 1839--1875.Google Scholar
- James Bennett, Stan Lanning, et al. 2007. The Netflix Prize. In Proceedings of KDD cup and workshop, Vol. 2007. New York, NY, USA., 35.Google Scholar
- Thierry Bertin-Mahieux, Daniel P. W. Ellis, Brian Whitman, and Paul Lamere. 2011. The Million Song Dataset. In Proceedings of the 12th International Society for Music Information Retrieval Conference, ISMIR 2011, Miami, Florida, USA, October 24--28, 2011, Anssi Klapuri and Colby Leider (Eds.). 591--596.Google Scholar
- Mukund Deshpande and George Karypis. 2004. Item-based top-n recommendation algorithms. ACM Transactions on Information Systems (TOIS) 22, 1 (2004), 143--177.Google ScholarDigital Library
- F Maxwell Harper and Joseph A Konstan. 2015. The Movielens Datasets: History and Context. ACM transactions on interactive intelligent systems (tiis) 5, 4 (2015), 1--19.Google Scholar
- Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of the 26th international conference on world wide web. 173--182.Google ScholarDigital Library
- Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In 2008 Eighth IEEE international conference on data mining. Ieee, 263--272.Google ScholarDigital Library
- Maurice George Kendall et al. 1946. The advanced theory of statistics. The advanced theory of statistics. 2nd Ed (1946).Google Scholar
- Halime Khojamli and Jafar Razmara. 2021. Survey of similarity functions on neighborhood-based collaborative filtering. Expert Systems with Applications 185 (2021), 115482.Google ScholarDigital Library
- Peter Knees, Dominik Schnitzer, and Arthur Flexer. 2014. Improving neighborhood-based collaborative filtering by reducing hubness. In Proceedings of International Conference on Multimedia Retrieval. 161--168.Google ScholarDigital Library
- Yehuda Koren. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. 426--434.Google ScholarDigital Library
- Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30--37.Google ScholarDigital Library
- Dawen Liang, Rahul G Krishnan, Matthew D Hoffman, and Tony Jebara. 2018. Variational autoencoders for collaborative filtering. In Proceedings of the 2018 world wide web conference. 689--698.Google ScholarDigital Library
- Greg Linden, Brent Smith, and Jeremy York. 2003. Amazon. com recommendations: Item-to-item collaborative filtering. IEEE Internet computing 7, 1 (2003), 76--80.Google Scholar
- Sam Lobel, Chunyuan Li, Jianfeng Gao, and Lawrence Carin. 2019. RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering. In International Conference on Learning Representations.Google Scholar
- Xin Luo, Mengchu Zhou, Yunni Xia, and Qingsheng Zhu. 2014. An efficient non-negative matrix-factorization-based approach to collaborative filtering for recommender systems. IEEE Transactions on Industrial Informatics 10, 2 (2014), 1273--1284.Google ScholarCross Ref
- Hao Ma, Irwin King, and Michael R Lyu. 2007. Effective missing data prediction for collaborative filtering. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. 39--46.Google ScholarDigital Library
- Xia Ning and George Karypis. 2011. Slim: Sparse linear methods for top-n recommender systems. In 2011 IEEE 11th international conference on data mining. IEEE, 497--506.Google ScholarDigital Library
- Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2001. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th international conference on World Wide Web. 285--295.Google ScholarDigital Library
- Harald Steck. 2019. Embarrassingly shallow autoencoders for sparse data. In The World Wide Web Conference. 3251--3257.Google ScholarDigital Library
- Marwan Torki. 2018. A document descriptor using covariance of word vectors. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 527--532.Google ScholarCross Ref
- Koen Verstrepen and Bart Goethals. 2014. Unifying nearest neighbors collaborative filtering. In Proceedings of the 8th ACM Conference on Recommender systems. 177--184.Google ScholarDigital Library
- Joe Whittaker. 1990. Graphical Models in Applied Multivariate Statistics. Wiley, New York, NY.Google Scholar
- Yao Wu, Christopher DuBois, Alice X Zheng, and Martin Ester. 2016. Collaborative denoising auto-encoders for top-n recommender systems. In Proceedings of the ninth ACM international conference on web search and data mining. 153--162.Google ScholarDigital Library
- Yin Zheng, Bangsheng Tang, Wenkui Ding, and Hanning Zhou. 2016. A neural autoregressive approach to collaborative filtering. In International Conference on Machine Learning. PMLR, 764--773.Google Scholar
Index Terms
- Rethinking Correlation-based Item-Item Similarities for Recommender Systems
Recommendations
A new similarity function for selecting neighbors for each target item in collaborative filtering
As one of the collaborative filtering (CF) techniques, memory-based CF technique which recommends items to users based on rating information of like-minded users (called neighbors) has been widely used and has also proven to be useful in many practices ...
Improving Accuracy of Recommender System by Item Clustering
Recommender System (RS) predicts user's ratings towards items, and then recommends highly-predicted items to user. In recent years, RS has been playing more and more important role in the agent research field. There have been a great deal of researches ...
Item Based Collaborative Filtering Approach in Movie Recommendation System Using Different Similarity Measures
ICCTA '20: Proceedings of the 2020 6th International Conference on Computer and Technology ApplicationsRecommender systems use the concept of data analysis and knowledge discovery to essentially provide personalized recommendations on the web. A number of different algorithms has been proposed to effectively capture users interest and provide accurate ...
Comments