Large-scale Collaborative Ranking in Near-Linear Time

Authors:

Liwei Wu,

Cho-Jui Hsieh,

James Sharpnack

KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 515 - 524

https://doi.org/10.1145/3097983.3098071

Published: 04 August 2017

Abstract

In this paper, we consider the Collaborative Ranking (CR) problem for recommendation systems. Given a set of pairwise preferences between items for each user, collaborative ranking can be used to rank unrated items for each user, and this ranking can be used directly for recommendation. Collaborative ranking algorithms are observed to usually achieve better performance because they directly minimize the ranking loss; however, they are rarely used in practice because of their poor scalability. All existing CR algorithms have time complexity of at least O(|Ω|r) per iteration, where r is the target rank and |Ω| is the number of pairs, which grows quadratically with the number of ratings per user. For example, the Netflix data contains 20 billion rating pairs in total, and at this scale all current algorithms have to work with significant subsampling, resulting in poor prediction on test data.
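The quadratic growth of |Ω| can be illustrated with a small sketch. It assumes that a pair is formed from any two items a user has rated differently (ties are skipped); the function name `count_pairs` and the toy data are hypothetical and not taken from the paper.

```python
from itertools import combinations

def count_pairs(ratings):
    """Count comparable item pairs over all users.

    `ratings` maps a user id to a dict {item id: numerical rating}.
    Assumption: two items form a pair only if their ratings differ.
    """
    total = 0
    for items in ratings.values():
        total += sum(1 for a, b in combinations(items.values(), 2) if a != b)
    return total

# Hypothetical toy data: one user with 4 ratings, one with 3 (one tie).
toy = {
    "u1": {"a": 5, "b": 3, "c": 1, "d": 4},
    "u2": {"a": 2, "b": 2, "c": 5},
}
n_ratings = sum(len(items) for items in toy.values())
n_pairs = count_pairs(toy)

# A single user with 100 distinct ratings already yields 100*99/2 pairs.
heavy_user = {"u": {i: i for i in range(100)}}
n_pairs_heavy = count_pairs(heavy_user)
```

With d ratings for one user there are up to d(d-1)/2 pairs, which is why the number of pairs can exceed the number of ratings by orders of magnitude on large data sets.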

We propose a new collaborative ranking algorithm called Primal-CR that reduces the time complexity to O(|Ω| + d₁d̄₂r), where d₁ is the number of users and d̄₂ is the average number of items rated by a user. Note that d₁d̄₂ is strictly smaller, and often much smaller, than |Ω|.
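One way a bound of the form O(|Ω| + d₁d̄₂r) can arise is sketched below for a single user. This is an illustration under stated assumptions, not the authors' implementation: it assumes a squared hinge loss on score differences, a user vector `u` of length r, and item vectors `V`; all names are hypothetical. The naive version does O(r) work for every pair, while the second version computes each item score once, does O(1) work per pair on scalars, and then combines the item vectors once.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def grad_naive(u, V, y):
    """Gradient w.r.t. u with O(r) work per pair: O(|pairs| * r) in total."""
    r = len(u)
    g = [0.0] * r
    for j in range(len(y)):
        for k in range(len(y)):
            if y[j] > y[k]:
                diff = [V[j][c] - V[k][c] for c in range(r)]
                margin = 1.0 - dot(diff, u)
                if margin > 0:
                    for c in range(r):
                        g[c] -= 2.0 * margin * diff[c]
    return g

def grad_precomputed(u, V, y):
    """Same gradient with O(1) work per pair: O(|pairs| + d * r) in total."""
    r = len(u)
    m = [dot(v, u) for v in V]        # item scores, computed once: O(d * r)
    t = [0.0] * len(y)                # one scalar coefficient per item
    for j in range(len(y)):
        for k in range(len(y)):
            if y[j] > y[k]:
                margin = 1.0 - (m[j] - m[k])
                if margin > 0:
                    t[j] -= 2.0 * margin
                    t[k] += 2.0 * margin
    # Combine item vectors once: O(d * r)
    return [sum(t[j] * V[j][c] for j in range(len(y))) for c in range(r)]

# Hypothetical toy inputs: 3 rated items, rank r = 2.
u = [0.5, -0.2]
V = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
y = [5, 3, 4]
g_slow = grad_naive(u, V, y)
g_fast = grad_precomputed(u, V, y)
```

Both functions return the same vector up to floating-point rounding; only the amount of work per pair differs.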

Furthermore, by exploiting the fact that most data is in the form of numerical ratings instead of pairwise comparisons, we propose Primal-CR++ with O(d₁d̄₂(r + log d̄₂)) time complexity. Both algorithms have better theoretical time complexity than existing approaches and also outperform existing approaches in terms of NDCG and pairwise error on real data sets. To the best of our knowledge, this is the first collaborative ranking algorithm capable of working on the full Netflix dataset using all 20 billion rating pairs, and this leads to a model with much better recommendations than previous models trained on subsamples. Finally, compared with the classical matrix factorization algorithm, which also requires O(d₁d̄₂r) time, our algorithm has almost the same efficiency while making much better recommendations, since it considers the ranking loss.
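The two evaluation metrics named above can be sketched for a single user as follows. These are common textbook definitions, assumed here for illustration; the exact variants used in the paper (gain function, cutoff, treatment of ties) may differ, and the function names are hypothetical.

```python
import math

def ndcg_at_k(scores, ratings, k):
    """NDCG@k with gain 2^rating - 1 and log2 position discount (assumed form)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    dcg = sum((2 ** ratings[i] - 1) / math.log2(pos + 2)
              for pos, i in enumerate(order))
    ideal = sorted(ratings, reverse=True)[:k]
    idcg = sum((2 ** rel - 1) / math.log2(pos + 2)
               for pos, rel in enumerate(ideal))
    return dcg / idcg if idcg > 0 else 0.0

def pairwise_error(scores, ratings):
    """Fraction of pairs with ratings[j] > ratings[k] that the scores misorder.

    Assumption: a tie in the predicted scores counts as an error.
    """
    wrong = total = 0
    n = len(ratings)
    for j in range(n):
        for k in range(n):
            if ratings[j] > ratings[k]:
                total += 1
                if scores[j] <= scores[k]:
                    wrong += 1
    return wrong / total if total else 0.0

# Hypothetical example: true ratings and two sets of predicted scores.
ratings = [5, 3, 1]
good_scores = [3.0, 2.0, 1.0]   # agrees with the ratings
bad_scores = [1.0, 2.0, 3.0]    # reverses the ratings
```

Higher NDCG and lower pairwise error both indicate a ranking that agrees more closely with the observed ratings.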

Supplementary Material

MP4 File (wu_collaborative_ranking.mp4)

