research-article

Up next: retrieval methods for large scale related video suggestion

Authors:

Michael Bendersky,

Lluis Garcia-Pueyo,

Jeremiah Harmsen,

Vanja Josifovski,

Dima LepikhinAuthors Info & Claims

KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 1769 - 1778

https://doi.org/10.1145/2623330.2623344

Published: 24 August 2014 Publication History

Abstract

The explosive growth in sharing and consumption of the video content on the web creates a unique opportunity for scientific advances in video retrieval, recommendation and discovery. In this paper, we focus on the task of video suggestion, commonly found in many online applications. The current state-of-the-art video suggestion techniques are based on the collaborative filtering analysis, and suggest videos that are likely to be co-viewed with the watched video. In this paper, we propose augmenting the collaborative filtering analysis with the topical representation of the video content to suggest related videos. We propose two novel methods for topical video representation. The first method uses information retrieval heuristics such as tf-idf, while the second method learns the optimal topical representations based on the implicit user feedback available in the online scenario. We conduct a large scale live experiment on YouTube traffic, and demonstrate that augmenting collaborative filtering with topical representations significantly improves the quality of the related video suggestions in a live setting, especially for categories with fresh and topically-rich video content such as news videos. In addition, we show that employing user feedback for learning the optimal topical video representations can increase the user engagement by more than 80% over the standard information retrieval representation, when compared to the collaborative filtering baseline.

Supplementary Material

MP4 File (p1769-sidebyside.mp4)

Download
268.97 MB

References

[1]

Give YouTube topics on search a whirl. http://youtube-global.blogspot.com/2010/11/give-youtube-topics-on-search-whirl.html.

[2]

Youtube -- statistics. http://youtube.com/yt/press/statistics.html.

[3]

Youtube data API - searching with Freebase topics. https://developers.google.com/youtube/v3/guides/searching_by_topic.

[4]

A. Ahmed, B. Kanagal, S. Pandey, V. Josifovski, L. G. Pueyo, and J. Yuan. Latent factor models with additive and hierarchically-smoothed user preferences. In Proceedings of WSDM, pages 385--394, 2013.

Digital Library

[5]

B. Bai, J. Weston, D. Grangier, R. Collobert, K. Sadamasa, Y. Qi, O. Chapelle, and K. Weinberger. Supervised semantic indexing. In Proceedings of CIKM 2009, pages 187--196, 2009.

Digital Library

[6]

P. Bailey, N. Craswell, I. Soboroff, P. Thomas, A. P. de Vries, and E. Yilmaz. Relevance assessment: are judges exchangeable and does it matter. In Proceedings of SIGIR, pages 667--674, 2008.

Digital Library

[7]

S. Baluja, R. Seth, D. Sivakumar, Y. Jing, J. Yagnik, S. Kumar, D. Ravichandran, and M. Aly. Video suggestion and discovery for youtube: taking random walks through the view graph. In Proceedings of WWW, pages 895--904, 2008.

Digital Library

[8]

A. Z. Broder, D. Carmel, M. Herscovici, A. Soffer, and J. Zien. Efficient query evaluation using a two-level retrieval process. In Proceedings of CIKM, pages 426--434. ACM, 2003.

Digital Library

[9]

C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In Proceedings of ICML, pages 89--96, 2005.

Digital Library

[10]

R. Burke. Hybrid recommender systems: Survey and experiments. User Modeling and User-Adapted Interaction, 12(4):331--370, Nov. 2002.

Digital Library

[11]

B. Chen, J. Wang, Q. Huang, and T. Mei. Personalized video recommendation through tripartite graph propagation. In Proceedings of MM, pages 1133--1136, 2012.

Digital Library

[12]

M. Collins, R. E. Schapire, and Y. Singer. Logistic regression, adaboost and bregman distances. Machine Learning, 48(1--3):253--285, Sept. 2002.

Digital Library

[13]

J. Davidson, B. Liebald, J. Liu, P. Nandy, T. Van Vleet, U. Gargi, S. Gupta, Y. He, M. Lambert, B. Livingston, and D. Sampath. The youtube video recommendation system. In Proceedings of RecSys, RecSys '10, pages 293--296, New York, NY, USA, 2010. ACM.

Digital Library

[14]

M. Fontoura, V. Josifovski, J. Liu, S. Venkatesan, X. Zhu, and J. Zien. Evaluation strategies for top-k queries over memory-resident inverted indexes. Proceedings of the VLDB Endowment, 4(12):1213--1224, 2011.

Digital Library

[15]

A. Gunawardana and C. Meek. A unified approach to building hybrid recommender systems. In Proceedings of RecSys, pages 117--124, 2009.

Digital Library

[16]

J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. In Proceedings of SIGIR, pages 119--126, 2003.

Digital Library

[17]

T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of KDD, pages 133--142, 2002.

Digital Library

[18]

H. Li. Learning to rank for information retrieval and natural language processing. Synthesis Lectures on Human Language Technologies, 4(1):1--113, 2011.

[19]

C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008.

Digital Library

[20]

P. Over, G. Awad, J. Fiscus, B. Antonishek, M. Michel, A. F. Smeaton, W. Kraaij, G. Quénot, et al. TRECVID 2012 -- an overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID 2012-TREC Video Retrieval Evaluation Online, 2012.

[21]

F. Radlinski, M. Kurup, and T. Joachims. How does clickthrough data reflect retrieval quality? In Proceedings of CIKM, pages 43--52, 2008.

Digital Library

[22]

D. Read, G. Loewenstein, and S. Kalyanaraman. Mixing virtue and vice: Combining the immediacy effect and the diversification heuristic. Journal of Behavioral Decision Making, 12(4):257--273, 1999.

[23]

G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information processing & management, 24(5):513--523, 1988.

Digital Library

[24]

G. Shani and A. Gunawardana. Evaluating recommendation systems. In Recommender systems handbook, pages 257--297. Springer, 2011.

[25]

V. Simonet. Classifying youtube channels: a practical system. In Proceedings of WOLE 2013, in Proceedings of WWWW companion, pages 1295--1304, 2013.

Digital Library

[26]

A. Singhal, C. Buckley, and M. Mitra. Pivoted document length normalization. In Proceedings of SIGIR, pages 21--29, 1996.

Digital Library

[27]

C. G. M. Snoek and M. Worring. Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications, 25(1):5--35, Jan. 2005.

Digital Library

[28]

T. Tsikrika, C. Diou, A. P. de Vries, and A. Delopoulos. Image annotation using clickthrough data. In Proceedings of CIVR, pages 14:1--14:8, 2009.

Digital Library

[29]

C. Vondrick, D. Patterson, and D. Ramanan. Efficiently scaling up crowdsourced video annotation. International Journal of Computer Vision, pages 1--21. 10.1007/s11263-012-0564-1.

Digital Library

[30]

J. Weston, S. Bengio, and N. Usunier. Large scale image annotation: learning to rank with joint word-image embeddings. Machine Learning, 81(1):21--35, Oct. 2010.

Digital Library

[31]

B. Yang, T. Mei, X.-S. Hua, L. Yang, S.-Q. Yang, and M. Li. Online video recommendation based on multimodal fusion and relevance feedback. In Proceedings of CIVR 2007, CIVR '07, pages 73--80, 2007.

Digital Library

[32]

Y. Yue, R. Patel, and H. Roehrig. Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data. In Proceedings of WWW, pages 1011--1018, 2010.

Digital Library

Cited By

Broadbridge VMangió FDi Domenico G(2023)How Brand Managers Can Maximize Engagement with ASMR YouTube ContentJournal of Advertising Research10.2501/JAR-2023-02663:4(313-334)Online publication date: 23-Nov-2023
https://doi.org/10.2501/JAR-2023-026
Salles Dde Medeiros PSantini RBarros C(2023)The Far-Right Smokescreen: Environmental Conspiracy and Culture Wars on Brazilian YouTubeSocial Media + Society10.1177/205630512311968769:3Online publication date: 30-Sep-2023
https://doi.org/10.1177/20563051231196876
Baron P(2023)Using YouTube's Social Media Analytics for Engineering Educators2023 IEEE Global Engineering Education Conference (EDUCON)10.1109/EDUCON54358.2023.10125146(1-10)Online publication date: 1-May-2023
https://doi.org/10.1109/EDUCON54358.2023.10125146
Show More Cited By

Index Terms

Up next: retrieval methods for large scale related video suggestion
1. Information systems
  1. Information retrieval

Recommendations

Learning Personal Preference From Viewer's Operations for Browsing and Its Application to Baseball Video Retrieval and Summarization

Personalization is one of the most important mechanisms to make multimedia systems easy to use. In video applications, its embodiment is to tailor video contents for a particular viewer. For this purpose, we are now developing a system of retrieving and ...
Understanding item consumption orders for right-order next-item recommendation

Although the relevance problem in recommender systems, which typically refers to the similarity between the preference of the user and the items the system recommends, has been well studied, the issue of making recommendations in right orders has barely ...
What to read next?: making personalized book recommendations for K-12 users
RecSys '13: Proceedings of the 7th ACM conference on Recommender systems

Finding books that children/teenagers are interested in these days is a non-trivial task due to the diversity of topics covered in huge volumes of books with varied readability levels. Even though K-12 readers can turn to book recommenders to look for ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

August 2014

2028 pages

ISBN:9781450329569

DOI:10.1145/2623330

General Chairs:
Sofus Macskassy
Facebook
,
Claudia Perlich
Dstillery
,
Program Chairs:
Jure Leskovec
Stanford University
,
Wei Wang
UCLA
,
Rayid Ghani
University of Chicago

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2014

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

KDD '14

Sponsor:

KDD '14: The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 24 - 27, 2014

New York, New York, USA

Acceptance Rates

KDD '14 Paper Acceptance Rate 151 of 1,036 submissions, 15%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Sponsor:
sigkdd
sigkdd

The 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 3 - 7, 2025

Toronto , ON , Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

24
Total Citations
View Citations
649
Total Downloads

Downloads (Last 12 months)11
Downloads (Last 6 weeks)2

Reflects downloads up to 07 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Broadbridge VMangió FDi Domenico G(2023)How Brand Managers Can Maximize Engagement with ASMR YouTube ContentJournal of Advertising Research10.2501/JAR-2023-02663:4(313-334)Online publication date: 23-Nov-2023
https://doi.org/10.2501/JAR-2023-026
Salles Dde Medeiros PSantini RBarros C(2023)The Far-Right Smokescreen: Environmental Conspiracy and Culture Wars on Brazilian YouTubeSocial Media + Society10.1177/205630512311968769:3Online publication date: 30-Sep-2023
https://doi.org/10.1177/20563051231196876
Baron P(2023)Using YouTube's Social Media Analytics for Engineering Educators2023 IEEE Global Engineering Education Conference (EDUCON)10.1109/EDUCON54358.2023.10125146(1-10)Online publication date: 1-May-2023
https://doi.org/10.1109/EDUCON54358.2023.10125146
Chandrakala S(2023)Anomalous human activity detection in videos using Bag-of-Adapted-Models-based representationPattern Analysis and Applications10.1007/s10044-023-01177-526:3(1101-1112)Online publication date: 21-Jun-2023
https://doi.org/10.1007/s10044-023-01177-5
Shi XJia MLi JChen QLiu GLiu Q(2022)Users' Feedback on COVID-19 Lockdown Documentary: An Emotion Analysis and Topic Modeling AnalysisFrontiers in Psychology10.3389/fpsyg.2022.94404913Online publication date: 28-Jun-2022
https://doi.org/10.3389/fpsyg.2022.944049
Baron P(2022)YouTube's Social Media Analytics as an Evaluation of Educational Teaching Videos2022 IEEE IFEES World Engineering Education Forum - Global Engineering Deans Council (WEEF-GEDC)10.1109/WEEF-GEDC54384.2022.9996202(1-8)Online publication date: 27-Nov-2022
https://doi.org/10.1109/WEEF-GEDC54384.2022.9996202
Degardin BProença H(2021)Human Behavior Analysis: A Survey on Action RecognitionApplied Sciences10.3390/app1118832411:18(8324)Online publication date: 8-Sep-2021
https://doi.org/10.3390/app11188324
Ren JXia FChen XLiu JHou MShehzad ASultanova NKong X(2021)Matching Algorithms: Fundamentals, Applications and ChallengesIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2021.30676555:3(332-350)Online publication date: Jun-2021
https://doi.org/10.1109/TETCI.2021.3067655
Airoldi M(2021)The techno-social reproduction of taste boundaries on digital platforms: The case of music on YouTubePoetics10.1016/j.poetic.2021.10156389(101563)Online publication date: Dec-2021
https://doi.org/10.1016/j.poetic.2021.101563
Losada DElsweiler DHarvey MTrattner C(2021)A day at the racesApplied Intelligence10.1007/s10489-021-02719-2Online publication date: 17-Aug-2021
https://doi.org/10.1007/s10489-021-02719-2
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten