
An ML-Based Approach for Near Real-Time Content Caching

Authors:
Dimas S. Lima, Bruno Guimarães Oliveira, Paulo Renato C. Mendes, Lucas Costa, Yago Coelho

Globo, Rio de Janeiro, Brazil

VisNEXT'21: Proceedings of the Workshop on Design, Deployment, and Evaluation of Network-assisted Video Streaming, December 2021, Pages 8–14, https://doi.org/10.1145/3488662.3498658

Published: 07 December 2021

ABSTRACT

Content caching is a well-known and promising way for streaming companies to cope with large demand. This paper presents ongoing work towards improving CDN network traffic, with a focus on users' quality of experience (QoE), by anticipating which videos will be popular on Globo's platform. A deep neural network models a video's popularity from its metadata, and a near real-time framework describes how to cache content preemptively. A threshold selection approach then decides whether a video should be cached. The approach requires no user interaction: the admission decision is made before the content starts to receive requests. This matters for most of the videos published daily at Globo, especially breaking news. Using Globo's real-world data, we demonstrate the popularity predictor's results and conclude with directions for future work.
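To make the admission step concrete, the following is a minimal sketch of threshold-based cache admission on top of a popularity score. The abstract does not state how the threshold is selected, so the criterion used here (maximizing F1 on held-out data) is an assumption made purely for illustration, as are the function names `f1_at`, `select_threshold`, and `should_cache` and the toy validation data. The scores stand in for the output of a popularity predictor; no model is trained here.

```python
from typing import Sequence


def f1_at(threshold: float, scores: Sequence[float], popular: Sequence[bool]) -> float:
    """F1 of the rule 'cache if score >= threshold' against known labels."""
    tp = sum(1 for s, y in zip(scores, popular) if s >= threshold and y)
    fp = sum(1 for s, y in zip(scores, popular) if s >= threshold and not y)
    fn = sum(1 for s, y in zip(scores, popular) if s < threshold and y)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def select_threshold(scores: Sequence[float], popular: Sequence[bool]) -> float:
    """Pick the candidate threshold with the best F1 (assumed criterion)."""
    candidates = sorted(set(scores))
    return max(candidates, key=lambda t: f1_at(t, scores, popular))


def should_cache(score: float, threshold: float) -> bool:
    """Admission decision made before the video receives any request."""
    return score >= threshold


# Hypothetical held-out data: predicted popularity and whether the
# video actually turned out to be popular.
val_scores = [0.05, 0.20, 0.35, 0.60, 0.80, 0.95]
val_popular = [False, False, True, False, True, True]

threshold = select_threshold(val_scores, val_popular)
print(threshold, should_cache(0.50, threshold), should_cache(0.10, threshold))
```

In this sketch the decision depends only on a score derived from metadata, which is what allows it to run at publication time. A different selection criterion, for example one weighted towards cache capacity or miss cost, would slot into `select_threshold` without changing the admission rule.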

Supplemental Material

3488662.3498658.mp4 (mp4, 599.9 MB)

Index Terms

1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
2. Networks
  1. Network algorithms
    1. Control path algorithms
      1. Network resources allocation

Published in

VisNEXT'21: Proceedings of the Workshop on Design, Deployment, and Evaluation of Network-assisted Video Streaming
December 2021
31 pages
ISBN:9781450391375
DOI:10.1145/3488662

Copyright © 2021 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
Content Caching
Content Delivery Network
Deep Neural Network
Machine Learning
Natural Language Processing
Popularity Prediction
Qualifiers
- research-article
- Research
- Refereed limited