skip to main content
10.1145/3488662.3498658acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

An ML-Based Approach for Near Real-Time Content Caching

Published:07 December 2021Publication History

ABSTRACT

Content caching is a well-known promising solution to address large demands for streaming companies. This paper presents an ongoing work towards improving CDN network traffic focusing on users' quality of experience (QoE) by anticipating which videos will be popular on Globo's platform. To do so, a deep neural network approach was chosen to model video's popularity based on its metadata and a near real-time framework is presented describing how to make content caching in a preemptive way. Additionally, a threshold selection approach is presented defining whether a video should be cached or not. The presented approach allows making content cache without any user interaction, aiming to decide about the admission of the content before it starts to receive requests. This approach is important to most of the daily published videos at Globo, especially for breaking news. Using Globo's real-world data, we demonstrate the popularity predictor results and conclude with some directions for future works.

Skip Supplemental Material Section

Supplemental Material

3488662.3498658.mp4

mp4

599.9 MB

References

  1. Charu C Aggarwal et al. 2016. Recommender Systems. Springer.Google ScholarGoogle Scholar
  2. Parnia Bahar, Tobias Bieschke, and Hermann Ney. 2019. A comparative study on end-to-end speech to text translation. In 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE, 792--799.Google ScholarGoogle ScholarCross RefCross Ref
  3. Daniel S Berger, Ramesh K Sitaraman, and Mor Harchol-Balter. 2017. Adaptsize: Orchestrating the hot object memory cache in a content delivery network. In 14th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 17). 483--498.Google ScholarGoogle Scholar
  4. Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5 (2017), 135--146.Google ScholarGoogle ScholarCross RefCross Ref
  5. Fangfei Chen, Ramesh K Sitaraman, and Marcelo Torres. 2015. End-user mapping: Next generation request routing for content delivery. ACM SIGCOMM Computer Communication Review 45, 4 (2015), 167--181.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Ludmila Cherkasova. 1998. Improving WWW proxies performance with greedy-dual-size-frequency caching policy. Hewlett-Packard Laboratories.Google ScholarGoogle Scholar
  7. Mattia Antonino Di Gangi, Matteo Negri, Roldano Cattoni, Dessi Roberto, and Marco Turchi. 2019. Enhancing transformer for end-to-end speech-to-text translation. In Machine Translation Summit XVII. European Association for Machine Translation, 21--31.Google ScholarGoogle Scholar
  8. Fernando Ferraz do Nascimento, Dani Gamerman, and Hedibert Freitas Lopes. 2012. A semiparametric Bayesian approach to extreme value estimation. Statistics and Computing 22, 2 (2012), 661--675.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Nathan Hartmann, Erick Fonseca, Christopher Shulby, Marcos Treviso, Jessica Rodrigues, and Sandra Aluisio. 2017. Portuguese word embeddings: Evaluating on word analogies and natural language tasks. arXiv preprint arXiv:1708.06025 (2017).Google ScholarGoogle Scholar
  10. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An introduction to Statistical Learning. Vol. 112. Springer.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Vadim Kirilin, Aditya Sundarrajan, Sergey Gorinsky, and Ramesh K Sitaraman. 2020. RL-Cache: Learning-based cache admission for content delivery. IEEE Journal on Selected Areas in Communications 38, 10 (2020), 2372--2385.Google ScholarGoogle ScholarCross RefCross Ref
  12. Samuel Kotz and Saralees Nadarajah. 2000. Extreme value distributions: theory and applications. World Scientific.Google ScholarGoogle Scholar
  13. Donghee Lee, Jongmoo Choi, Jong-Hun Kim, Sam H Noh, Sang Lyul Min, Yookun Cho, and Chong Sang Kim. 2001. LRFU: A spectrum of policies that subsumes the least recently used and least frequently used policies. IEEE transactions on Computers 50, 12 (2001), 1352--1361.Google ScholarGoogle Scholar
  14. Bruce M Maggs and Ramesh K Sitaraman. 2015. Algorithmic nuggets in content delivery. ACM SIGCOMM Computer Communication Review 45, 3 (2015), 52--66.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Jose L Martinez-Rodriguez, Aidan Hogan, and Ivan Lopez-Arevalo. 2020. Information extraction meets the semantic web: a survey. Semantic Web 11, 2 (2020), 255--335.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Iacopo Masi, Yue Wu, Tal Hassner, and Prem Natarajan. 2018. Deep face recognition: A survey. In 2018 31st SIBGRAPI conference on graphics, patterns and images (SIBGRAPI). IEEE, 471--478.Google ScholarGoogle ScholarCross RefCross Ref
  17. Paulo Renato C Mendes, Antonio José G Busson, Sérgio Colcher, Daniel Schwabe, Álan Lívio V Guedes, and Carlos Laufer. 2020. A Cluster-Matching-Based Method for Video Face Recognition. In Proceedings of the Brazilian Symposium on Multimedia and the Web. 97--104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Kathlene Morales and Byeong Kil Lee. 2012. Fixed segmented LRU cache replacement scheme with selective caching. In 2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC). IEEE, 199--200.Google ScholarGoogle ScholarCross RefCross Ref
  19. Diego Moussallem, Ricardo Usbeck, Michael Röeder, and Axel-Cyrille Ngonga Ngomo. 2017. MAG: A multilingual, knowledge-base agnostic and deterministic entity linking approach. In Proceedings of the Knowledge Capture Conference. 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R Gary Parker and Ronald L Rardin. 2014. Discrete optimization. Elsevier.Google ScholarGoogle Scholar
  21. Rafael Pena, Felipe A Ferreira, Frederico Caroli, Luiz José Schirmer Silva, and Hélio Lopes. 2020. Globo Face Stream: A System for Video Meta-data Generation in an Entertainment Industry Setting.. In ICEIS (1). 350--358.Google ScholarGoogle Scholar
  22. Mirco Ravanelli, Titouan Parcollet, and Yoshua Bengio. 2019. The pytorch-kaldi speech recognition toolkit. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 6465--6469.Google ScholarGoogle ScholarCross RefCross Ref
  23. Ozge Sevgili, Artem Shelmanov, Mikhail Arkhipov, Alexander Panchenko, and Chris Biemann. 2020. Neural entity linking: A survey of models based on deep learning. arXiv preprint arXiv:2006.00575 (2020).Google ScholarGoogle Scholar
  24. SM Shahrear Tanzil, William Hoiles, and Vikram Krishnamurthy. 2017. Adaptive scheme for caching YouTube content in a cellular network: Machine learning approach. Ieee Access 5 (2017), 5870--5881.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An ML-Based Approach for Near Real-Time Content Caching

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        VisNEXT'21: Proceedings of the Workshop on Design, Deployment, and Evaluation of Network-assisted Video Streaming
        December 2021
        31 pages
        ISBN:9781450391375
        DOI:10.1145/3488662

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 December 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)21
        • Downloads (Last 6 weeks)4

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader