skip to main content
10.1145/3605573.3605581acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Smart Cache Insertion and Promotion Policy for Content Delivery Networks

Published: 13 September 2023 Publication History

Abstract

Improving hit rates can be achieved by enhancing cache replacement algorithms with the identification of zero-reuse objects (ZROs) and inserting them at the end of the cache queue. Note that the promotion policy needs to achieve a similar task as the above insertion policy since the hit object may immediately become a ZRO (called P-ZRO) that is not suitable for placement at the front of the queue. However, existing studies have yet to consider P-ZROs, and current insertion algorithms struggle to simultaneously identify both ZROs and P-ZROs. To address these issues, we propose integrating the insertion and promotion policies. We do this by treating hit objects as special missing objects and employing reinforcement learning to create a unified model for both policies, where the learning function recognizes the relationship between performance changes and the emergence of ZROs and P-ZROs. Our proposed solution is a smart cache insertion and promotion policy (SCIP) that dynamically adjusts the insertion position using a bimodal insertion policy for both missing and hit objects, guided by the model. Extensive experiments demonstrate that SCIP significantly improves overall performance in real-world content delivery network systems and outperforms state-of-the-art insertion policies in terms of miss ratios in the simulator. In addition, deploying SCIP on optimal cache replacement algorithms can further decrease their miss ratios.

Supplemental Material

PDF File
"Appendix"

References

[1]
Odd O Aalen. 1989. A linear regression model for the analysis of life times. Statistics in medicine 8, 8 (1989), 907–925.
[2]
Nathan Beckmann, Haoxian Chen, and Asaf Cidon. 2018. LHD: Improving Cache Hit Rate by Maximizing Hit Density. In USENIX NSDI. 389–403.
[3]
Laszlo A. Belady. 1966. A study of replacement algorithms for a virtual-storage computer. IBM Systems journal 5, 2 (1966), 78–101.
[4]
Daniel S Berger, Ramesh K Sitaraman, and Mor Harchol-Balter. 2017. Adaptsize: Orchestrating the hot object memory cache in a content delivery network. In NSDI, Vol. 17. 483–498.
[5]
Ludmila Cherkasova and Gianfranco Ciardo. 2001. Role of aging, frequency, and size in web cache replacement policies. In Springer HPCN. 114–123.
[6]
Gil Einziger, Roy Friedman, and Ben Manes. 2017. Tinylfu: A highly efficient cache admission policy. ACM TOS 13, 4 (2017), 1–31.
[7]
Jerome H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics (2001), 1189–1232.
[8]
Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, 2014. Practical lessons from predicting clicks on ads at facebook. In Proceedings of the eighth international workshop on data mining for online advertising. 1–9.
[9]
Song Jiang and Xiaodong Zhang. 2002. LIRS: An efficient low inter-reference recency set replacement policy to improve buffer cache performance. ACM SIGMETRICS Performance Evaluation Review 30, 1 (2002), 31–42.
[10]
Daniel A Jiménez. 2013. Insertion and promotion for tree-based pseudolru last-level caches. In IEEE/ACM MICRO. 284–296.
[11]
Thorsten Joachims. 1998. Making large-scale SVM learning practical. Technical Report. Technical report.
[12]
Samira Khan and Daniel A Jiménez. 2010. Insertion policy selection using decision tree analysis. In IEEE ICCD. IEEE, 106–111.
[13]
Chunhua Li, Man Wu, Yuhan Liu, Ke Zhou, Ji Zhang, and Yunqing Sun. 2022. SS-LRU: a smart segmented LRU caching. In ACM/IEEE DAC. 397–402.
[14]
Sujit Kr Mahto, Suhit Pai, Virendra Singh, 2017. DAAIP: Deadblock aware adaptive insertion policy for high performance caching. In IEEE ICCD. 345–352.
[15]
Nimrod Megiddo and Dharmendra S Modha. 2003. ARC: A Self-Tuning, Low Overhead Replacement Cache. In USENIX FAST, Vol. 3. 115–130.
[16]
Elizabeth J O’neil, Patrick E O’neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. Acm Sigmod Record 22, 2 (1993), 297–306.
[17]
Moinuddin K Qureshi, Aamer Jaleel, Yale N Patt, Simon C Steely, and Joel Emer. 2007. Adaptive insertion policies for high performance caching. ACM SIGARCH Computer Architecture News 35, 2 (2007), 381–391.
[18]
Liana V Rodriguez, Farzana Beente Yusuf, Steven Lyons, Eysler Paz, Raju Rangaswami, Jason Liu, Ming Zhao, and Giri Narasimhan. 2021. Learning Cache Replacement with CACHEUS. In FAST. 341–354.
[19]
Subhash Sethumurugan, Jieming Yin, and John Sartori. 2021. Designing a cost-effective cache replacement policy using machine learning. In IEEE HPCA. 291–303.
[20]
D Shasha and T Johnson. 1994. 2q: A low overhead high performance buffer management replacement algorithm. In ACM VLDB. 439–450.
[21]
Walter L Smith. 1955. Regenerative stochastic processes. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences 232, 1188 (1955), 6–31.
[22]
Zhenyu Song, Daniel S Berger, Kai Li, Anees Shaikh, Wyatt Lloyd, Soudeh Ghorbani, Changhoon Kim, Aditya Akella, Arvind Krishnamurthy, Emmett Witchel, 2020. Learning relaxed belady for content distribution network caching. In USENIX NSDI. 529–544.
[23]
Giuseppe Vietri, Liana V Rodriguez, Wendy A Martinez, Steven Lyons, Jason Liu, Raju Rangaswami, Ming Zhao, and Giri Narasimhan. 2018. Driving Cache Replacement with ML-based LeCaR. In HotStorage. 928–936.
[24]
Hua Wang, Xinbo Yi, Ping Huang, Bin Cheng, and Ke Zhou. 2018. Efficient SSD caching by avoiding unnecessary writes using machine learning. In ACM ICPP. 1–10.
[25]
Peng Wang, Yu Liu, Zhelong Zhao, Ke Zhou, Zhihai Huang, and Yanxiong Chen. 2022. Adaptive Size-Aware Cache Insertion Policy for Content Delivery Networks. In IEEE ICCD. 195–202.
[26]
Carole-Jean Wu, Aamer Jaleel, Will Hasenplaugh, Margaret Martonosi, Simon C Steely Jr, and Joel Emer. 2011. SHiP: Signature-based hit predictor for high performance caching. In IEEE/ACM MICRO. 430–441.
[27]
Yuejian Xie and Gabriel H Loh. 2009. PIPP: Promotion/insertion pseudo-partitioning of multi-core shared caches. ACM SIGARCH Computer Architecture News 37, 3 (2009), 174–183.
[28]
Gang Yan, Jian Li, and Don Towsley. 2021. Learning from optimal caching for content delivery. In ACM CoNEXT. 344–358.
[29]
Juncheng Yang, Ziming Mao, Yao Yue, and KV Rashmi. 2023. { GL-Cache} : Group-level learning for efficient and high-performance caching. In USENIX FAST. 115–134.
[30]
Yu Zhang, Ping Huang, Ke Zhou, Hua Wang, Jianying Hu, Yongguang Ji, and Bin Cheng. 2020. OSCA: An online-model based cache allocation scheme in cloud block storage systems. In USENIX ATC. 785–798.
[31]
Ke Zhou, Si Sun, Hua Wang, Ping Huang, Xubin He, Rui Lan, Wenyan Li, Wenjie Liu, and Tianming Yang. 2018. Demystifying cache policies for photo stores at scale: A tencent case study. In ACM ICS. 284–294.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing
August 2023
858 pages
ISBN:9798400708435
DOI:10.1145/3605573
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 September 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content delivery network
  2. insertion policy
  3. replacement algorithm

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • National Natural Science Foundation of China (key program)
  • Natural Science Foundation of Hubei Province

Conference

ICPP 2023
ICPP 2023: 52nd International Conference on Parallel Processing
August 7 - 10, 2023
UT, Salt Lake City, USA

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 176
    Total Downloads
  • Downloads (Last 12 months)79
  • Downloads (Last 6 weeks)5
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media