research-article

Smart Cache Insertion and Promotion Policy for Content Delivery Networks

Authors:

Yanxiong ChenAuthors Info & Claims

ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing

Pages 183 - 192

https://doi.org/10.1145/3605573.3605581

Published: 13 September 2023 Publication History

Abstract

Improving hit rates can be achieved by enhancing cache replacement algorithms with the identification of zero-reuse objects (ZROs) and inserting them at the end of the cache queue. Note that the promotion policy needs to achieve a similar task as the above insertion policy since the hit object may immediately become a ZRO (called P-ZRO) that is not suitable for placement at the front of the queue. However, existing studies have yet to consider P-ZROs, and current insertion algorithms struggle to simultaneously identify both ZROs and P-ZROs. To address these issues, we propose integrating the insertion and promotion policies. We do this by treating hit objects as special missing objects and employing reinforcement learning to create a unified model for both policies, where the learning function recognizes the relationship between performance changes and the emergence of ZROs and P-ZROs. Our proposed solution is a smart cache insertion and promotion policy (SCIP) that dynamically adjusts the insertion position using a bimodal insertion policy for both missing and hit objects, guided by the model. Extensive experiments demonstrate that SCIP significantly improves overall performance in real-world content delivery network systems and outperforms state-of-the-art insertion policies in terms of miss ratios in the simulator. In addition, deploying SCIP on optimal cache replacement algorithms can further decrease their miss ratios.

Supplemental Material

PDF File

"Appendix"

Download
220.99 KB

References

[1]

Odd O Aalen. 1989. A linear regression model for the analysis of life times. Statistics in medicine 8, 8 (1989), 907–925.

[2]

Nathan Beckmann, Haoxian Chen, and Asaf Cidon. 2018. LHD: Improving Cache Hit Rate by Maximizing Hit Density. In USENIX NSDI. 389–403.

[3]

Laszlo A. Belady. 1966. A study of replacement algorithms for a virtual-storage computer. IBM Systems journal 5, 2 (1966), 78–101.

Digital Library

[4]

Daniel S Berger, Ramesh K Sitaraman, and Mor Harchol-Balter. 2017. Adaptsize: Orchestrating the hot object memory cache in a content delivery network. In NSDI, Vol. 17. 483–498.

[5]

Ludmila Cherkasova and Gianfranco Ciardo. 2001. Role of aging, frequency, and size in web cache replacement policies. In Springer HPCN. 114–123.

[6]

Gil Einziger, Roy Friedman, and Ben Manes. 2017. Tinylfu: A highly efficient cache admission policy. ACM TOS 13, 4 (2017), 1–31.

Digital Library

[7]

Jerome H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics (2001), 1189–1232.

[8]

Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, 2014. Practical lessons from predicting clicks on ads at facebook. In Proceedings of the eighth international workshop on data mining for online advertising. 1–9.

Digital Library

[9]

Song Jiang and Xiaodong Zhang. 2002. LIRS: An efficient low inter-reference recency set replacement policy to improve buffer cache performance. ACM SIGMETRICS Performance Evaluation Review 30, 1 (2002), 31–42.

Digital Library

[10]

Daniel A Jiménez. 2013. Insertion and promotion for tree-based pseudolru last-level caches. In IEEE/ACM MICRO. 284–296.

[11]

Thorsten Joachims. 1998. Making large-scale SVM learning practical. Technical Report. Technical report.

[12]

Samira Khan and Daniel A Jiménez. 2010. Insertion policy selection using decision tree analysis. In IEEE ICCD. IEEE, 106–111.

[13]

Chunhua Li, Man Wu, Yuhan Liu, Ke Zhou, Ji Zhang, and Yunqing Sun. 2022. SS-LRU: a smart segmented LRU caching. In ACM/IEEE DAC. 397–402.

[14]

Sujit Kr Mahto, Suhit Pai, Virendra Singh, 2017. DAAIP: Deadblock aware adaptive insertion policy for high performance caching. In IEEE ICCD. 345–352.

[15]

Nimrod Megiddo and Dharmendra S Modha. 2003. ARC: A Self-Tuning, Low Overhead Replacement Cache. In USENIX FAST, Vol. 3. 115–130.

[16]

Elizabeth J O’neil, Patrick E O’neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. Acm Sigmod Record 22, 2 (1993), 297–306.

Digital Library

[17]

Moinuddin K Qureshi, Aamer Jaleel, Yale N Patt, Simon C Steely, and Joel Emer. 2007. Adaptive insertion policies for high performance caching. ACM SIGARCH Computer Architecture News 35, 2 (2007), 381–391.

Digital Library

[18]

Liana V Rodriguez, Farzana Beente Yusuf, Steven Lyons, Eysler Paz, Raju Rangaswami, Jason Liu, Ming Zhao, and Giri Narasimhan. 2021. Learning Cache Replacement with CACHEUS. In FAST. 341–354.

[19]

Subhash Sethumurugan, Jieming Yin, and John Sartori. 2021. Designing a cost-effective cache replacement policy using machine learning. In IEEE HPCA. 291–303.

[20]

D Shasha and T Johnson. 1994. 2q: A low overhead high performance buffer management replacement algorithm. In ACM VLDB. 439–450.

[21]

Walter L Smith. 1955. Regenerative stochastic processes. Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences 232, 1188 (1955), 6–31.

[22]

Zhenyu Song, Daniel S Berger, Kai Li, Anees Shaikh, Wyatt Lloyd, Soudeh Ghorbani, Changhoon Kim, Aditya Akella, Arvind Krishnamurthy, Emmett Witchel, 2020. Learning relaxed belady for content distribution network caching. In USENIX NSDI. 529–544.

[23]

Giuseppe Vietri, Liana V Rodriguez, Wendy A Martinez, Steven Lyons, Jason Liu, Raju Rangaswami, Ming Zhao, and Giri Narasimhan. 2018. Driving Cache Replacement with ML-based LeCaR. In HotStorage. 928–936.

[24]

Hua Wang, Xinbo Yi, Ping Huang, Bin Cheng, and Ke Zhou. 2018. Efficient SSD caching by avoiding unnecessary writes using machine learning. In ACM ICPP. 1–10.

[25]

Peng Wang, Yu Liu, Zhelong Zhao, Ke Zhou, Zhihai Huang, and Yanxiong Chen. 2022. Adaptive Size-Aware Cache Insertion Policy for Content Delivery Networks. In IEEE ICCD. 195–202.

[26]

Carole-Jean Wu, Aamer Jaleel, Will Hasenplaugh, Margaret Martonosi, Simon C Steely Jr, and Joel Emer. 2011. SHiP: Signature-based hit predictor for high performance caching. In IEEE/ACM MICRO. 430–441.

[27]

Yuejian Xie and Gabriel H Loh. 2009. PIPP: Promotion/insertion pseudo-partitioning of multi-core shared caches. ACM SIGARCH Computer Architecture News 37, 3 (2009), 174–183.

Digital Library

[28]

Gang Yan, Jian Li, and Don Towsley. 2021. Learning from optimal caching for content delivery. In ACM CoNEXT. 344–358.

[29]

Juncheng Yang, Ziming Mao, Yao Yue, and KV Rashmi. 2023. { GL-Cache} : Group-level learning for efficient and high-performance caching. In USENIX FAST. 115–134.

[30]

Yu Zhang, Ping Huang, Ke Zhou, Hua Wang, Jianying Hu, Yongguang Ji, and Bin Cheng. 2020. OSCA: An online-model based cache allocation scheme in cloud block storage systems. In USENIX ATC. 785–798.

[31]

Ke Zhou, Si Sun, Hua Wang, Ping Huang, Xubin He, Rui Lan, Wenyan Li, Wenjie Liu, and Tianming Yang. 2018. Demystifying cache policies for photo stores at scale: A tencent case study. In ACM ICS. 284–294.

Cited By

Index Terms

Smart Cache Insertion and Promotion Policy for Content Delivery Networks
1. Computer systems organization
  1. Dependable and fault-tolerant systems and networks
    1. Redundancy
  2. Embedded and cyber-physical systems
    1. Embedded systems
    2. Robotics
2. Networks
  1. Network properties
    1. Network reliability

Recommendations

Bypass and insertion algorithms for exclusive last-level caches
ISCA '11

Inclusive last-level caches (LLCs) waste precious silicon estate due to cross-level replication of cache blocks. As the industry moves toward cache hierarchies with larger inner levels, this wasted cache space leads to bigger performance losses compared ...
Bypass and insertion algorithms for exclusive last-level caches
ISCA '11: Proceedings of the 38th annual international symposium on Computer architecture

Inclusive last-level caches (LLCs) waste precious silicon estate due to cross-level replication of cache blocks. As the industry moves toward cache hierarchies with larger inner levels, this wasted cache space leads to bigger performance losses compared ...
Combining recency of information with selective random and a victim cache in last-level caches

Memory latency has become an important performance bottleneck in current microprocessors. This problem aggravates as the number of cores sharing the same memory controller increases. To palliate this problem, a common solution is to implement cache ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing

August 2023

858 pages

ISBN:9798400708435

DOI:10.1145/3605573

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 September 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Natural Science Foundation of China (key program)
Natural Science Foundation of Hubei Province

Conference

ICPP 2023

ICPP 2023: 52nd International Conference on Parallel Processing

August 7 - 10, 2023

UT, Salt Lake City, USA

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
176
Total Downloads

Downloads (Last 12 months)79
Downloads (Last 6 weeks)5

Reflects downloads up to 27 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten