Enhancing Last-Level Cache Performance by Block Bypassing and Early Miss Determination

Dybdahl, Haakon; Stenström, Per

doi:10.1007/11859802_6

Haakon Dybdahl¹⁸ &
Per Stenström¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4186))

Included in the following conference series:

Asia-Pacific Conference on Advances in Computer Systems Architecture

801 Accesses

Abstract

While bypassing algorithms have been applied to the first-level cache, we study for the first time their effectiveness for the last-level caches for which miss penalties are significantly higher and where algorithm complexity is not constrained by the speed of the pipeline. Our algorithm monitors the reuse behavior of blocks that are touched by delinquent loads and re-classify them on-the-fly. Blocks classified as bypassed are only installed in the level-1 cache. We leverage the algorithm to early send out a miss request for loads expected to request blocks classified to be bypassed. Such requests are sent to memory directly without tag checks at intermediary levels in the cache hierarchy. Overall, we find that we can robustly reduce the miss rate by 23% and improve IPC with 14% on average for memory bound SPEC2000 applications without degrading performance of the other SPEC2000 applications.

This work is partly sponsored by the HiPEAC Network of Excellence funded by EU under FP6.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Locality-aware data replication in the last-level cache for large scale multicores

Article 04 February 2016

Improving System Turnaround Time with Intel CAT by Identifying LLC Critical Applications

Reducing the second-level cache conflict misses using a set folding technique

Article 01 November 2017

References

Abraham, S.G., Sugumar, R.A., Windheiser, D., Rau, B.R., Gupta, R.: Predictability of load/store instruction latencies. In: MICRO, vol. 26 (1993)
Google Scholar
Austin, T., Larson, E., Ernst, D.: SimpleScalar: an infrastructure for computer system modeling. IEEE Computer 35(2) (2002)
Google Scholar
Belady, L.: A study of replacement algorithms for a virtual-storage computer. IBM Systems Journal 5(2), 78–101 (1966)
Article Google Scholar
Chi, C.-H., Dietz, H.: Improving cache performance by selective cache bypass. In: Annual Hawaii International Conference on System Sciences (1989)
Google Scholar
Jalminger, J., Stenstrom, P.: A cache block reuse prediction scheme. Microprocessors and Microsystems V28, 373–385 (2004)
Article Google Scholar
John, L.K., Subramanian, A.: Design and performance evaluation of a cache assist to implement selective caching. In: Proc. of Intl. Conf. on Comp. Design (1997)
Google Scholar
Johnson, T.L., Connors, D.A., Merten, M.C., Hwu, W.-M.W.: Run-time cache bypassing. IEEE Transactions on Computers 48(12), 1338–1354 (1999)
Article Google Scholar
Kampe, M., Stenström, P., Dubois, M.: Self-correcting LRU replacement policies. In: CF 2004: Proc. of the 1st conf. on Computing frontiers (2004)
Google Scholar
Karlsson, M., Hagersten, E.: Timestamp-based selective cache allocation. In: High Performance Memory Systems. Springer, Heidelberg (2003)
Google Scholar
McFarling, S.: Cache replacement with dynamic exclusion. In: ISCA 1992, pp. 191–200. ACM Press, New York (1992)
Chapter Google Scholar
Memik, G., Reinman, G., Mangione-Smith, W.H.: Just say no: Benefits of early cache miss determination. In: HPCA (2003)
Google Scholar
Panait, V.-M., Sasturkar, A., Wong, W.-F.: Static identification of delinquent loads. In: Int. symp. on Code generation and optimization (2004)
Google Scholar
Rivers, J.A., Tam, E.S., Tyson, G.S., Davidson, E.S., Farrens, M.: Utilizing reuse information in data cache management. In: ICS (1998)
Google Scholar
Sugumar, R.A., Abraham, S.G.: Efficient simulation of caches under optimal replacement with applications to miss characterization. In: Joint International Conference on Measurement and modeling of computer systems (1993)
Google Scholar
Tyson, G., Farrens, M., Matthews, J., Pleszkun, A.R.: A modified approach to cache management. In: MICRO (1995)
Google Scholar
Wong, W.A., Baer, J.-L.: Modified LRU policies for improving second-level cache behavior. In: HPCA (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer and Information Science, Norwegian University of Science and Technology, N-7491, Trondheim, Norway
Haakon Dybdahl
Dept. of Computer Engineering, Dept. of Computer Engineering, Chalmers University of Technology, S-412 96, Goteborg, Sweden
Per Stenström

Authors

Haakon Dybdahl
View author publications
You can also search for this author in PubMed Google Scholar
Per Stenström
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Systems Architecture Group, University of Amsterdam, The Netherlands
Chris Jesshope
School of Computer Science, University of Hertfordshire, College Lane, AL10 9AB, Hatfield, UK
Colin Egan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dybdahl, H., Stenström, P. (2006). Enhancing Last-Level Cache Performance by Block Bypassing and Early Miss Determination. In: Jesshope, C., Egan, C. (eds) Advances in Computer Systems Architecture. ACSAC 2006. Lecture Notes in Computer Science, vol 4186. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11859802_6

Download citation

DOI: https://doi.org/10.1007/11859802_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40056-1
Online ISBN: 978-3-540-40058-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics