The Combinatorics of Cache Misses during Matrix Multiplication

doi:10.1006/jcss.2001.1756

Journal of Computer and System Sciences

Volume 63, Issue 1, August 2001, Pages 80-126

https://doi.org/10.1006/jcss.2001.1756 Get rights and content

Under an Elsevier user license

open archive

Abstract

In this paper we construct an analytic model of cache misses during matrix multiplication. The analysis in this paper applies to square matrices of size 2^m where the array layout function is given in terms of a function Θ that interleaves the bits in the binary expansions of the row and column indices. We first analyze the number of cache misses for direct-mapped caches and then indicate how to extend this analysis to $A$ -way associative caches. The work in this paper accomplishes two things. First, we construct fast algorithms to estimate the number of cache misses. Second, we develop a theoretical understanding of cache misses that will allow us, in subsequent work, to approach the problem of minimizing cache misses by appropriately choosing the bit interleaving function that goes into the array layout function.

Cited by (0)

^☆: This work was supported in part by DARPA Grant DABT63-98-1-0001, NSF Grants EIA-97-26370 and CDA-95-12356, NSF Career Award MIP-97-02547, The University of North Carolina at Chapel Hill, Duke University, and an equipment donation through Intel Corporation's Technology for Education 2000 Program. The views and conclusions contained herein are those of the authors and should not be interpreted as representing the official policies or endorsements, either expressed or implied, of DARPA or the U.S. Government.

Journal of Computer and System Sciences

Regular ArticleThe Combinatorics of Cache Misses during Matrix Multiplication☆

Abstract

Regular Article
The Combinatorics of Cache Misses during Matrix Multiplication☆