Abstract
Memory accesses often account for about half of a microprocessor system's power consumption. Customizing a microprocessor cache's total size, line size, and associativity to a particular program is well known to have tremendous benefits for performance and power. Until recently, customizing caches has been restricted to core-based flows, in which a new chip will be fabricated. However, several configurable cache architectures have been proposed recently for use in prefabricated microprocessor platforms. Tuning those caches to a program nevertheless remains a cumbersome task left to designers, assisted in part by recent computer-aided design (CAD) tuning aids. We propose to move that CAD on-chip, which can greatly increase the acceptance of tunable caches. We introduce on-chip hardware implementing an efficient cache tuning heuristic that can automatically, transparently, and dynamically tune the cache to an executing program. Our heuristic not only reduces the number of configurations that must be examined, but also traverses the search space in a way that minimizes costly cache flushes. By simulating numerous Powerstone and MediaBench benchmarks, we show that such a dynamic self-tuning cache saves on average 40% of total memory access energy over a standard nontuned reference cache.
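To make concrete how a tuning heuristic can examine far fewer configurations than an exhaustive search, the following is a minimal sketch of a greedy, one-parameter-at-a-time search. It is an illustration under stated assumptions, not the heuristic implemented in the hardware described above. The parameter ranges in `SPACE`, the tuning order, and the `toy_energy` model are all hypothetical, and cache flushes are not modeled.

```python
from typing import Callable, Dict, Sequence, Tuple

Config = Dict[str, int]

# Hypothetical parameter ranges; the abstract does not state the real ones.
SPACE: Dict[str, Sequence[int]] = {
    "size_kb": [2, 4, 8],
    "line_bytes": [16, 32, 64],
    "ways": [1, 2, 4],
}


def tune(
    energy: Callable[[Config], float],
    space: Dict[str, Sequence[int]] = SPACE,
    order: Tuple[str, ...] = ("size_kb", "line_bytes", "ways"),  # assumed order
) -> Tuple[Config, float, int]:
    """Greedy search: tune one parameter at a time, smallest value first.

    Returns the chosen configuration, its energy, and the number of
    configurations that were evaluated.
    """
    config = {name: space[name][0] for name in order}
    best = energy(config)
    examined = 1
    for name in order:
        for value in space[name][1:]:
            candidate = dict(config, **{name: value})
            cost = energy(candidate)
            examined += 1
            if cost < best:
                config, best = candidate, cost
            else:
                break  # stop growing this parameter once energy stops improving
    return config, best, examined


def toy_energy(cfg: Config) -> float:
    """Hypothetical energy model, for illustration only."""
    # Miss-related energy falls with capacity and is lowest at 32-byte lines.
    miss = 48.0 / cfg["size_kb"] + abs(cfg["line_bytes"] - 32) / 16.0
    # Per-access energy grows with capacity and associativity.
    access = 2.0 * cfg["size_kb"] + 1.5 * cfg["ways"]
    return miss + access


if __name__ == "__main__":
    chosen, cost, examined = tune(toy_energy)
    print(chosen, cost, examined)
```

With this toy model the search evaluates 6 of the 27 possible configurations. A greedy search of this kind is not guaranteed to find the global minimum for an arbitrary energy function; it trades that guarantee for a much shorter search.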