ABSTRACT
As feature sizes shrink, leakage energy reduction has become increasingly important, especially for cache memories. Recent research in drowsy instruction cache shows that the leakage energy of the instruction cache can be significantly reduced with little performance degradation by exploiting the instruction spatial locality at the cache sub-bank level[5]. The performance penalty due to the sub-bank wake-up latency is dramatically reduced by using a prediction buffer to pre-activate the next sub-bank at runtime. However, consulting the prediction buffer at every cache access consumes non-trivial dynamical energy, which can compromise the overall energy savings substantially. This paper proposes a static approach to capture the sub-bank transition behavior at link time and to pre-activate the instruction cache sub-bank at runtime according to the compiler-directed hints. We also propose a hybrid approach to exploit both the static and dynamic information for reducing the performance penalty further with little dynamic energy overhead. Our experiments reveal that the static approach is very successful in capturing the sub-bank transition behavior for reducing the performance penalty and it also reduces 38.2% more leakage energy than the hardware-based approach, taking the dynamic energy overhead into account. Moreover, our results show that the hybrid approach is the best strategy for the drowsy instruction cache to balance leakage energy reduction and performance.
- Y. Ye, S. Borkar, and V. De. A new technique for standby leakage reduction in high-performance circuits. In Proc. the Symposium on VLSI Circuits, pp. 40--41, 1998Google ScholarCross Ref
- M. D. Powell, S. Yang, B. Falsafi, K. Roy, and T. N. Vijaykumar. Reducing Leakage in a High-Performance Deep-Submicron Instruction Cache. IEEE Transactions on VLSI, Vol. 9, No. 1, February 2001. Google ScholarDigital Library
- S. Kaxiras, Z. Hu, M. Martonosi. Cache decay: exploiting generational behavior to reduce cache leakage power. In Proc. of ISCA, 2001. Google ScholarDigital Library
- H. Zhou, M. C. Toburen, E. Rotenberg, and T. M. Conte. Adaptive mode control: a static power-efficient cache design. In Proc. of PACT, 2001. Google ScholarDigital Library
- N.S. Kim, K. Flautner, D. Blaauw, T. Mudge. Drowsy instruction caches. Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture, 2002.Google ScholarDigital Library
- K. Flautner, N.S. Kim, S. Martin, D. Blaauw, T. Mudge. Drowsy caches: simple techniques for reducing leakage power. In Proc. of ISCA 2002. Google ScholarDigital Library
- S. Heo, K. Barr, M. Hampton, and K. Asanovic. Dynamic fine-grain leakage reduction using leakage-biased bitlines. In Proc. of ISCA 2002. Google ScholarDigital Library
- J. A. Butts and G. Sohi. A static power model for architects. In Proc. the International Symposium on Microarchitecture, December 2000. Google ScholarDigital Library
- http://www.trimaran.org.Google Scholar
- http://www.spec.org.Google Scholar
- C. Lee and M. Potkonjak, and W. H. Mangione-Smith. MediaBench: a tool for evaluating and synthesizing multimedia and communications systems. In Proc. the International Symposium on Microarchitecture, pp. 330--335, 1997. Google ScholarDigital Library
- P. Shivakumar and N. Jouppi. CACTI 3.0: An integrated cache timing, power and area model. WRL Research Report 2001.Google Scholar
- W. Zhang, J.S. Hu, V. Degalahal, M. Kandemir, N. Vijaykrishnan, M.J. Irwin. Compiler-directed instruction cache leakage optimization. Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture, 2002. Google ScholarDigital Library
- T. Ball and J. R. Larus. Branch prediction for free. In Proceedings of SIGPLAN Conference on Programming Language Design and Implementation, 1993. Google ScholarDigital Library
- W. W. Hwu, T. M. Conte and P. P. Chang. Comparing software and hardware schemes for reducing the cost of branches. In Proc. of ISCA 1999. Google ScholarDigital Library
Index Terms
- Static next sub-bank prediction for drowsy instruction cache
Recommendations
Compiler-guided next sub-bank prediction for reducing instruction cache leakage energy
Embeded Processors and Systems: Architectural Issues and Solutions for Emerging ApplicationsWith the scaling of technology, leakage energy reduction has become increasingly important, especially for cache memories. Recent studies in drowsy instruction cache show that the leakage energy of the instruction cache can be significantly reduced with ...
An efficient direct mapped instruction cache for application-specific embedded systems
CODES+ISSS '05: Proceedings of the 3rd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesisCaches may consume half of a microprocessor's total power and cache misses incur accessing off-chip memory, which is both time consuming and energy costly. Therefore, minimizing cache power consumption and reducing cache misses are important to reduce ...
An Energy-Efficient Partitioned Instruction Cache Architecture for Embedded Processors
Energy efficiency of cache memories is crucial in designing embedded processors. Reducing energy consumption in the instruction cache is especially important, since the instruction cache consumes a significant portion of total processor energy. This ...
Comments