Abstract
In the current embedded processors for media applications, up to 30% of the total processor power is consumed in the instruction memory hierarchy. In this context, we present an inherently low energy clustered instruction memory hierarchy template. Small instruction memories are distributed over groups of functional units and the interconnects are localized in order to minimize energy consumption. Furthermore, we present a simple profile based algorithm to optimally synthesize the L0 clusters, for a given application. Using a few representative multimedia benchmarks we show that up to 45% of the L0 buffer energy can be reduced using our clustering approach.
This work is supported in part by MESA under the MEDEA+ program
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. S. Bajwa, et al., “Instruction Buffering to Reduce Power in Processors for Signal Processing”, IEEE Transactions on VLSI systems, vol 5, no 4, Dec 1997.
N. Bellas, et al., “Architectural and Compiler Support for Energy Reduction in Memory Hierarchy of High Performance Microprocessors”, ISLPED 1998.
L. H. Lee, et al., “Instruction Fetch Energy Reduction Using Loop Caches For Applications with Small Tight Loops”, ISLPED 1999.
M. Jacome, et al., “Design Challenges for New Application Specific Processors”, Special Issue System Design of Embedded Systems, IEEE Design & Test of Computers, April–June 2000.
M. Jacome, et al., “Exploring Performance Tradeofis for Clustered VLIW ASIPs”, ICCAD, November 2000.
Texas Instruments Inc, Technical Report, “TMS3206000 Power Consumption Summary”, http://www.ti.com
Texas Instruments Inc, “TMS320C6000 CPU and Instruction Set Reference Guide”, http://www.ti.com
M. Jayapala, et al., “Loop Cache (Buffer) Organization: Energy Analysis and Partitioning”, Technical Report K.U.Leuven/ESAT, 22 Jan 2002.
A. Wolfe, et al., “Datapath Design for a VLIW Video Signal Processor”, IEEE Symposium on High-Performance Computer Architecture (HPCA’97).
S. Kim, et al., “Power-aware Partitioned Cache Architectures”, ISLPED 2001.
M. Huang, et al., “L1 Data Cache Decomposition for Energy Efficiency”, ISLPED 2001.
Trimaran, An Infrastructure for Research in Instruction-Level Parallelism, 1999. http://www.trimaran.org
Ching-Long Su, et al., “Cache Design Trade-offs for Power and Performance Optimization: A Case Study”, ISLPED 1995.
L. Nachtergaele, V. Tiwari and N. Dutt, “System and Architectural-Level Power Reduction of Microprocessor-based Communication and Multimedia Applications”, ICCAD 2000.
D. Brooks, et al., “Wattch: A Framework for Architectural-Level Power Analysis and Optimizations”, ISCA 2000.
S.J.E. Wilton and N.P. Jouppi, “CACTI: an enhanced cache access and cycle time model”, IEEE Journal of Solid-State Circuits, 31(5):677–688, 1996.
P. Petrov and A. Orailoglu, “Power Efficient Embedded Processor IPs through Application-Specific Tag Compression in Data Caches”, Design and Test in Europe Conf (DATE), April 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jayapala, M., Barat, F., de Beeck, P.O., Catthoor, F., Deconinck, G., Corporaal, H. (2002). A Low Energy Clustered Instruction Memory Hierarchy for Long Instruction Word Processors. In: Hochet, B., Acosta, A.J., Bellido, M.J. (eds) Integrated Circuit Design. Power and Timing Modeling, Optimization and Simulation. PATMOS 2002. Lecture Notes in Computer Science, vol 2451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45716-X_26
Download citation
DOI: https://doi.org/10.1007/3-540-45716-X_26
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44143-4
Online ISBN: 978-3-540-45716-9
eBook Packages: Springer Book Archive