skip to main content
article

Cache optimization for embedded processor cores: An analytical approach

Published: 01 October 2004 Publication History

Abstract

Embedded microprocessor cores are increasingly being used in embedded and mobile devices. The software running on these embedded microprocessor cores is often a priori known; thus, there is an opportunity for customizing the cache subsystem for improved performance. In this work, we propose an efficient algorithm to directly compute cache parameters satisfying desired performance criteria. Our approach avoids simulation and exhaustive exploration, and, instead, relies on an exact algorithmic approach. We demonstrate the feasibility of our algorithm by applying it to a large number of embedded system benchmarks.

References

[1]
Baer, J. and Wang, W. 1998. On the inclusion properties for multi-level cache hierarchies. In Proceedings of the International Conference on Computer Architecture.
[2]
Balasubramonian, R., Albonesi, D., Buyuktosunoglu, A., and Dwarkadas, S. 2000. Memory hierarchy reconfiguration for energy and performance in general-purpose processor architectures. In Proceedings of the International Symposium on Microarchitecture.
[3]
Harper, J., Kerbyson, D., and Nudd, G. 1999. Analytical modeling of set-associative cache behavior. IEEE Trans. Comput. 48, 10 (Oct.), 1009--1024.
[4]
Hill, M. 1987. Aspects of cache memory and instruction buffer performance. Ph.D. dissertation. University of California, Berkeley, Berkeley, Calif.
[5]
Intel Pentium IV. ftp://download.intel.com/design/Pentium4/manuals/24896609.pdf.
[6]
International Technology Roadmap for Semiconductors. public.itrs.net.
[7]
Kirovski, D., Lee, C., Potkonjak, M., and Mangione-Smith, W. 1998. Synthesis of power efficient systems-on-silicon. In Proceedings of Asian South Pacific Design Automation Conference.
[8]
Kozyrakis, C. and Patterson, D. 1998. A new direction for computer architecture research. IEEE Comput. 31, 11 (Nov.), 24--32.
[9]
Lajolo, M., Raghunathan, A., Dey, S., Lavagno, L., and Sangiovanni-Vincentelli, A. 1999. Efficient power estimation techniques for HW/SW systems. In Proceedings of IEEE Alessandro Volta Memorial Workshop on Low-Power Design. IEEE Computer Society Press, Los Alamitos, Calif.
[10]
Lee, C., Potkonjak, M., and Mangione-Smith, W. 1997. MediaBench: A tool for evaluating and synthesizing multimedia and communications systems. In Proceedings of the International Symposium on Microarchitecture.
[11]
Li, Y. and Henkel, J. 1998. A framework for estimating and minimizing energy dissipation of embedded HW/SW systems. In Proceedings of the Design Automation Conference.
[12]
Brehob, M. and Enbody, R. J. 1996. An analytical model of locality and caching. Tech. rep., Michigan State University.
[13]
Malik, A., Moyer, B., and Cermak, D. 2000. A lower power unified cache architecture providing power and performance flexibility. In Proceedings of the International Symposium on Low Power Electronics and Design.
[14]
Mattson, R., Gecsei, J., Slutz, D., and Traiger, I. 1970. Evaluation techniques for storage hierarchies. IBM Syst. J. 9, 2.
[15]
Motorola MPC500. http://e www.motorola.com/files/platforms/doc/ref_manual/MGT560RM.pdf.
[16]
Motorola MPC5200. http://e www.motorola.com/files/32bit/doc/ref_manual/G2CORERM.pdf.
[17]
Motorola MPC823. http://e-www.motorola.com/files/if/cnb/MPC823UM.pdf.
[18]
Petrov, P. and Orailoglu, A. 2001. Towards effective embedded processors in codesigns: Customizable partitioned caches. In Proceedings of the International Workshop on HW/SW Codesign.
[19]
Sato, T. 2000. Evaluating trace cache on moderate-scale processors. IEEE Comput. 147, 6 (Nov), 369--374.
[20]
Shiue, W. and Chakrabarti, C. 1999. Memory exploration for low power embedded systems. In Proceedings of the Design Automation Conference.
[21]
Su, C. and Despain, A. 1995. Cache design trade-offs for power and performance optimization: A case study. In Proceedings of the International Symposium on Low Power Electronics and Design.
[22]
Suzuki, K., Arai, T., and Kouhei, N. 1998. V830R/AV: Embedded multimedia superscalar RISC processor. IEEE Micro 18, 2 (Mar.), 36--47.
[23]
Vahid, F., and Givargis, T. 1999. The case for a configure-and-execute paradigm. In Proceedings of the International Symposium on Low Power Electronics and Design.
[24]
Wilton, S. and Jouppi, N. 1996. CACTI: An enhanced cache access and cycle time model. IEEE J. Solid State Circ. 31, 5 (May), 677--688.
[25]
Wu, Z. and Wolf, W. 1999. Iterative cache simulation of embedded CPUs with trace stripping. In Proceedings of International Workshop on HW/SW Codesign.

Cited By

View all
  • (2020)Bayesian Optimization for Efficient Accelerator SynthesisACM Transactions on Architecture and Code Optimization10.1145/342737718:1(1-25)Online publication date: 30-Dec-2020
  • (2018)Realizing Closed-Loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges2018 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)10.1109/ISVLSI.2018.00136(719-725)Online publication date: Jul-2018
  • (2018)Analytical Two-Level Near Threshold Cache Exploration for Low Power Biomedical ApplicationsAdvanced Computer Architecture10.1007/978-981-13-2423-9_8(95-108)Online publication date: 13-Sep-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Design Automation of Electronic Systems
ACM Transactions on Design Automation of Electronic Systems  Volume 9, Issue 4
October 2004
144 pages
ISSN:1084-4309
EISSN:1557-7309
DOI:10.1145/1027084
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 01 October 2004
Published in TODAES Volume 9, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cache optimization
  2. core-based design
  3. design space exploration
  4. system-on-a-chip

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Bayesian Optimization for Efficient Accelerator SynthesisACM Transactions on Architecture and Code Optimization10.1145/342737718:1(1-25)Online publication date: 30-Dec-2020
  • (2018)Realizing Closed-Loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges2018 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)10.1109/ISVLSI.2018.00136(719-725)Online publication date: Jul-2018
  • (2018)Analytical Two-Level Near Threshold Cache Exploration for Low Power Biomedical ApplicationsAdvanced Computer Architecture10.1007/978-981-13-2423-9_8(95-108)Online publication date: 13-Sep-2018
  • (2017)PACT: Priority-Aware Phase-Based Cache Tuning for Embedded Systems2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)10.1109/ISVLSI.2017.77(403-408)Online publication date: Jul-2017
  • (2015)Superoptimized Memory Subsystems for Streaming ApplicationsProceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays10.1145/2684746.2689069(126-135)Online publication date: 22-Feb-2015
  • (2015)Speeding up single pass simulation of PLRUt cachesThe 20th Asia and South Pacific Design Automation Conference10.1109/ASPDAC.2015.7059091(695-700)Online publication date: Jan-2015
  • (2015)Superoptimizing Memory Subsystems for Multiple ObjectivesEuro-Par 2015: Parallel Processing Workshops10.1007/978-3-319-27308-2_29(352-363)Online publication date: 18-Dec-2015
  • (2014)Superoptimization of memory subsystemsACM SIGPLAN Notices10.1145/2666357.259781649:5(145-154)Online publication date: 12-Jun-2014
  • (2014)Superoptimization of memory subsystemsProceedings of the 2014 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems10.1145/2597809.2597816(145-154)Online publication date: 12-Jun-2014
  • (2014)MASH{fifo}Proceedings of the 51st Annual Design Automation Conference10.1145/2593069.2593159(1-6)Online publication date: 1-Jun-2014
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media