skip to main content
10.1145/1128022.1128069acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
Article

Static cache partitioning robustness analysis for embedded on-chip multi-processors

Published: 03 May 2006 Publication History

Abstract

In this paper we analyze the robustness of multi-tasking applications when mapped on an on-chip multiprocessor platform. We assume a multiprocesso structure which embeds a hierarchical cache organization with two levels. The first one is private to the processor cores while the second one is shared among the processors. To enable compositionality, i.e, to be able to evaluate the system's performance out of the individual task's performance, the second level of cache (L2) is partitioned per task basis. Two robustness aspects are relevant in this context: internal (performance deviations are caused by the tasks comprising the application) and external (performance variations are caused by external stimuli). First we introduce two metrics to quantify the robustness. The internal robustness is estimated by a sensitivity function which measures the performance variations induced by the %private inter-task cache interference. The external robustness is quantified by a stability function which reflects the variations induced by different input data on the partitioned L2 behavior. Subsequently, we exercise our method on two applications (H.264 and picture-in-picture TV) running on a CAKE multi-processor platform. Our experiments indicate that, if the cache is partitioned, the sensitivity is 8% and 5% for the H.264 and PiPTV, respectively. For the shared cache scenario the sensitivity is 40% and 50% for the H.264 and PiPTV, respectively. The variations induced in the L2 behavior by various input data sets are at most 4% for the PiPTV application, respectively 9% for the H.264 decoder. This accounts for a stability of at least 96%, respectively 91%, therefore, for the investigated applications, we can conclude that the static cache partitioning is quite robust to input stimuli.

References

[1]
A. Stevens, "Level 2 Cache for High-performance ARM Core-based SoC Systems", ARM white paper, 2004
[2]
B.A. Nayfeh and K. Olukotun, "Exploring the Design Space for a Shared-Cache Multiprocessor", In Proceedings, ISCA, pages 166--175, 1994
[3]
J.T.J. van Eijndhoven, J. Hoogerbrugge, M.N. Jayram, P. Stravers, and A. Terechko, "Cache-Coherent Heterogeneous Multiprocessing as Basis for Streaming Applications", In "Dynamic and robust streaming between connected CE-devices", Kluwer Academic Publishers, 2005
[4]
P. Ranganathan, S. Adve, and N.P. Jouppi, "Reconfigurable caches and their application to media processing", In Proceedings, 27th Annual International Symposium on Computer Architecture, pages 214--224, 2000
[5]
J.L. Hennesy and D.A. Patterson, "Computer Architecture: A Quantitative Approach", Morgan Kaufmann Publishers, 2003
[6]
A.M. Molnos, M.J.M. Heijligers, S.D. Cotofana, and J.T.J. van Eijndhoven, "Compositional, efficient caches for a chip multi-processor", In Proceedings, Design, Automation and Test in Europe, to appear in 2006
[7]
ftp://ftp.ldv.e-technik.tu-muenchen.de/pub/test_sequences/
[8]
G.E. Suh, L. Rudolph, and S. Devadas, "Dynamic Partitioning of Shared Cache Memory", The Journal of Supercomputing, volume 28, number 1, pages 7--26, 2004
[9]
Y. Tan and V.J. Mooney, "A Prioritized Cache for Multi-tasking Real-Time Systems", In Proceedings of the 11th Workshop on Synthesis And System Integration of Mixed Information Technologies, pages 168--175, 2003
[10]
A.M. Molnos, M.J.M. Heijligers, S.D. Cotofana, and J.T.J. van Eijndhoven, "Compositional memory systems for multimedia communicating tasks", In Proceedings, Design, Automation and Test in Europe, pages 932--937, 2005
[11]
E.B. van der Tol, E.G. Jaspers, and R.H. Gelderblom, "Mapping of H.264 decoding on a multiprocessor architecture", In Proceedings, SPIE Conference on Image and Video Communications and Processing, 2003
[12]
P. van der Wolf, P. Lieverse, M. Goel, D. La Hei, K.A. Vissers "An MPEG-2 Decoder Case Study as a Driver for a System Level Design Methodology", In Proceedings, 7th International Workshop on Hardware/Software Co-Design, pages 33--37, 1999
[13]
E. A. de Kock, W. J. M. Smits, P. van der Wolf, J.-Y. Brunel, W. M. Kruijtzer, P. Lieverse, K. A. Vissers, and G. Essink "YAPI: application modeling for signal processing systems", In Proceedings, 37th conference on Design Automation, pages 402--405, 2000

Cited By

View all
  • (2008)Performance advantage of reconfigurable cache design on multicore processor systemsInternational Journal of Parallel Programming10.1007/s10766-008-0075-436:3(347-360)Online publication date: 1-Jun-2008
  • (2007)Two methods to enhance the master thread's performance in SMT Chip2007 IFIP International Conference on Network and Parallel Computing Workshops (NPC 2007)10.1109/NPC.2007.45(578-583)Online publication date: Sep-2007
  • (2007)Static Cache Partitioning Robustness Analysis for Embedded On-Chip Multi-processorsTransactions on High-Performance Embedded Architectures and Compilers I10.1007/978-3-540-71528-3_18(279-297)Online publication date: 21-Jul-2007
  • Show More Cited By

Index Terms

  1. Static cache partitioning robustness analysis for embedded on-chip multi-processors

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        CF '06: Proceedings of the 3rd conference on Computing frontiers
        May 2006
        430 pages
        ISBN:1595933026
        DOI:10.1145/1128022
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 03 May 2006

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. cache partitioning
        2. multi-processors
        3. robustness

        Qualifiers

        • Article

        Conference

        CF06
        Sponsor:
        CF06: Computing Frontiers Conference
        May 3 - 5, 2006
        Ischia, Italy

        Acceptance Rates

        Overall Acceptance Rate 273 of 785 submissions, 35%

        Upcoming Conference

        CF '25

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)1
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 05 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2008)Performance advantage of reconfigurable cache design on multicore processor systemsInternational Journal of Parallel Programming10.1007/s10766-008-0075-436:3(347-360)Online publication date: 1-Jun-2008
        • (2007)Two methods to enhance the master thread's performance in SMT Chip2007 IFIP International Conference on Network and Parallel Computing Workshops (NPC 2007)10.1109/NPC.2007.45(578-583)Online publication date: Sep-2007
        • (2007)Static Cache Partitioning Robustness Analysis for Embedded On-Chip Multi-processorsTransactions on High-Performance Embedded Architectures and Compilers I10.1007/978-3-540-71528-3_18(279-297)Online publication date: 21-Jul-2007
        • (2007)CMP Cache Architecture and the OpenMP PerformanceProceedings of the 3rd international workshop on OpenMP: A Practical Programming Model for the Multi-Core Era10.1007/978-3-540-69303-1_7(77-88)Online publication date: 3-Jun-2007
        • (2006)Throughput optimization via cache partitioning for embedded multiprocessors2006 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation10.1109/ICSAMOS.2006.300826(185-191)Online publication date: Jul-2006

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media