A Multiprocessor Cache for Massively Parallel SoC Architectures

Niemann, Jörg-Christian; Liß, Christian; Porrmann, Mario; Rückert, Ulrich

doi:10.1007/978-3-540-71270-1_7

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4415)

Included in the following conference series:

International Conference on Architecture of Computing Systems

Abstract

In this paper, we present an advanced multiprocessor cache architecture for chip multiprocessors (CMPs). It is designed for the scalable GigaNetIC CMP, which is based on massively parallel on-chip computing clusters. Our write-through multiprocessor cache is configurable with respect to the most relevant design options. It is intended for use in universal coprocessors as well as in network processing units. For early verification of the software and early exploration of various hardware configurations, we have developed a SystemC-based simulation model of the complete chip multiprocessor. For detailed hardware-software co-verification, we use our FPGA-based rapid prototyping system RAPTOR2000 to emulate our architecture with near-ASIC performance. Finally, we demonstrate the performance gains that our multiprocessor cache enables in different application scenarios.

Author information

Authors and Affiliations

Heinz Nixdorf Institute, University of Paderborn, Germany
Jörg-Christian Niemann, Christian Liß, Mario Porrmann & Ulrich Rückert

Editor information

Paul Lukowicz Lothar Thiele Gerhard Tröster

Cite this paper

Niemann, J.-C., Liß, C., Porrmann, M., Rückert, U. (2007). A Multiprocessor Cache for Massively Parallel SoC Architectures. In: Lukowicz, P., Thiele, L., Tröster, G. (eds) Architecture of Computing Systems - ARCS 2007. ARCS 2007. Lecture Notes in Computer Science, vol 4415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71270-1_7

DOI: https://doi.org/10.1007/978-3-540-71270-1_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71267-1
Online ISBN: 978-3-540-71270-1
eBook Packages: Computer Science, Computer Science (R0)