Skip to main content

A Multiprocessor Cache for Massively Parallel SoC Architectures

  • Conference paper
Architecture of Computing Systems - ARCS 2007 (ARCS 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4415))

Included in the following conference series:

Abstract

In this paper, we present an advanced multiprocessor cache architecture for chip multiprocessors (CMPs). It is designed for the scalable GigaNetIC CMP, which is based on massively parallel on-chip computing clusters. Our write-through multiprocessor cache is configurable in respect to the most relevant design options. It is supposed to be used in universal co-proc essors as well as in network processing units. For an early verification of the software and an early exploration of various hardware configurations, we have developed a SystemC-based simulation model for the complete chip multiproc essor. For detailed hardware-software co-verification, we use our FPGA-based rapid prototyping system RAPTOR2000 to emulate our architecture with near-ASIC performance. Finally, we demonstrate the performance gains for different application scenarios enabled by the usage of our multiprocessor cache.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bonorden, O., Brüls, N., Le, D.K., Kastens, U., Meyer auf der Heide, F., Niemann, J.-C., Porrmann, M., Rückert, U., Slowik, A., Thies, M.: A holistic methodology for network processor design. In: Proceedings of the Workshop on High-Speed Local Networks held in conjunction with the 28th Annual IEEE Conference on Local Computer Networks, October 20-24, pp. 583–592. IEEE Computer Society Press, Los Alamitos (2003)

    Google Scholar 

  2. Niemann, J.-C., Porrmann, M., Rückert, U., Scalable, A.: Parallel SoC Architecture for Network Processors. In: IEEE Computer Society Annual Symposium on VLSI (ISVLSI), Tampa, FL, IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  3. Liß, C.: Implementation of an AMBA AHB Interconnection Matrix. Technical Report. University of Paderborn, Paderborn, Germany (May 2004)

    Google Scholar 

  4. Kalte, H., Porrmann, M., Rückert, U.: A Prototyping Platform for Dynamically Reconfigurable System on Chip Designs. In: Proceedings of the IEEE Workshop Heterogeneous reconfigurable Systems on Chip (SoC), Hamburg, Germany, IEEE Press, Los Alamitos (2002)

    Google Scholar 

  5. Niemann, J.-C., Puttmann, C., Porrmann, M., Rückert, U.: GigaNetIC - A Scalable Embedded On-Chip Multiprocessor Architecture for Network Applications. In: Grass, W., Sick, B., Waldschmidt, K. (eds.) ARCS 2006. LNCS, vol. 3894, pp. 268–282. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  6. Langen, D., Niemann, J.-C., Porrmann, M., Kalte, H., Rückert, U.: Implementation of a RISC Processor Core for SoC Designs FPGA Prototype vs. ASIC Implementation. In: Proc. of the IEEE-Workshop: Heterogeneous reconfigurable Systems on Chip (SoC), Hamburg, Germany, IEEE Press, Los Alamitos (2002)

    Google Scholar 

  7. Grünewald, M., Kastens, U., Le, D.K., Niemann, J.-C., Porrmann, M., Rückert, U., Thies, M., Slowik, A.: Network Application Driven Instruction Set Extensions for Embedded Processing Clusters. In: PARELEC 2004, International Conference on Parallel Computing in Electrical Engineering, Dresden, Germany, pp. 209–214 (2004)

    Google Scholar 

  8. Eickhoff, R., Niemann, J.-C., Porrmann, M., Rückert, U.: Adaptable Switch boxes as on-chip routing nodes for networks-on-chip. In: Rettberg, A., Zanella, M.C., Rammig, F.J. (eds.) From Specification to Embedded Systems Application, International Embedded Systems Symposium (IESS), Manaus, Brazil, 15-17 August, pp. 201–21 (2005)

    Google Scholar 

  9. Dally, W.J., Towles, B.: Route Packets, Not Wires: On-Chip Interconnection Networks. In: Proceedings of the Design Automation Conference, Las Vegas, Nevada, USA, June 18-22, pp. 684–689 (2001)

    Google Scholar 

  10. Niemann, J.-C., Porrmann, M., Sauer, C., Rückert, U.: An Evaluation of the Scalable GigaNetIC Architecture for Access Networks. In: Advanced Networking and Communications Hardware Workshop (ANCHOR), held in conjunction with the ISCA 2005, Advanced Networking and Communications (2005)

    Google Scholar 

  11. Stümpel, E., Thies, M., Kastens, U.: VLIW Compilation Techniques for Superscalar Architectures. In: Koskimies, K. (ed.) Proc. of 7th International Conference on Compiler Construction CC’98 (1998)

    Google Scholar 

  12. Kastens, U., Le, D.K., Slowik, A., Thies, M.: Feedback Driven Instruction-Set Extension. In: Proceedings of ACM SIGPLAN/SIGBED 2004 Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES’04), Washington, D.C., USA, June, ACM Press, New York (2004)

    Google Scholar 

  13. Burger, D., Austin, T.M.: The SimpleScalar tool set, version 2.0. SIGARCH Computer Architecture News 25(3), 13–25 (1997)

    Article  Google Scholar 

  14. Tarjan, D., Thoziyoor, S., Jouppi, N.P.: CACTI 4.0. Technical Report. HP Laboratories Palo Alto, Palo Alto, CA, USA (June 2006)

    Google Scholar 

  15. Mudigonda, J., Vin, H., Yavatkar, R.: Managing Memory Access Latency in Packet Processing. In: SIGMETRICS ’05: Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, Banff, Alberta, Canada, June, pp. 396–397. ACM Press, New York (2005)

    Chapter  Google Scholar 

  16. Tensilica. Xtensa, L.X.: Microprocessor, Overview Handbook. Internet publication, Santa Clara, CA, USA (2004), Source: http://tensilica.com/pdf/xtensalx_overview_handbook.pdf , Seen online: 05.10.2006

  17. ARC International: ARC 700 configurable core family. Internet publication. San Jose, CA, USA (2005), Source: http://arc.com/evaluations/ARC_700_Family.pdf , Seen online: 05.10.2006

  18. Sweazey, P., Smith, A.J.: A class of compatible cache consistency protocols and their support by the IEEE futurebus. In: 13th Annual International Symposium on Computer Architecture, ISCA, Japan (1986)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Paul Lukowicz Lothar Thiele Gerhard Tröster

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Niemann, JC., Liß̈, C., Porrmann, M., Rückert, U. (2007). A Multiprocessor Cache for Massively Parallel SoC Architectures. In: Lukowicz, P., Thiele, L., Tröster, G. (eds) Architecture of Computing Systems - ARCS 2007. ARCS 2007. Lecture Notes in Computer Science, vol 4415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71270-1_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71270-1_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71267-1

  • Online ISBN: 978-3-540-71270-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics