Efficient Message Management in Tiled CMP Architectures Using a Heterogeneous Interconnection Network

Flores, Antonio; Aragón, Juan L.; Acacio, Manuel E.

doi:10.1007/978-3-540-77220-0_16

Antonio Flores¹,
Juan L. Aragón¹ &
Manuel E. Acacio¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4873))

Included in the following conference series:

International Conference on High-Performance Computing

1836 Accesses
4 Citations

Abstract

Previous studies have shown that the interconnection network of a Chip-Multiprocessor (CMP) has significant impact on both overall performance and energy consumption. Moreover, wires used in such interconnect can be designed with varying latency, bandwidth and power characteristics. In this work, we present a proposal for performance-and energy-efficient message management in tiled CMPs by using a heterogeneous interconnect. Our proposal consists of Reply Partitioning, a technique that classifies all coherence messages into critical and short, and non-critical and long messages; and the use of a heterogeneous interconnection network comprised of low-latency wires for critical messages and low-energy wires for non-critical ones. Through detailed simulations of 8- and 16-core CMPs, we show that our proposal obtains average improvements of 8% in execution time and 65% in the Energy-Delay² Product metric of the interconnect over previous works.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Taylor, M.B., Kim, J., et al.: The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs. IEEE Micro 22(2), 25–35 (2002)
Article Google Scholar
Zhang, M., Asanovic, K.: Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors. In: Proc. of the 32nd Int’l Symp. on Computer Architecture, pp. 336–345 (2005)
Google Scholar
Wang, H., Peh, L.S., Malik, S.: Power-driven Design of Router Microarchitectures in On-chip Networks. In: Proc. of the 36th Int’l Symp. on Microarchitecture, pp. 105–111 (2003)
Google Scholar
Magen, N., Kolodny, A.W., et al.: Interconnect-power dissipation in a microprocessor. In: Proc. of the 2004 Int’l Workshop on System Level Interconnect Prediction, pp. 7–13 (2004)
Google Scholar
Shang, L., Peh, L., Jha, N.: Dynamic voltage scaling with links for power optimization of interconnection networks. In: Proc. of the 9th Int’l Symp. on High-Performance Computer Architecture, pp. 91–102 (2003)
Google Scholar
Banerjee, K., Mehrotra, A.: A power-optimal repeater insertion methodology for global interconnects in nanometer designs. IEEE Trans. on Electron Devices 49(11), 2001–2007 (2002)
Article Google Scholar
Cheng, L., Muralimanohar, N., et al.: Interconnect-Aware Coherence Protocols for Chip Multiprocessors. In: Proc. of the 33rd Int’l Symp. on Computer Architecture, pp. 339–351 (2006)
Google Scholar
Kumar, R., Zyuban, V., Tullsen, D.M.: Interconnections in Multi-Core Architectures: Understanding Mechanisms, Overheads and Scaling. In: Proc. of the 32nd Int’l Symp. on Computer Architecture, pp. 408–419 (2005)
Google Scholar
Flores, A., Aragón, J.L., Acacio, M.E.: Sim-PowerCMP: A Detailed Simulator for Energy Consumption Analysis in Future Embedded CMP Architectures. In: Proc. of the 4th Int’l Symp. on Embedded Computing, pp. 752–757 (2007)
Google Scholar
Hughes, C.J., Pai, V.S., et al.: RSIM: Simulating Shared-Memory Multiprocessors with ILP Processors. IEEE Computer 35(2), 40–49 (2002)
Google Scholar
Brooks, D., Tiwari, V., Martonosi, M.: Wattch: a framework for architectural-level power analysis and optimizations. In: Proc. of the 27th Int’l Symp. on Computer Architecture, pp. 83–94 (2000)
Google Scholar
Zhang, Y., Parikh, D., et al.: HotLeakage: A Temperature-Aware Model of Subthreshold and Gate Leakage for Architects. Technical report, University of Virginia (2003)
Google Scholar
Wang, H.S., Zhu, X., et al.: Orion: a power-performance simulator for interconnection networks. In: Proc. of the 35th Int’l Symp. on Microarchitecture, pp. 294–305 (2002)
Google Scholar
Beckmann, B.M., Wood, D.A.: TLC: Transmission Line Caches. In: Proc. of the 36th Int’l Symp. on Microarchitecture, pp. 43–54 (2003)
Google Scholar
Beckmann, B.M., Wood, D.A., et al.: Managing Wire Delay in Large Chip-Multiprocessor Caches. In: Proc. of the 37th Int’l Symp. on Microarchitecture, pp. 319–330 (2004)
Google Scholar
Nelson, N., Briggs, G., et al.: Alleviating Thermal Constraints while Maintaining Performance via Silicon-Based On-Chip Optical Interconnects. In: Workshop on Unique Chips and Systems (2005)
Google Scholar
Balasubramonian, R., Muralimanohar, N., et al.: Microarchitectural Wire Management for Performance and Power in Partitioned Architectures. In: Proc. of the 11th Int’l Symp. on High-Performance Computer Architecture, pp. 28–39 (2005)
Google Scholar
Muralimanohar, N., Balasubramonian, R.: The Effect of Interconnect Design on the Performance of Large L2 Caches. In: 3rd IBM Watson Conf. on Interaction between Architecture, Circuits, and Compilers (P=ac2) (2006)
Google Scholar
Balfour, J., Dally, W.J.: Design tradeoffs for tiled CMP on-chip networks. In: Proc. of the 20th Int’l Conf. on Supercomputing, pp. 187–198 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Ingeniería y Tecnología de Computadores, University of Murcia, 30100 Murcia, Spain)
Antonio Flores, Juan L. Aragón & Manuel E. Acacio

Authors

Antonio Flores
View author publications
You can also search for this author in PubMed Google Scholar
Juan L. Aragón
View author publications
You can also search for this author in PubMed Google Scholar
Manuel E. Acacio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Srinivas Aluru Manish Parashar Ramamurthy Badrinath Viktor K. Prasanna

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Flores, A., Aragón, J.L., Acacio, M.E. (2007). Efficient Message Management in Tiled CMP Architectures Using a Heterogeneous Interconnection Network. In: Aluru, S., Parashar, M., Badrinath, R., Prasanna, V.K. (eds) High Performance Computing – HiPC 2007. HiPC 2007. Lecture Notes in Computer Science, vol 4873. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77220-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-540-77220-0_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77219-4
Online ISBN: 978-3-540-77220-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics