skip to main content
10.1145/2000064.2000115acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
research-article

Combining memory and a controller with photonics through 3D-stacking to enable scalable and energy-efficient systems

Published: 04 June 2011 Publication History

Abstract

It is well-known that memory latency, energy, capacity, bandwidth, and scalability will be critical bottlenecks in future large-scale systems. This paper addresses these problems, focusing on the interface between the compute cores and memory, comprising the physical interconnect and the memory access protocol. For the physical interconnect, we study the prudent use of emerging silicon-photonic technology to reduce energy consumption and improve capacity scaling. We conclude that photonics are effective primarily to improve socket-edge bandwidth by breaking the pin barrier, and for use on heavily utilized links. For the access protocol, we propose a novel packet based interface that relinquishes most of the tight control that the memory controller holds in current systems and allows the memory modules to be more autonomous, improving flexibility and interoperability. The key enabler here is the introduction of a 3D-stacked interface die that allows both these optimizations without modifying commodity memory dies. The interface die handles all conversion between optics and electronics, as well as all low-level memory device control functionality. Communication beyond the interface die is fully electrical, with TSVs between dies and low-swing wires on-die. We show that such an approach results in substantially lowered energy consumption, reduced latency, better scalability to large capacities, and better support for heterogeneity and interoperability.

Supplementary Material

JPG File (isca_9a_1.jpg)
MP4 File (isca_9a_1.mp4)

References

[1]
CACTI: An Integrated Cache and Memory Access Time, Cycle Time, Area, Leakage, and Dynamic Power Model. http://www.hpl.hp.com/research/cacti/.
[2]
Fully-Buffered DIMM Technology in HP ProLiant Servers - Technology Brief. http://www.hp.com.
[3]
STREAM - Sustainable Memory Bandwidth in High Performance Computers. http://www.cs.virginia.edu/stream/.
[4]
Virtutech Simics Full System Simulator. http://www.virtutech.com.
[5]
N. Aggarwal et al. Power Efficient DRAM Speculation. In Proceedings of HPCA, 2008.
[6]
D. E. Atkins et al. Report of the NSF Blue-Ribbon Advisory Panel on Cyberinfrastructure. Technical report, National Science Foundation, 2003.
[7]
M. Awasthi et al. Handling PCM Resistance Drift with Device, Circuit, Architecture, and System Solutions. In Non-Volatile Memories Workshop, 2011.
[8]
R. Barbieri et al. Design and Construction of the High-Speed Optoelectronic Memory System Demonstrator. Applied Opt., 2008.
[9]
L. Barroso and U. Holzle. The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines. Morgan & Claypool, 2009.
[10]
S. Beamer et al. Re-Architecting DRAM Memory Systems with Monolithically Integrated Silicon Photonics. In Proceedings of ISCA, 2010.
[11]
R. G. Beausoleil et al. Nanoelectronic and Nanophotonic Interconnect. Proceedings of IEEE, 2008.
[12]
C. Benia et al. The PARSEC Benchmark Suite: Characterization and Architectural Implications. Technical report, Princeton University, 2008.
[13]
B. Black et al. Die Stacking (3D) Microarchitecture. In Proceedings of MICRO, December 2006.
[14]
M. J. Cianchetti, J. C. Kerekes, and D. H. Albonesi. Phastlane: A Rapid Transit Optical Routing Network. In Proceedings of ISCA, 2009.
[15]
J. Condit et al. Better I/O Through Byte-Addressable, Persistent Memory. In Proceedings of SOSP, 2009.
[16]
E. Cooper-Balis and B. Jacob. Fine-Grained Activation for Power Reduction in DRAM. IEEE Micro, May/June 2010.
[17]
W. Dally and B. Towles. Principles and Practices of Interconnection Networks. Morgan Kaufmann, 1st edition, 2003.
[18]
Elpida Memory, Inc. News Release: Elpida Completes Development of Cu-TSV (Through Silicon Via) Multi-Layer 8-Gigabit DRAM. http://www.elpida.com/pdfs/pr/2009-08-27e.pdf.
[19]
K. Fang et al. Mode-locked Silicon Evanescent Lasers. Optics Express, September 2007.
[20]
A. Hadke et al. OCDIMM: Scaling the DRAM Memory Wall Using WDM based Optical Interconnects. In Proceedings of HOTI, 2008.
[21]
R. Ho. On-Chip Wires: Scaling and Efficiency. PhD thesis, Stanford University, August 2003.
[22]
ITRS. International Technology Roadmap for Semiconductors, 2008 Update.
[23]
J. Ahn et al. Devices and architectures for photonic chip-scale integration. Applied Physics A: Materials Science and Processing, 95, 2009.
[24]
B. Jacob, S. W. Ng, and D. T. Wang. Memory Systems - Cache, DRAM, Disk. Elsevier, 2008.
[25]
John Carter, IBM Power Aware Systems. Personal Correspondence, 2011.
[26]
U. Kang et al. 8Gb 3D DDR DRAM Using Through-Silicon-Via Technology. In Proceedings of ISSCC, 2009.
[27]
N. Kirman et al. Leveraging Optical Technology in Future Bus-Based Chip Multiprocessors. In Proceedings of MICRO, 2006.
[28]
N. Kirman and J. F. Martinez. An Efficient All-Optical On-Chip Interconnect Based on Oblivious Routing. In Proceedings of ASPLOS, 2010.
[29]
P. Kogge(Editor). ExaScale Computing Study: Technology Challenges in Achieving Exascale Systems. Defense Advanced Research Projects Agency (DARPA), 2008.
[30]
S. Li et al. McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures. In Proceedings of MICRO, 2009.
[31]
K. Lim et al. Disaggregated Memory for Expansion and Sharing in Blade Servers. In Proceedings of ISCA, 2009.
[32]
G. Loh. 3D-Stacked Memory Architectures for Multi-Core Processors. In Proceedings of ISCA, 2008.
[33]
D. Meisner, B. Gold, and T. Wenisch. PowerNap: Eliminating Server Idle Power. In Proceedings of ASPLOS, 2009.
[34]
D. A. B. Miller. Device Requirements for Optical Interconnects to Silicon Chips. Proceedings of IEEE Special Issue on Silicon Photonics, 2009.
[35]
C. Natarajan, B. Christenson, and F. Briggs. A Study of Performance Impact of Memory Controller Features in Multi-Processor Environment. In Proceedings of WMPI, 2004.
[36]
C. Nitta, M. Farrens, and V. Akella. Addressing System-Level Trimming Issues in On-Chip Nanophotonic Networks. In Proceedings of HPCA, 2011.
[37]
E. Ordentlich et al. Coding for Limiting Current in Memristor Crossbar Memories. In Non-Volatile Memories Workshop, 2011.
[38]
M. Qureshi, V. Srinivasan, and J. Rivers. Scalable High Performance Main Memory System Using Phase-Change Memory Technology. In Proceedings of ISCA, 2009.
[39]
Raymond G. Beausoleil, HP Labs. Personal Correspondence, 2010.
[40]
K. Skadron et al. Temperature-Aware Microarchitecture. In Proceedings of ISCA, 2003.
[41]
N. Streibl et al. Digital Optics. Proceedings of IEEE, 1989.
[42]
J. Torrellas. Architectures for Extreme-Scale Computing. IEEE Computer, November 2009.
[43]
A. N. Udipi et al. Rethinking DRAM Design and Organization for Energy-Constrained Multi-Cores. In Proceedings of ISCA, 2010.
[44]
D. Vantrease et al. Corona: System Implications of Emerging Nanophotonic Technology. In Proceedings of ISCA, 2008.
[45]
D. H. Woo et al. An Optimized 3D-Stacked Memory Architecture by Exploiting Excessive, High-Density TSV Bandwidth. In Proceedings of HPCA, 2010.
[46]
Q. Xu et al. Micrometre-Scale Silicon Electro-Optic Modulator. Nature, 435:325--327, May 2005.
[47]
J. Xue et al. An Intra-Chip Free-Space Optical Interconnect. In Proceedings of ISCA, 2010.

Cited By

View all
  • (2020)CAMON: Low-Cost Silicon Photonic Chiplet for Manycore ProcessorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2019.292649539:9(1820-1833)Online publication date: Sep-2020
  • (2018)3D-XpathProceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques10.1145/3243176.3243191(1-12)Online publication date: 1-Nov-2018
  • (2017)Triple Engine Processor (TEP)ACM Transactions on Architecture and Code Optimization10.1145/315592014:4(1-25)Online publication date: 18-Dec-2017
  • Show More Cited By

Index Terms

  1. Combining memory and a controller with photonics through 3D-stacking to enable scalable and energy-efficient systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ISCA '11: Proceedings of the 38th annual international symposium on Computer architecture
      June 2011
      488 pages
      ISBN:9781450304726
      DOI:10.1145/2000064
      • cover image ACM SIGARCH Computer Architecture News
        ACM SIGARCH Computer Architecture News  Volume 39, Issue 3
        ISCA '11
        June 2011
        462 pages
        ISSN:0163-5964
        DOI:10.1145/2024723
        Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 04 June 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. 3d stacking
      2. communication protocols
      3. dram
      4. photonics

      Qualifiers

      • Research-article

      Conference

      ISCA '11
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 543 of 3,203 submissions, 17%

      Upcoming Conference

      ISCA '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)17
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 03 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2020)CAMON: Low-Cost Silicon Photonic Chiplet for Manycore ProcessorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2019.292649539:9(1820-1833)Online publication date: Sep-2020
      • (2018)3D-XpathProceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques10.1145/3243176.3243191(1-12)Online publication date: 1-Nov-2018
      • (2017)Triple Engine Processor (TEP)ACM Transactions on Architecture and Code Optimization10.1145/315592014:4(1-25)Online publication date: 18-Dec-2017
      • (2017)FlexCLProceedings of the 54th Annual Design Automation Conference 201710.1145/3061639.3062251(1-6)Online publication date: 18-Jun-2017
      • (2017)Electrical and photonic off-chip interconnection and system integrationOptical Interconnects for Data Centers10.1016/B978-0-08-100512-5.00011-5(265-286)Online publication date: 2017
      • (2016)Reducing data movement energy via online data clustering and encodingThe 49th Annual IEEE/ACM International Symposium on Microarchitecture10.5555/3195638.3195676(1-13)Online publication date: 15-Oct-2016
      • (2016)A unified memory network architecture for in-memory computing in commodity serversThe 49th Annual IEEE/ACM International Symposium on Microarchitecture10.5555/3195638.3195673(1-14)Online publication date: 15-Oct-2016
      • (2016)Twin-LoadProceedings of the Second International Symposium on Memory Systems10.1145/2989081.2989106(164-176)Online publication date: 3-Oct-2016
      • (2016)A Comparative Analysis of Front-End and Back-End Compatible Silicon Photonic On-Chip InterconnectsProceedings of the 18th System Level Interconnect Prediction Workshop10.1145/2947357.2947362(1-8)Online publication date: 4-Jun-2016
      • (2016)Populating and exploring the design space of wavelength-routed optical network-on-chip topologies by leveraging the add-drop filtering primitive2016 Tenth IEEE/ACM International Symposium on Networks-on-Chip (NOCS)10.1109/NOCS.2016.7579331(1-8)Online publication date: Sep-2016
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media