skip to main content
10.1145/2333660.2333707acmconferencesArticle/Chapter ViewAbstractPublication PagesislpedConference Proceedingsconference-collections
research-article

TapeCache: a high density, energy efficient cache based on domain wall memory

Published: 30 July 2012 Publication History

Abstract

Domain Wall Memory (DWM) is a recently developed spin-based memory technology in which several bits of data are densely packed into the domains of a ferromagnetic wire. DWM has shown great promise in enabling non-volatile memory with unprecedented density and high energy efficiency. In this work, we propose TapeCache, a first attempt to employ DWMs as last-level caches in general purpose computing platforms. DWMs enable much higher density compared to SRAM, DRAM, and other spin-based memory technologies such as STT-MRAM. However, they also pose unique challenges such as serial access to the bits stored in a DWM cell, leading to variable access latencies. We propose a novel circuit-architecture co-design for TapeCache, consisting of (i) a multi-port DWM macro-cell optimized for read operations considering the asymmetry in applications' read/write characteristics, and (ii) a new cache organization and suitable management policies that mitigate the performance penalty arising from serial access to bits in a macro-cell. Over a wide range of SPEC 2006 benchmarks, TapeCache achieves 7.8X improvement in area, an average energy improvement of 7.3X, and an average performance improvement of 1.2% compared to an iso-capacity SRAM cache. Compared to an iso-capacity STT-MRAM cache, TapeCache obtains 2.3X improvement in area and 1.4X average energy savings with virtually identical performance.

References

[1]
L. Thomas, R. Moriya, C. Rettner, and S. Parkin. Dynamics of Magnetic Domain Walls Under Their Own Inertia. Science, 330(6012):1810--1813, December 2010.
[2]
C. Augustine et al. Numerical analysis of domain wall propagation for dense memory arrays. In Proc. IEDM, pages 17.6.1--17.6.4, December 2011.
[3]
S. Parkin, M. Hayashi, and L. Thomas. Magnetic domain-wall racetrack memory. Science, 320(5873):190--194, April 2008.
[4]
E. R. Lewis et al. Fast domain wall motion in magnetic comb structures. Nature, 9(12):980--983, December 2010.
[5]
P. Zhou, B. Zhao, J. Yang, and Y. Zhang. Energy reduction for STT-RAM using early write termination. In Proc. ICCAD, pages 264--268, November 2009.
[6]
R. Desikan et al. On-chip MRAM as a High-Bandwidth, Low-Latency Replacement for DRAM Physical Memories. Technical report, In IBM Austin CASC, 2002.
[7]
Y. Xie. Modeling, architecture, and applications for emerging memory technologies. IEEE Design and Test of Computers, 28(1):44--51, Jan-Feb 2011.
[8]
K. Lee and S.H. Kang. Development of Embedded STT-MRAM for Mobile System-on-Chips. Magnetics, IEEE Trans. on, 47(1):131--136, January 2011.
[9]
M. K. Qureshi, V. Srinivasan, and J. A. Rivers. Scalable high performance main memory system using phase-change memory technology. In Proc. ISCA, pages 24--33, June 2009.
[10]
G. Dhiman, R. Ayoub, and T. Rosing. PDRAM: A hybrid PRAM and DRAM main memory system. In Proc. DAC, pages 664--669, July 2009.
[11]
CACTI. http://www.hpl.hp.com/research/cacti/.
[12]
X. Dong et al. Circuit and microarchitecture evaluation of 3D stacking magnetic RAM (MRAM) as a universal memory replacement. In Proc. DAC, pages 554--559, June 2008.
[13]
T. Austin, E. Larson, and D. Ernst. Simplescalar: An infrastructure for computer system modeling. Computer, 35:59--67, February 2002.
[14]
X. Wu et al. Design exploration of hybrid caches with disparate memory technologies. ACM Trans. Archit. Code Optim., 7(3):15:1--15:34, December 2010.
[15]
M. Rasquinha et al. An energy efficient cache design using Spin Torque Transfer (STT) RAM. In Proc. ISLPED, pages 389--394, August 2010.
[16]
A. Jadidi, M. Arjomand, and H. S. Azad. High-endurance and performance-efficient design of hybrid cache architectures through adaptive line replacement. In Proc. ISLPED, pages 79--84, August 2011.
[17]
Y. Chen, W. F. Wong, H. Li, and C. K. Koh. Processor caches with multi-level spin-transfer torque RAM cells. In Proc. ISLPED, pages 73--78, August 2011.
[18]
R. Venkatesan et al. Energy efficient many-core processor for recognition and mining using spin-based memory. In Proc. NANOARCH, pages 122--128, June 2011.
[19]
W. Zhao, D. Ravelosona, J. Klein, and C. Chappert. Domain wall shift register-based reconfigurable logic. Magnetics, IEEE Trans., 47(10):2966--2969, October 2011.

Cited By

View all
  • (2024)SPIMulator: A Spintronic Processing-in-memory Simulator for RacetracksACM Transactions on Embedded Computing Systems10.1145/364511223:6(1-27)Online publication date: 11-Sep-2024
  • (2024)StreamPIM: Streaming Matrix Computation in Racetrack Memory2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA57654.2024.00031(297-311)Online publication date: 2-Mar-2024
  • (2023)A Multi-Domain Magneto Tunnel Junction for Racetrack Nanowire StripsIEEE Transactions on Nanotechnology10.1109/TNANO.2023.329892022(581-583)Online publication date: 2023
  • Show More Cited By

Index Terms

  1. TapeCache: a high density, energy efficient cache based on domain wall memory

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISLPED '12: Proceedings of the 2012 ACM/IEEE international symposium on Low power electronics and design
    July 2012
    438 pages
    ISBN:9781450312493
    DOI:10.1145/2333660
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 July 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. cache
    2. domain wall memory
    3. spin memory

    Qualifiers

    • Research-article

    Conference

    ISLPED'12
    Sponsor:
    ISLPED'12: International Symposium on Low Power Electronics and Design
    July 30 - August 1, 2012
    California, Redondo Beach, USA

    Acceptance Rates

    Overall Acceptance Rate 398 of 1,159 submissions, 34%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)34
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)SPIMulator: A Spintronic Processing-in-memory Simulator for RacetracksACM Transactions on Embedded Computing Systems10.1145/364511223:6(1-27)Online publication date: 11-Sep-2024
    • (2024)StreamPIM: Streaming Matrix Computation in Racetrack Memory2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA57654.2024.00031(297-311)Online publication date: 2-Mar-2024
    • (2023)A Multi-Domain Magneto Tunnel Junction for Racetrack Nanowire StripsIEEE Transactions on Nanotechnology10.1109/TNANO.2023.329892022(581-583)Online publication date: 2023
    • (2023)Correcting Multiple Deletions and Insertions in Racetrack MemoryIEEE Transactions on Information Theory10.1109/TIT.2023.327976669:9(5619-5639)Online publication date: Sep-2023
    • (2023)Granularity-Driven Management for Reliable and Efficient Skyrmion Racetrack MemoriesIEEE Transactions on Emerging Topics in Computing10.1109/TETC.2022.317180411:1(95-111)Online publication date: 1-Jan-2023
    • (2023)Optimizing Data Placement for Hybrid SRAM+Racetrack Memory SPM in Embedded SystemsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2022.318554842:3(847-859)Online publication date: Mar-2023
    • (2023)DownShift: Tuning Shift Reduction With Reliability for Racetrack MemoriesIEEE Transactions on Computers10.1109/TC.2023.325750972:9(2585-2599)Online publication date: 1-Sep-2023
    • (2023) ROLLED: R acetrack Memory O ptimized L inear L ayout and E fficient D ecomposition of Decision Trees IEEE Transactions on Computers10.1109/TC.2022.319709472:5(1488-1502)Online publication date: 1-May-2023
    • (2023)Toward Comprehensive Shifting Fault Tolerance for Domain-Wall Memories With PIETTIEEE Transactions on Computers10.1109/TC.2022.318820672:4(1095-1109)Online publication date: 1-Apr-2023
    • (2023)Sustainable AI Processing at the EdgeIEEE Micro10.1109/MM.2022.322039943:1(19-28)Online publication date: 1-Jan-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media