
RRAM memory systems have been demonstrated to offer low switching power [23] and higher density [29] compared to conventional 6T-SRAM cells. Fabrication of large arrays [7, 8, 19], along with integration into the CMOS process flow, affirms their potential at industry scale [14, 26]. RRAMs also offer an additional power advantage through their non-volatility: the crossbar array can be isolated from the power source when not in use, saving power [24]. However, the application of RRAMs as an alternative to SRAM in caches has been restricted by their high latencies [17]. RRAMs show long write times (10–100 ns), limiting their performance in programs requiring frequent and fast write operations [11, 16]. Additionally, fabricated large arrays have shown high read latencies [7, 8, 19].

Fig. 1. (a) RRAM I–V curve generated using the model of [9]. (b) Circuit configuration during reading: the line capacitance must be charged through the RRAM. (c) Dependence of read latency on array size for SRAM and RRAM; RRAM shows significantly higher latency. SRAM latencies are extracted using CACTI, while for RRAM the capacitance charge-up time is calculated as a function of array size. (d) Write energy vs. read latency trade-off for different RRAM resistance values. The simulation curve assumes a write latency of 10 ns.

Prior explorations of RRAM for memory applications have leveraged its key density advantage [2]. Specifically, RRAM is attractive for L2–L4 caches from an area and energy perspective at a small performance penalty. Thus, RRAMs have mostly been used as last-level caches [6, 20, 30] and as main memory [18], where higher latencies are better tolerated. L1 cache read latency requirements, however, are more stringent, so reducing read latency becomes essential for applying RRAMs in lower-level caches. The L1 instruction cache requires fast reads but sees relatively few writes, making it an excellent application for benchmarking fast reading in RRAM against conventional SRAM.

Typical RRAM characteristics (Fig. 1(a)) show that read and write currents are of similar magnitude. Reducing the write current to lower write energy therefore also reduces the read current, and a lower read current takes longer to charge the line capacitance (\(C_{line}\)) to the level the sense amplifier needs to read (Fig. 1(b)). Naturally, larger arrays have larger read latency, since \(C_{line}\) increases with array size, consistent with the literature [8, 19]. The read latency as a function of array size is shown in Fig. 1(c) for both SRAM and RRAM; RRAM exhibits significantly higher read latency. Thus a write energy vs. read latency trade-off is observed in RRAMs across the literature [10, 22, 23, 25, 28]. Using a low-resistance device for quick reading might seem attractive, with resistance values reported for RRAMs varying over three orders of magnitude (10 k\(\varOmega \)–10 M\(\varOmega \)) [10, 22, 23, 25, 28]. However, such high-current devices increase the energy consumption of the array in proportion to the current, which makes competing with SRAM difficult (Fig. 1(d)). Thus, an attractive RRAM with both low write energy and low read latency is precluded by the write energy vs. read latency trade-off.
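The trade-off can be illustrated with a first-order model: read latency set by RC charging of the line capacitance through the RRAM, and write energy by Joule heating during a fixed-width write pulse. The following sketch uses illustrative constants (the voltages, capacitance, and sense threshold are assumptions, not the paper's simulation values):

```python
import numpy as np

# First-order model behind Fig. 1(d). All constants are illustrative
# assumptions, not the paper's simulation values.
V_WRITE = 0.8       # write voltage (V), the value assumed in Sect. 2.3
T_WRITE = 10e-9     # write pulse width (s), as assumed in Fig. 1(d)
C_LINE = 15e-15     # line capacitance (F), order of a 256x64 array
SENSE_FRAC = 0.5    # fraction of the final value the SA must see (assumed)

for r_rram in [1e4, 1e5, 1e6, 1e7]:            # 10 kOhm .. 10 MOhm
    # Read: RRAM charges C_line; t = R * C * ln(1 / (1 - SENSE_FRAC))
    t_read = r_rram * C_LINE * np.log(1.0 / (1.0 - SENSE_FRAC))
    # Write: Joule heating across the RRAM for the fixed pulse width
    e_write = (V_WRITE ** 2 / r_rram) * T_WRITE
    print(f"R = {r_rram:>10.0f} Ohm | read latency ~ {t_read * 1e9:7.3f} ns"
          f" | write energy ~ {e_write * 1e15:8.2f} fJ")
```

Even this crude model reproduces the tension: a 10 k\(\varOmega \) device reads in well under a nanosecond but burns orders of magnitude more write energy than a 10 M\(\varOmega \) device, which in turn takes on the order of 100 ns to read.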

In this paper, we present a bitcell design that mitigates the write energy vs. read latency trade-off, and we evaluate its impact on instruction-cache-level performance. First, we present a modified 2T1R bitcell, formed by adding one NMOS transistor to the conventional select-transistor (1T) plus RRAM (1R) 1T1R bitcell, to produce a high read current. Second, we show that our proposal ‘breaks’ the write energy vs. read latency trade-off at the cost of increased bitcell area. Third, we analyze the effect at the architectural level for instruction cache replacement by comparing our proposal with the conventional 1T1R bitcell and SRAM in terms of energy-delay product (EDP) for both high-performance and embedded processor configurations.

1 Proposal

1.1 Fast Read Solution

Fig. 2. Bitcells for (a) SRAM, (b) the conventional 1T1R, and (c) the proposed 2T1R scheme. (d–f) Array-level schematics for these schemes. (g) Schematic for architecture-level evaluation, where the smaller RRAM bitcell area accommodates more memory.

Figure 2(a–c) shows the bitcells for the SRAM and RRAM schemes. In the conventional 1T1R bitcell [15], an NMOS select transistor (1T) selects an RRAM (1R) for reading or writing; the read-write scheme is taken from [29]. We propose applying the voltage (\(V_{gate-select}\)) at the node between the 1T and 1R devices to the gate of an additional NMOS transistor, forming the 2T1R bitcell shown in Fig. 2(c). The drain and source of this read transistor are connected to an additional bitline and wordline. With this modification, the read transistor, rather than the RRAM, supplies the read current that charges the line capacitance at the sense amplifier. The read transistor can therefore be designed independently for fast reads, while the RRAM resistance can be increased for low energy, disabling the write energy vs. read latency trade-off.

Array-level schematics of these bitcells are shown in Fig. 2(d–f). The conventional 1T1R scheme has one bitcell at each crossbar intersection, with two wordlines and one bitline. A complementary pass-transistor switch serves as the row selector (RS), applying different voltages for reading and writing. A select line turns on the select transistors when a row is to be read or written, and the bitline connects the bitcell to the sense amplifier through a column selector (CS) during reads. The proposed scheme requires one additional pair of wordline and bitline connecting the source and drain of the read transistor, as shown in Fig. 2(f).

1.2 Targeted Application

Fig. 3. Cache memory schematic with RRAM accommodating more memory in the same area as SRAM. This area advantage, along with the fast read capability, is examined for the potential of RRAM as an L1 instruction cache substitute.

Fast reading achieved by the proposed scheme is tested for L1 instruction cache replacement, since the instruction cache sees frequent reads and infrequent writes. This imposes an aggressive read latency requirement but tolerates slow writes. We exploit the area advantage offered by RRAM to accommodate a larger memory in the same area, and thus intend to compensate for the high write latency of RRAM with a larger cache (Fig. 3): fewer L1 misses mean fewer data fetches from L2, reducing the frequency of write operations. We study the performance of the proposed 2T1R scheme for both high-performance and embedded architectures.

2 New Read Scheme

2.1 Circuit Schematic

Fig. 4. Circuit schematic during reading for (a) the conventional 1T1R and (b) the proposed 2T1R schemes. The additional read transistor in the 2T1R scheme provides a high-current path for the read current, enabling quick reading.

Figure 4 shows the circuit schematics for both RRAM schemes in the reading configuration. For the conventional 1T1R scheme, \(V_{read}\) is applied through the RS switch at the wordline and the CS is grounded as shown. The voltage is divided across the RRAM and \(R_{CS}\) and is sensed by the sense amplifier (SA) [27]. \(C_{line}\) denotes the line capacitance to be charged by the RRAM read current, which sets the charge-up time. The proposed scheme has a similar reading configuration, with RS and CS driving the two wordlines and bitlines to the voltages shown. The select transistor is biased at \(V_{read-select}\) and, together with the RRAM, acts as a resistive voltage divider producing a voltage (\(V_{gate-select}\)) that drives the read transistor. The read transistor conducts a high current, charging the line capacitance (\(C_{line}\)) and the \(SA_{in}\) terminal. A diode-connected transistor in the CS (drain shorted to gate) acts as a current-to-voltage converter to drive the SA.

Fig. 5. Equivalent circuits for (a) the conventional 1T1R and (b) the proposed 2T1R schemes. The decrease in time constant results from the reduction in the load capacitance that must be charged.

In this scheme, the RRAM resistance primarily charges the ‘small’ gate capacitance of the read transistor, while the read transistor charges the substantial line capacitance. Approximate charging time constants show the 1T1R scheme charging the line capacitance (\(\sim \)10 fF) while the proposed scheme charges only the small gate capacitance (\(\sim \)0.1 fF), resulting in faster reading (Fig. 5).
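Given the equivalent circuits of Fig. 5, the speedup follows directly from the two RC products. A back-of-the-envelope sketch using the 1 M\(\varOmega \) LRS of Sect. 2.2 and the approximate capacitances quoted above:

```python
# Back-of-the-envelope time constants for the equivalent circuits of
# Fig. 5. R_RRAM is the 1 MOhm LRS of Sect. 2.2; the capacitances are
# the approximate values quoted in the text.
R_RRAM = 1e6        # Ohm
C_LINE = 10e-15     # F, charged by the RRAM in the 1T1R scheme
C_GATE = 0.1e-15    # F, charged by the RRAM in the 2T1R scheme

tau_1t1r = R_RRAM * C_LINE   # RRAM must charge the full line capacitance
tau_2t1r = R_RRAM * C_GATE   # RRAM charges only the read-transistor gate;
                             # the low-impedance read transistor then
                             # drives C_line, adding little delay
print(f"tau(1T1R) ~ {tau_1t1r * 1e9:.1f} ns")    # ~10 ns
print(f"tau(2T1R) ~ {tau_2t1r * 1e9:.2f} ns")    # ~0.1 ns
```

The roughly 100\(\times \) reduction in the capacitance seen by the RRAM translates directly into a 100\(\times \) smaller time constant.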

2.2 Maximizing Sense Margin

Choosing \(V_{read-select}\) determines the gate voltage of the read transistor. We use the Verilog-A RRAM model [9] and a 45 nm CMOS technology [31] for HSPICE circuit simulations. From the vast range of resistances reported in the literature, we consider a high-resistance RRAM with low resistance state (LRS) = 1 M\(\varOmega \) and high resistance state (HRS) = 10 M\(\varOmega \), as described in Fig. 6. We fix \(R_{RRAM}\) and vary \(V_{read-select}\); the task is to create the maximum voltage difference at the input of the SA for the given LRS and HRS. For this choice of RRAM resistances, \(V_{read-select}\) = 280 mV turns the read transistor on for LRS and keeps it off for HRS (Fig. 6(b)). The read current therefore shows the maximum difference between the two resistance states (Fig. 6(c)). The read current is converted to a voltage and applied to the input of the SA, as shown in Fig. 6(d), giving the maximum sense margin.
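The LRS-on/HRS-off discrimination at the chosen bias can be sketched with a first-order square-law model. The divider orientation, threshold voltage, and prefactor below are assumptions for illustration only; the paper extracts Fig. 6 from HSPICE with the Verilog-A RRAM model [9] in 45 nm CMOS [31]:

```python
# First-order illustration of the LRS-on / HRS-off discrimination at
# the chosen bias. The divider orientation, threshold, and square-law
# prefactor are assumptions; the actual curves come from HSPICE.
R_LRS, R_HRS = 1e6, 10e6   # RRAM states (Ohm), from Sect. 2.2
R_DIV = 3e6                # effective select-path resistance (assumed)
V_TH = 0.15                # read-transistor threshold voltage (assumed)
K = 100e-6                 # square-law prefactor (A/V^2, assumed)
V_READ_SELECT = 0.28       # chosen bias (V), from Fig. 6

for name, r in (("LRS", R_LRS), ("HRS", R_HRS)):
    # Divider between the select path and the RRAM sets the gate
    # voltage (orientation assumed so that LRS pulls the gate high).
    v_gate = V_READ_SELECT * R_DIV / (R_DIV + r)
    i_read = K * max(v_gate - V_TH, 0.0) ** 2   # OFF below threshold
    print(f"{name}: V_gate-select = {v_gate * 1e3:5.1f} mV, "
          f"I_read = {i_read * 1e6:.2f} uA")
```

Under these assumed constants, LRS produces a gate voltage above threshold and a finite read current, while HRS leaves the read transistor off, which is the behaviour the bias point in Fig. 6 is chosen to maximize.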

Fig. 6. Dependence of (a) \(V_{gate-select}\), (b) \(I_{read}\), and (c) \(V_{SA-in}\) on \(V_{read-select}\). \(V_{read-select}=0.28\) V is selected to maximize the swing at the input of the SA.

2.3 Breaking the Trade-off

Figure 7(a) shows the timing diagrams for reading in both the 1T1R and 2T1R schemes. An RRAM switching voltage of 0.8 V is assumed, within the range of switching voltages (0.2–4 V) reported in the literature [10, 22, 23, 25, 28]. An array size of 256\(\,\times \,\)64 is taken for calculating the line capacitance; a line capacitance of 15.07 fF is obtained using the 45 nm interconnect technology manual [21].

The voltage at the input terminal of the SA is shown for both schemes, along with the SA output, in Fig. 7(a). The conventional scheme requires more than 2 ns for \(SA_{in}\) to charge up and the SA output to respond. In contrast, the proposed scheme charges the SA input terminal quickly, with a response time below one nanosecond. Figure 7(b) shows how the SA output response time increases with the device LRS resistance at a fixed LRS/HRS ratio. For the 1T1R scheme, the read time increases drastically with device resistance. In comparison, the proposed bitcell shows no latency degradation with \(R_{LRS}\), enabling fast reads (<1 ns) even with high-resistance devices. However, for very large resistance (100 M\(\varOmega \)) the performance of the proposed scheme degrades, because the extremely low RRAM current takes a long time to charge the gate capacitance of the read transistor. Figure 7(c) shows the reduced read latency of the proposed scheme for various RRAM resistances. The circuit consumes write energy comparable to the conventional scheme at much lower read latencies, thus breaking the trade-off (Fig. 7(d)).
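Both the flat 2T1R latency in Fig. 7(b) and its eventual degradation at 100 M\(\varOmega \) can be reproduced by a two-stage model: the RRAM charges only the read-transistor gate, after which the read transistor charges the line. A sketch with assumed constants (only \(C_{line}\) is the paper's value [21]):

```python
import numpy as np

# Two-stage read-latency model for the 2T1R scheme (sketch): the RRAM
# first charges the read-transistor gate, then the read transistor
# charges the line. Constants other than C_LINE [21] are assumptions.
C_GATE = 0.1e-15     # read-transistor gate capacitance (F)
C_LINE = 15.07e-15   # line capacitance for a 256x64 array (F) [21]
R_READ_TR = 10e3     # read-transistor effective on-resistance (assumed)
LN2 = np.log(2.0)    # charge to half of the final value (assumed)

for r_lrs in [1e5, 1e6, 1e7, 1e8]:
    t_1t1r = r_lrs * C_LINE * LN2                    # RRAM drives the line
    t_2t1r = (r_lrs * C_GATE + R_READ_TR * C_LINE) * LN2
    print(f"R_LRS = {r_lrs:>10.0f} Ohm | 1T1R ~ {t_1t1r * 1e9:8.2f} ns"
          f" | 2T1R ~ {t_2t1r * 1e9:6.3f} ns")
```

In this model the 2T1R latency stays well below 1 ns up to 10 M\(\varOmega \), but the gate-charging term \(R_{LRS} \cdot C_{gate}\) dominates at 100 M\(\varOmega \), mirroring the degradation described above.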

Fig. 7. (a) Faster RC charging in the proposed scheme improves the response time. (b) Read latency increases with the LRS resistance (HRS/LRS ratio kept constant); the degradation is mitigated by the proposed 2T1R scheme. (c) Isolating the RRAM from the critical read path decouples the read and write current paths, breaking the write energy vs. read latency trade-off.

3 Architecture Level Performance Estimation

Fig. 8. Memory banks for SRAM and RRAM with components colour coded (Color figure online).

Figure 8 shows the schematics for the SRAM and RRAM memory banks. The components are divided into four colour-coded categories as shown; the same CMOS peripheral components are used in both cases, while components specific to one of the schemes are shown in blue. An SRAM bank of size 16\(\,\times \,\)8 is simulated using the NCSU SRAM compiler [13] in 45 nm technology [31]; an RRAM bank of the same size is simulated in HSPICE. The energies are calculated and scaled to an array size of 256\(\,\times \,\)64. The comparison of area and energy between the conventional and proposed schemes is explained next.

3.1 Energy

Both the RRAM and SRAM arrays are simulated in HSPICE, and the instantaneous power is recorded for each component. The total energy consumed in a write or read cycle is then calculated. For the RRAM array, the devices are switched from LRS to HRS to obtain the write energy, while the energy spent reading a device in LRS is taken as the read energy. The write latency is assumed to be 10 ns and the read latency 1 ns. The standby (SB) energy is calculated by turning the RS and CS off, isolating the array from the power source. For SRAM, all bits are programmed to state 0 and then flipped to extract the write energy. These per-cycle energy values are used in the architectural simulations to calculate the total energy consumed. Component-wise write, read, and standby energies for the SRAM and RRAM banks are shown in Fig. 9(a).
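The per-cycle extraction amounts to integrating each component's instantaneous power trace over the cycle. A minimal sketch (the trace below is a placeholder; in practice it would be parsed from the HSPICE output):

```python
import numpy as np

# Per-cycle energy extraction sketch: integrate each component's
# instantaneous power trace over the cycle. The trace below is a
# placeholder; in practice it is parsed from the HSPICE output.
def cycle_energy(t, p):
    """Energy (J) from time stamps t (s) and instantaneous power p (W)."""
    return float(np.sum(0.5 * (p[1:] + p[:-1]) * np.diff(t)))  # trapezoid

t = np.linspace(0.0, 10e-9, 1001)      # assumed 10 ns write cycle
p = 50e-6 * np.ones_like(t)            # placeholder 50 uW power trace
print(f"write energy ~ {cycle_energy(t, p) * 1e15:.0f} fJ")    # ~500 fJ
```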

RRAM consumes significantly more energy than SRAM during writes because of the long write timescales, which makes reducing write energy through highly resistive devices important. The read energies of RRAM and SRAM are comparable. In both reading and writing, however, the proposed scheme consumes more energy than the conventional one due to its additional transistors. The RRAM-specific components (i.e., the row/column selectors) consume the major fraction of power in writing, while the SA consumes the major fraction in reading. In standby mode, SRAM consumes static power, whereas the RRAM array is disconnected from its peripherals by turning off the RS and CS pass-gate switches, resulting in a 20\(\times \) energy reduction in SB mode. These energy values are used in the cache replacement simulations presented in the next section.

Fig. 9. (a) Energy in the SRAM and RRAM banks in different regimes of operation. (b) Component-wise area consumption in the memory banks.

3.2 Area

Figure 9(b) shows the area reduction achieved by using RRAM in place of SRAM in the bitcell array. The SRAM bitcell area is reported to be \(146F^2\) [29], while the RRAM 1T1R structure consumes \(>8F^2\) [5]. We assume conservative estimates of \(10F^2\) for the 1T1R and \(25F^2\) for the 2T1R scheme, giving area advantages of 14.6\(\times \) and 5.84\(\times \) for the conventional and proposed schemes, respectively. Part of this advantage is taken up by the RRAM-specific blocks shown in Fig. 8, such as the new sense amplifiers and drivers. Thus, the 1T1R and 2T1R schemes can conservatively pack 8\(\times \) and 4\(\times \) more memory than SRAM. A more rigorous area estimation is left for future work.
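The headline numbers follow from simple ratios of the bitcell areas:

```python
# Area-advantage arithmetic of Sect. 3.2 (bitcell areas in F^2).
SRAM_CELL = 146                       # 6T-SRAM bitcell area [29]
RRAM_CELL = {"1T1R": 10, "2T1R": 25}  # conservative assumptions above
for name, area in RRAM_CELL.items():
    print(f"{name}: {SRAM_CELL / area:.2f}x bitcell-area advantage")
# After budgeting for the RRAM-specific peripherals of Fig. 8, the text
# rounds these down to 8x (1T1R) and 4x (2T1R) extra memory capacity.
```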

3.3 Cache Replacement

Table I shows the simulation parameters used. We use both high-performance (x86) [3] and embedded (ARM) [4] processor architectures to analyze the applicability of this scheme (Fig. 10). We use 10 programs from the MiBench benchmark suite [12] for the embedded architecture and 10 programs from the SPEC benchmark suite [1] for the high-performance architecture. 2 kB (256\(\,\times \,\)64) memory banks are used, as shown in Fig. 10. We replace the L1 instruction cache in the HP processor and the L0 instruction cache in the embedded processor with RRAM in gem5 simulations [5].

Fig. 10. Processor configuration in (a) high-performance and (b) embedded architectures. (c) Memory specifications for the instruction cache.

Throughput. The throughput (instructions per cycle, IPC) for each case is extracted from gem5 [5] simulations. Figure 11(a) shows that the conventional 1T1R scheme causes a large IPC degradation, while the proposed scheme is comparable to SRAM, with only 0.1% degradation for the HP and 1.6% for the embedded architecture. This is because the proposed scheme, along with faster reading, provides a larger memory size, which reduces the miss rate, i.e., the frequency of requests to the L2 cache for data to be stored in L1. The resulting reduction in high-latency write instances enables the improved performance.

Fig. 11. Comparison of performance parameters for the conventional and proposed schemes, normalized to SRAM, for the high-performance x86 architecture. (a) Marginal degradation in mean throughput relative to the SRAM baseline, attributed to the larger memory size at a read latency equal to that of SRAM. (b) RRAM consumes less energy under both the proposed and conventional schemes. (c) Mean energy-delay product reduces by 82% compared to SRAM.

Energy. The energy per read/write/SB cycle extracted in Sect. 3.1 (Fig. 9(a)) is used to calculate the total energy consumption for the different benchmark programs. The number of read/write accesses is extracted from the simulations to obtain the energy spent in reading and writing; during the remaining cycles, the memory consumes standby energy. The normalized energy plot in Fig. 11(b) shows a reduction in energy for both RRAM schemes: a mean energy reduction of 81% for HP and 53% for embedded programs. The energy saving in RRAMs comes from the large number of cycles spent in standby mode, since RRAMs have very low SB energy consumption compared to SRAMs.
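The accounting behind Fig. 11(b) can be sketched as follows (the access and cycle counts are placeholders; real values come from the gem5 statistics):

```python
# Total-energy accounting behind Fig. 11(b) (sketch). Per-cycle
# energies come from the bank simulations (Fig. 9(a)); access and
# cycle counts come from gem5. All numbers below are placeholders.
def total_energy(n_read, n_write, n_cycles, e_read, e_write, e_sb):
    """Program energy: active read/write cycles plus standby cycles."""
    n_sb = n_cycles - n_read - n_write   # remaining cycles sit in SB
    return n_read * e_read + n_write * e_write + n_sb * e_sb

e = total_energy(n_read=5_000_000, n_write=50_000, n_cycles=10_000_000,
                 e_read=50e-15, e_write=500e-15, e_sb=1e-15)
print(f"total instruction-cache energy ~ {e * 1e9:.1f} nJ")
```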

Energy-Delay Product (EDP). The energy calculated above is used to compare the performance of RRAM and SRAM using the EDP, defined only for the instruction cache. The delay is defined as the duration for which the cache is active for writing or reading. Mathematically,

$$L1_{delay} = L1_{read\ latency} + L1_{miss\ rate} \times (L1_{write\ latency}+ L2_{access\ latency})$$
$$L2_{access\ latency} = L2_{read\ latency} + L2_{miss\ rate}\times (L2_{write\ latency} + L3_{access\ latency})$$
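As a concrete reading of this model, the sketch below evaluates the recursion, truncated at the L3 access latency as in the second equation; the latencies and miss rates are placeholders, not the Table I parameters:

```python
# Direct transcription of the delay model above; the recursion is
# truncated at the L3 access latency. Latencies (ns) and miss rates
# are placeholders, not the Table I parameters.
def l2_access(l2_read, l2_miss, l2_write, l3_access):
    return l2_read + l2_miss * (l2_write + l3_access)

def l1_delay(l1_read, l1_miss, l1_write, l2_acc):
    return l1_read + l1_miss * (l1_write + l2_acc)

l2_acc = l2_access(l2_read=5.0, l2_miss=0.05, l2_write=5.0, l3_access=30.0)
d = l1_delay(l1_read=1.0, l1_miss=0.02, l1_write=10.0, l2_acc=l2_acc)
edp = 280e-9 * d   # EDP = total energy (J) x delay (ns), per the text
print(f"L1 delay ~ {d:.3f} ns, EDP ~ {edp:.3e} J*ns")
```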

The normalized EDP (Fig. 11(c)) reduces by 82% for the HP and 53% for the embedded processor under the proposed scheme. Table II summarizes the cache replacement results.

4 Conclusion

We proposed a circuit modification of the conventional 1T1R RRAM bitcell that boosts the read speed for high-resistance devices, resolving the fast-read vs. low-energy dilemma. We analyzed the performance of this fast-read RRAM memory as an SRAM replacement at the instruction cache level. EDP reductions of 82% and 53% were observed for the x86 and ARM architectures, respectively, showing the potential of RRAM as an SRAM substitute in lower-level caches. Extensive circuit-level exploration of the variability and reliability of the scheme is left for future work.