Research Article | Open Access
DOI: 10.1145/3649329.3658472

Improving the Efficiency of In-Memory-Computing Macro with a Hybrid Analog-Digital Computing Mode for Lossless Neural Network Inference

Published: 07 November 2024

Abstract

Analog in-memory computing (IMC) is an attractive technique for processing machine learning workloads with high energy efficiency. However, the analog computing scheme suffers from large interface-circuit overhead. In this work, we propose a macro with a hybrid analog-digital computing mode that reduces the precision requirement of the interface circuit. Considering the distribution of multiply-and-accumulate (MAC) values, we propose a nonlinear transfer function for the computing circuits: only the low MAC values are computed accurately in the analog domain, while a digital mode handles the less probable high MAC values. Silicon measurement results show that the proposed macro achieves 160 GOPS/mm² area efficiency and 25.5 TOPS/W for 8b/8b matrix computation. An architectural-level evaluation on real workloads shows that the proposed macro achieves up to 2.92× higher energy efficiency than conventional analog IMC designs.
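The hybrid mode described above can be sketched numerically. The snippet below is not the authors' circuit or code: the threshold value, the saturating analog_readout model, and the binary/ternary operands are assumptions made only to illustrate how a coarse analog path for low MAC values, combined with a rare exact digital fallback, can keep the overall result lossless.

    import numpy as np

    # Illustrative parameter only: the paper does not disclose its threshold or
    # interface precision, so the value below is an assumption for this sketch.
    ANALOG_THRESHOLD = 15          # assumed span of the low-value "analog" region


    def analog_readout(mac: int) -> int:
        """Model of a coarse analog interface with a nonlinear transfer function:
        fine resolution for low MAC values, saturation for the rare high ones."""
        return int(np.clip(mac, -ANALOG_THRESHOLD, ANALOG_THRESHOLD))


    def hybrid_mac(activations: np.ndarray, weights: np.ndarray) -> int:
        """Compute one MAC in a hybrid analog/digital mode (sketch)."""
        mac = int(np.dot(activations, weights))   # what ideal hardware would produce
        analog = analog_readout(mac)
        if abs(analog) < ANALOG_THRESHOLD:
            # Not saturated: the analog path already holds the exact MAC value.
            return analog
        # Saturated, i.e. a statistically rare high MAC value: fall back to an
        # exact digital computation so the overall result stays lossless.
        return mac


    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        # Simplified sparse operands so most MACs land in the low-value region.
        x = rng.integers(0, 2, size=64)               # binary activations (assumption)
        w = rng.integers(-1, 2, size=64)              # ternary weights (assumption)
        assert hybrid_mac(x, w) == int(np.dot(x, w))  # hybrid result matches exact MAC
        print(hybrid_mac(x, w))

Because typical activation and weight distributions concentrate MAC values near zero, the digital fallback fires only for the rare high-value tail, which is the intuition behind relaxing the precision of the analog interface circuit.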

Published In

DAC '24: Proceedings of the 61st ACM/IEEE Design Automation Conference
June 2024, 2159 pages
ISBN: 9798400706011
DOI: 10.1145/3649329
This work is licensed under a Creative Commons Attribution 4.0 International License.

Publisher

Association for Computing Machinery, New York, NY, United States

              Author Tags

              1. processing-in-memory
              2. SRAM
              3. machine learning acceleration

Conference

DAC '24: 61st ACM/IEEE Design Automation Conference
June 23-27, 2024
San Francisco, CA, USA

              Acceptance Rates

              Overall Acceptance Rate 1,770 of 5,499 submissions, 32%
