skip to main content
10.1145/3649329.3655673acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article
Open access

Digital CIM with Noisy SRAM Bit: A Compact Clustered Annealer for Large-Scale Combinatorial Optimization

Published: 07 November 2024 Publication History

Abstract

Combinatorial optimization problems (COP) are NP-hard and intractable to solve using conventional computing. The Ising model-based annealer has gained increasing attention recently due to its efficiency and speed in finding approximate solutions. However, Ising solvers for travelling salesman problems (TSP) usually suffer from a scalability issue due to quadratically increasing number of spins. In this paper, we propose a digital computing-in-memory (CIM) based clustered annealer to solve tens of thousands of city-scale TSP with only a few mega-byte (MB) of static random access memory (SRAM), using hierarchical clustering to solve input sparsity and digital CIM flexibility to solve weight sparsity. The intrinsic process variations between SRAM devices are utilized to generate the noisy bit errors during pseudo-read under reduced supply voltage, realizing the annealing process. The design space of cluster size and programmability is explored to understand the trade-offs of solution quality and hardware cost, for TSP scale ranging from 3080 to 85900 cities. The proposed design speeds up the convergence by >109× with <25% solution quality overhead compared with the CPU baseline. The comparison with state-of-the-art scalable annealers shows a >1013× improvement on functionally normalized area and power.

References

[1]
A. R. Karlin, et al., "A (slightly) improved approximation algorithm for metric TSP," ACM SIGACT STOC, pp. 32--45, 2021.
[2]
A. Dan, et al., "Clustering approach for solving traveling salesman problems via Ising model based solver", ACM/IEEE DAC, 2020.
[3]
A. Lu, et al., "Scalable in-memory clustered annealer with temporal noise of charge trap transistor for large scale travelling salesman problems," IEEE J. Emerg. Sel. Top. Circuits Syst., 13(1), pp. 422--435, 2023.
[4]
M. Yamaoka, et al., "A 20k-spin Ising chip to solve combinatorial optimization problems with CMOS annealing," IEEE JSSC, 51(1), pp. 303--309, 2015.
[5]
M. Bagherbeik, et al., "A permutational boltzmann machine with parallel tempering for solving combinatorial optimization problems", PPSN, pp. 317--331, 2020.
[6]
Y.-D. Chih, et al., "An 89TOPS/W and 16.3 TOPS/mm 2 all-digital SRAM-based full-precision compute-in memory macro in 22nm for machine-learning edge applications," IEEE ISSCC, 2021.
[7]
C.-F. Lee, et al., "A 12nm 121-TOPS/W 41.6-TOPS/mm2 all digital full precision SRAM-based compute-in-memory with configurable bit-width for AI edge applications," IEEE VLSI, 2022.
[8]
H. Fujiwara, et al., "A 5-nm 254-TOPS/W 221-TOPS/mm2 fully-digital computing-in-memory macro supporting wide-range dynamic-voltage-frequency scaling and simultaneous MAC and write operations," IEEE ISSCC, 2022.
[9]
S. Yu, et al., "Compute-in-memory chips for deep learning: Recent trends and prospects," IEEE Circuits Syst. Mag., 21(3), pp. 31--56, 2021.
[10]
J. Gonzalez, et al., "Parallel Gibbs sampling: From colored fields to thin junction trees," AISTATS, PMLR, pp. 324--332, 2011.
[11]
G. Reinelt, "TSPLIB---A traveling salesman problem library," ORSA J. Comput., 3(4), pp. 376--384, 1991.
[12]
X. Peng, et al., "DNN+NeuroSim: An end-to-end benchmarking framework for compute-in-memory accelerators with versatile device technologies," IEEE IEDM, 2019.
[13]
Concorde Benchmarks (from 1999), https://www.math.uwaterloo.ca/tsp/concorde/benchmarks/bench99.html.
[14]
Y. Zou and M. Lin, "Massively simulating adiabatic bifurcations with FPGA to solve combinatorial optimization," ACM/SIGDA FPGA, 2020.
[15]
H. Goto, et al., "High-performance combinatorial optimization based on classical mechanics," Sci. Adv., 7(6), p. eabe7953, 2021.
[16]
T. Zhang and J. Han, " Efficient traveling salesman problem solvers using the Ising model with simulated bifurcation," DATE, 2022.
[17]
D. Shin, et al., "Memory-efficient fpga implementation of stochastic simulated annealing," IEEE J. Emerg. Sel. Top. Circuits Syst., 13(1), pp. 108--118, 2023.
[18]
K. Yamamoto, et al., "STATICA: A 512-spin 0.25M-weight full-digital annealing processor with a near-memory all-spin-updates-at-once architecture for combinatorial optimization with complete spin-spin interactions," IEEE ISSCC, 2020.
[19]
Q. Tao, and J. Han, "Solving traveling salesman problems via a parallel fully connected Ising machine," ACM/IEEE DAC, 2022.
[20]
N.A. Aadit, et al., "Accelerating Adaptive Parallel Tempering with FPGA-based p-bits," IEEE VLSI, 2023.
[21]
S. Sanyal, K. Roy, "Neuro-Ising: Accelerating large scale travelling salesman problems via graph neural network guided localized Ising solvers", IEEE Trans. Comput.-Aided Des. Integr. Circuits Syst., pp. 1--1, 2022.
[22]
Y. Su, et al., "CIM-spin: A 0.5-to-1.2 V scalable annealing processor using digital compute-in-memory spin operators and register-based spins for combinatorial optimization problems," IEEE ISSCC, pp. 480--482, 2020.
[23]
T. Takemoto, et al., "A 144Kb annealing system composed of 9×16Kb annealing processor chips with scalable chip-to-chip connections for large-scale combinatorial optimization problems," IEEE ISSCC, 2021.
[24]
Y. Su, et al., " Flexspin: A scalable CMOS Ising machine with 256 flexible spin processing elements for solving complex combinatorial optimization problems," IEEE ISSCC, 2022.
[25]
K. Kawamura, et al., "Amorphica: 4-replica 512 fully connected spin 336MHz metamorphic annealer with programmable optimization strategy and compressed-spin-transfer multi-chip extension," IEEE ISSCC, 2023.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
DAC '24: Proceedings of the 61st ACM/IEEE Design Automation Conference
June 2024
2159 pages
ISBN:9798400706011
DOI:10.1145/3649329
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2024

Check for updates

Author Tags

  1. travelling salesman problem
  2. combinatorial optimization problem
  3. simulated annealing
  4. digital in-memory computing
  5. ising model

Qualifiers

  • Research-article

Funding Sources

Conference

DAC '24
Sponsor:
DAC '24: 61st ACM/IEEE Design Automation Conference
June 23 - 27, 2024
CA, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 286
    Total Downloads
  • Downloads (Last 12 months)286
  • Downloads (Last 6 weeks)126
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media