# Energy-Efficient High-Accuracy Spiking Neural Network Inference Using Time-Domain Neurons

Joonghyun Song\*, Jiwon Shin\*, Hanseok Kim\*<sup>†</sup>, and Woo-Seok Choi\* \*Department of ECE, ISRC, Seoul National University, Seoul, South Korea <sup>†</sup>Samsung Electronics, Hwaseong, South Korea Email: {jhsong1997, wooseokchoi}@snu.ac.kr

Abstract—Due to the limitations of realizing artificial neural networks on prevalent von Neumann architectures, recent studies have presented neuromorphic systems based on spiking neural networks (SNNs) to reduce power and computational cost. However, conventional analog voltage-domain integrate-and-fire (I&F) neuron circuits, based on either current mirrors or op-amps, pose serious issues such as nonlinearity or high power consumption, thereby degrading either inference accuracy or energy efficiency of the SNN. To achieve excellent energy efficiency and high accuracy simultaneously, this paper presents a low-power highly linear time-domain I&F neuron circuit. Designed and simulated in a 28 nm CMOS process, the proposed neuron leads to more than  $4.3 \times$  lower error rate on the MNIST inference over the conventional current-mirror-based neurons. In addition, the power consumed by the proposed neuron circuit is simulated to be 0.230  $\mu$ W per neuron, which is orders of magnitude lower than the existing voltage-domain neurons.

*Keywords*—artificial neural network, spiking neural network, ANN-to-SNN conversion, integrate-and-fire neuron, time-domain signal processing

## I. INTRODUCTION

Artificial neural networks (ANNs) have been utilized to achieve groundbreaking results in a variety of fields such as image recognition [1], speech recognition [2], and machine translation [3]. However, implementing such networks with a large number of parameters on conventional von Neumann architectures incurs tremendous latency and energy costs dominated by memory access, which limits the use of deep networks in mobile applications [4]. Hence, recent studies have focused on developing a new type of system that can replace the prevalent architectures. In particular, neuromorphic systems using spiking neural networks (SNNs) are considered an alternative since they are effective in reducing both power consumption and computational effort [5]. While there have been many studies on direct SNN training methods, SNNs based on simple integrate-and-fire (I&F) neurons with the ANN-to-SNN conversion technique [6] show better inference accuracy than other models. Thus, this paper focuses on the design of a novel I&F neuron that leads to high-accuracy SNN inference in an energy-efficient manner.

Since conventional voltage-domain analog neurons exhibit nonlinear behavior that degrades inference accuracy, [7] proposes a time-domain neuron, which improves linearity and thereby achieves inference accuracy on a par with that of ANNs. However, how to design a low-power time-domain neuron and, more importantly, whether such neurons can be used to implement an SNN that achieves good energy efficiency and high accuracy simultaneously remain open questions. To answer these questions, this paper presents a low-power design of the time-domain I&F neuron and simulates an SNN for MNIST inference using the proposed neuron. Designed and simulated in a 28 nm CMOS process, the proposed neuron achieves a markedly lower inference error than existing current-mirror-based analog neurons while consuming far less power than existing voltage-domain neurons.

## II. TIME-DOMAIN NEURON

### A. Limitations of Voltage-Domain Neurons

Current-mirror-based neurons [8], [9] and op-amp-based neurons [10], [11] are the two types of analog I&F neurons commonly used in hardware-based SNNs. The I&F neurons receive pre-synaptic spikes from the preceding layer and accumulate them as membrane potential in the voltage domain. When the membrane potential exceeds a threshold, a spike is generated by the neuron and delivered to the next layer. These voltage-domain neurons suffer from serious issues in either performance or power. In the current-mirror-based neuron, the membrane potential (the capacitor voltage) changes, through channel-length modulation, the amount of current that flows from the current mirror into the capacitor, so the neuron behaves nonlinearly [7]. This degrades the performance of the SNN, as validated later in Section III. On the other hand, op-amp-based neurons require a high supply voltage to provide sufficient gain and linearity, and therefore suffer from high power consumption.
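The effect of channel-length modulation on the integration can be illustrated with a small behavioral sketch. This is not the circuit of [8] or [9]: it assumes a PMOS mirror output charging the membrane capacitor, the first-order model $I = I_0(1 + \lambda V_{SD})$, and purely hypothetical component values (`i_nominal`, `pulse_width`, `cap`, `vdd`, `lam`). The function name `integrate_spikes` is likewise introduced only for illustration.

```python
def integrate_spikes(n_spikes, i_nominal=1e-6, pulse_width=1e-9,
                     cap=1e-13, vdd=1.0, lam=0.0):
    """Return the membrane voltage after each of n_spikes identical input spikes.

    lam is the channel-length modulation coefficient in 1/V (hypothetical value);
    lam = 0 models an ideal current source.
    """
    v = 0.0
    trace = []
    for _ in range(n_spikes):
        # Assumed PMOS mirror output: its source-drain voltage shrinks
        # as the capacitor (membrane potential) charges up.
        v_sd = vdd - v
        # Normalized so that the current equals i_nominal when v = 0.
        i_out = i_nominal * (1.0 + lam * v_sd) / (1.0 + lam * vdd)
        v += i_out * pulse_width / cap
        trace.append(v)
    return trace


ideal = integrate_spikes(50, lam=0.0)
mirror = integrate_spikes(50, lam=0.5)
print(f"ideal: {ideal[-1]:.4f} V, with channel-length modulation: {mirror[-1]:.4f} V")
```

In this sketch the voltage step per spike shrinks as the membrane potential rises, so the same weight contributes less the closer the neuron is to its threshold, which is the kind of state-dependent weight error the text describes.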

The design challenges of the voltage-domain neurons get worse as technology scales down and the supply voltage reduces. Due to the lower output impedance/intrinsic gain and higher leakage of the devices, nonlinear behavior in the voltage-domain neurons becomes severe in deep sub-micron technologies. Moreover, as lower supply voltage reduces both voltage headroom and the range of membrane potential, performance degradation of the voltage-domain neurons is inevitable. These issues can be partly overcome by choosing large devices to design neurons, but this incurs large area/power overhead, limiting the usage of deeper and larger networks.

This research was supported in part by National R&D Program through the National Research Foundation of Korea (NRF) funded by Ministry of Science and ICT (2021M3F3A2A01037928) and by Ministry of Science and ICT (2020M3H2A1078119).

Fig. 1. Proposed architecture of the low-power mixed-signal time-domain I&F neuron.

### B. Time-Domain Signal Processing

Time-domain signal processing has emerged as an attractive technique to mitigate the analog circuit design challenges associated with process scaling and reduced voltage headroom, and it has been applied to analog filters [12], data converters [13], and PID controllers [14]. The main idea behind time-domain signal processing is to encode the variable of interest in the time, or phase, domain, which fundamentally solves the voltage-headroom-related issues. For instance, since phase is obtained by integrating frequency with respect to time, an ideal integrator of a voltage signal can be implemented in the phase domain using a voltage controlled oscillator (VCO) whose frequency is controlled by the voltage signal. An I&F neuron exploiting this concept is presented in [7], which shows highly linear behavior and better performance compared to the existing current-mirror-based voltage-domain neurons.
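The integrator property mentioned above can be written out explicitly. The symbols $f_0$ (free-running frequency) and $K_{\mathrm{VCO}}$ (tuning gain), and the assumption of a perfectly linear tuning characteristic, are notation introduced here for illustration and are not taken from [7]:

$$
f(t) = f_0 + K_{\mathrm{VCO}}\, v(t), \qquad
\Phi(t) = 2\pi \int_0^t f(\tau)\, d\tau = 2\pi f_0 t + 2\pi K_{\mathrm{VCO}} \int_0^t v(\tau)\, d\tau .
$$

Once the linear ramp $2\pi f_0 t$ is removed, for example by comparing against a reference oscillator running at $f_0$, the remaining phase is proportional to the time integral of $v(t)$. Unlike a capacitor voltage, this phase can keep accumulating without being limited by the supply voltage.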

In the time-domain neuron, whenever a spike is received from the preceding layer, a current pulse flows into the current mirror. Depending on the polarity of the synaptic weight, the amount of current that flows into the current-controlled oscillator (ICO) either increases or decreases. This changes the ICO frequency, and the phase shifts accordingly. Since the amount of phase shift is the accumulation of the weight-multiplied input spikes, the time-domain neuron generates and transmits a spike to the following layer when the phase shift reaches a threshold. Although time-domain neurons can potentially solve many design issues of voltage-domain neurons, the important question of whether both excellent energy efficiency and high inference accuracy can be achieved simultaneously using time-domain neurons has not yet been answered.

### C. Low-Power Design of Time-Domain Neuron

To answer the question, we first propose an ultra-low-power mixed-signal time-domain I&F neuron circuit (see Fig. 1). When a spike is received from the preceding layer, a current pulse (typically generated by synaptic devices like memristors [15]) is supplied from the synapse array and flows into the neuron circuit. Note that when an input spike is modulated with a positive (negative) weight, the ICO frequency should increase (decrease). A straightforward way to implement this is to use a current mirror as in [9], where PMOS and NMOS devices are used to add and subtract current flowing into the ICO, respectively. However, this implementation suffers from a mismatch between PMOS and NMOS, which manifests itself as weight error and leads to inference accuracy degradation. To circumvent this issue, the proposed neuron uses only NMOS devices to increase/decrease the ICO current as shown in Fig. 1. For positive weights, less current flows into the current mirror<sup>1</sup>, while more current flows in for negative weights. This results in an increase or decrease, respectively, in the ICO current (i.e., the ICO frequency), since a constant current is provided by the PMOS current source. With this time-domain neuron, the membrane potential is now embedded in the ICO phase.
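One way to read the description above is as a current balance at the ICO supply node. The symbols $I_P$ (constant PMOS source current), $I_{M,0}$ (steady-state mirror current), and $\Delta I(w)$ (weight-dependent change during an input pulse) are assumed notation, and the relation is a sketch inferred from the text rather than an equation given by the authors:

$$
I_{\mathrm{ICO}} = I_P - I_M, \qquad I_M = I_{M,0} - \Delta I(w)
\;\;\Longrightarrow\;\;
I_{\mathrm{ICO}} = \left(I_P - I_{M,0}\right) + \Delta I(w),
$$

with $\Delta I(w) > 0$ for a positive weight and $\Delta I(w) < 0$ for a negative weight. Under this reading, both polarities are realized by the same NMOS mirror, so no PMOS/NMOS matching is required between the add and subtract paths.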

To detect the amount of phase shift, which represents the membrane potential, a reference phase is required. The proposed neuron uses an identical ICO whose current, and hence frequency, is fixed to generate the reference. Note that, since this reference ICO can be shared by all the neurons, it adds negligible area overhead. Two counters are used to detect whether the phase difference between the two ICO outputs, *CLK* and *REF*, crosses a threshold by counting the numbers of rising edges. In our implementation, the threshold is crossed when the difference between the counter outputs becomes 1; a digital spike generator (see Fig. 1) then generates a spike and transmits it to the following layer. Since the two counters are clocked by different clocks, Gray code with cascaded flip-flops is used for clock domain crossing.
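The reason Gray code suits this clock domain crossing is that consecutive counter values differ in exactly one bit, so a counter sampled mid-transition by the other clock reads either the old or the new value, never an unrelated one. The sketch below uses the standard binary-reflected Gray code; the 4-bit width in the usage loop is an assumption for illustration, since the counter width is not stated in the text.

```python
def binary_to_gray(n: int) -> int:
    """Convert a non-negative binary count to binary-reflected Gray code."""
    return n ^ (n >> 1)


def gray_to_binary(g: int) -> int:
    """Convert a Gray-coded value back to its binary count."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n


# Print a hypothetical 4-bit counter sequence in binary and Gray code.
for count in range(16):
    print(f"{count:2d}  {count:04b}  {binary_to_gray(count):04b}")
```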

<sup>1</sup>This can be implemented by having current from the synapse constantly flow (steady state) and then gating the current when receiving an input spike.

Fig. 2. Spike generation and reset by subtraction mechanism.

The proposed time-domain I&F neuron circuit does not need any extra hardware to implement the "reset by subtraction" mechanism, which enables more accurate inference than "reset to zero" when converting ANNs into SNNs [6]. When a spike is generated, 1 is added to the reference counter value to reset the phase difference, as shown in Fig. 2, so that exactly the threshold amount ($2\pi$ of phase) is subtracted from the original phase difference. In the proposed design, to minimize the number of bits in the digital circuits and thus the area/power overhead, the threshold is set to $2\pi$ radians in phase, equivalent to a counter output difference of 1. Moreover, since the proposed neuron is highly digital, energy efficiency can be improved substantially by lowering the supply voltage to the range of 0.35 V to 0.5 V, which is not feasible for the existing analog voltage-domain neurons.
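The difference between the two reset schemes can be seen in a minimal behavioral sketch of an I&F neuron. The function name `count_spikes`, the normalized threshold of 1.0, and the input values are illustrative assumptions, and the model emits at most one spike per input step.

```python
def count_spikes(inputs, threshold=1.0, reset="subtract"):
    """Count output spikes of a simple I&F neuron for a list of input increments.

    reset="subtract": subtract the threshold on firing (residual charge is kept).
    reset="zero":     clear the membrane potential on firing (residual is lost).
    """
    v = 0.0
    spikes = 0
    for x in inputs:
        v += x
        if v >= threshold:
            spikes += 1
            v = v - threshold if reset == "subtract" else 0.0
    return spikes


inputs = [0.75] * 8  # total accumulated input of 6.0 thresholds
print(count_spikes(inputs, reset="subtract"))  # 6
print(count_spikes(inputs, reset="zero"))      # 4
```

With reset by subtraction the spike count equals the total accumulated input divided by the threshold, whereas reset to zero discards the overshoot at every spike and undercounts, which is consistent with the accuracy advantage reported in [6].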

## III. SIMULATION RESULTS

### A. Operation of Proposed Time-Domain I&F Neuron

The proposed time-domain I&F neuron circuit is designed and simulated in a 28 nm CMOS process. The simulated waveforms of the neuron are plotted in Fig. 3. When current pulses are periodically supplied from the synapse array, the phase difference between the two ICOs, $\Delta \Phi = \Phi_{CLK} - \Phi_{REF}$, eventually becomes larger than $2\pi$, which can be detected by comparing the numbers of rising edges of *CLK* and *REF*, counted by the two counters. If the counts differ, as shown in Fig. 3, a spike is generated and $2\pi$ is subtracted from the phase difference by resetting the counters. As the supply voltage is varied from 0.35 V to 0.5 V, the oscillation frequency of the designed ICO ranges from 3 MHz to 100 MHz, which is also the range of the maximum firing rate of the neuron.
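The counter-based detection described above can be mimicked with a short behavioral sketch. It is not the simulated circuit: it assumes the two counters are compared once per *REF* rising edge, represents phase in integer units (`units_per_cycle` units per $2\pi$) to avoid rounding, and models the reset by adding 1 to the reference count as described in Section II-C. The function name `simulate_neuron` and all numeric values are hypothetical.

```python
def simulate_neuron(extra_phase_per_ref_cycle, units_per_cycle=1000):
    """Return the REF-cycle indices at which the neuron fires.

    extra_phase_per_ref_cycle[k] is the extra CLK phase (in integer units,
    positive or negative) gained over REF during the k-th REF period.
    """
    clk_phase = 0   # accumulated CLK phase, in units
    ref_count = 0   # reference counter value (including reset increments)
    spikes = []
    for k, extra in enumerate(extra_phase_per_ref_cycle):
        clk_phase += units_per_cycle + extra      # CLK advance in one REF period
        ref_count += 1                            # REF rising edge
        clk_count = clk_phase // units_per_cycle  # CLK rising edges so far
        if clk_count - ref_count >= 1:            # phase difference reached 2*pi
            spikes.append(k)
            ref_count += 1                        # reset by subtraction
    return spikes


# A constant positive input worth a quarter cycle of extra phase per REF period.
print(simulate_neuron([250] * 12))  # [3, 7, 11]
```

In this model the neuron fires once for every full cycle of accumulated extra phase, and any phase beyond the threshold stays in `clk_phase` for the next spike.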

### B. SNN System Simulation with Proposed Neuron

Fig. 3. Simulated waveform of the proposed neuron.

Fig. 4. Simulated inference error rate on MNIST for different neurons.

Fig. 5. Simulated inference error rate versus latency for the proposed neurons with different maximum firing rates.

To evaluate the performance of the SNN using the proposed time-domain neurons, LeNet-5 [1] is trained on the MNIST dataset and converted to the SNN by extracting the network parameters using the ANN-to-SNN conversion tool [6]. Fig. 4 displays the inference error rate on MNIST with respect to the simulation time step for three different neurons: the ideal I&F neuron, the current-mirror-based voltage-domain neuron, and the proposed time-domain neuron. The simulated error rates of these three neurons are 0.99 % (ideal), 4.31 % (voltage-domain), and 0.98 % (proposed), respectively. While the accuracy of the time-domain neuron approaches that of the ideal I&F neuron and the original ANN (0.99 %), the voltage-domain neuron fails to reach the same accuracy level due to its nonlinearity. In Fig. 5, the error rates for the time-domain neurons with six different oscillation frequencies, or maximum firing rates (from 3 MHz to 100 MHz), are illustrated, where the x-axis represents the absolute latency in microseconds. The result indicates that, although all the neurons eventually approach the same accuracy level, a higher maximum firing rate yields lower inference latency.
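The error rates quoted above can be related to the improvement factor stated in the abstract by a one-line calculation:

$$
\frac{4.31\,\%}{0.98\,\%} \approx 4.40,
$$

which is consistent with the claim of a more than $4.3\times$ lower error rate compared with the current-mirror-based voltage-domain neuron.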



Fig. 6. Average energy per inference that each neuron with different maximum firing rate consumes to reach the error rates of 1%, 2.5%, 5%, and 10%.

TABLE I. Performance summary and comparison.

|                      | [8]            | [10]          | [11]       | This work         |
|----------------------|----------------|---------------|------------|-------------------|
| Process              | 800 nm         | 65 nm         | 65 nm      | 28 nm             |
| Domain               | Voltage        | Voltage       | Voltage    | Time              |
| Key element          | Current mirror | Op-amp        | Op-amp     | ICO               |
| Power per neuron     | $> 10\,\mu$W   | $14.4\,\mu$W  | $3\,\mu$W  | **0.230 $\mu$W**  |
| Energy per inference | N/A            | N/A           | N/A        | 3.72 nJ           |
| Inference accuracy   | N/A            | N/A           | N/A        | 99 %              |

### C. Energy Efficiency of Proposed Neuron

Fig. 6 shows the total energy consumed per neuron for each inference to achieve a certain level of accuracy for the proposed neuron with different maximum firing rates. Neurons with a lower ICO frequency consume less power than those with a higher frequency. However, since more time must be spent to achieve the same level of accuracy (see Fig. 5), lower power does not necessarily imply lower energy per inference, as shown in Fig. 6. For example, the lowest total energy per neuron to achieve 1 % error is 0.488 pJ at 15 MHz ICO frequency, while the highest is 0.646 pJ at 3 MHz. It is important to note that there is an optimal operating point that minimizes the total energy consumption, e.g., the 15 MHz operating frequency in our simulation results. Therefore, when choosing the operating frequency of the time-domain neuron, both total energy consumption and latency should be carefully considered.
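The trade-off can be stated compactly. Writing $P(f)$ for the average power of a neuron at ICO frequency $f$ and $T(f)$ for the time needed to reach the target error rate (both symbols are notation assumed here, with the power taken as constant over the inference), the energy per neuron and the ratio of the two quoted operating points are

$$
E(f) = P(f)\,T(f), \qquad
\frac{E(3\ \mathrm{MHz})}{E(15\ \mathrm{MHz})} = \frac{0.646\ \mathrm{pJ}}{0.488\ \mathrm{pJ}} \approx 1.32 .
$$

That is, at the 1 % error target, the lowest-power setting costs roughly 32 % more energy per neuron than the 15 MHz setting, because its longer $T(f)$ outweighs its lower $P(f)$.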

Table I summarizes the performance of the proposed neuron and compares it with the prior art. The proposed neuron consumes $0.230\,\mu$W, which is substantially lower than the power of the voltage-domain neurons. Note that, as technology scales down, the voltage-domain neurons will suffer from either higher power consumption or degraded inference accuracy due to nonlinearity, so the benefits of the proposed time-domain neuron are expected to grow. The SNN using the proposed neuron consumes only 3.72 nJ for classifying an MNIST image while achieving 99 % inference accuracy.

## IV. CONCLUSION

This paper proposes a low-power highly linear I&F neuron circuit composed of a time-domain integrator and a digital spike generator. Designed and simulated in a 28 nm CMOS process, the proposed neuron leads to more than  $4.3 \times$  lower error rate on the MNIST inference over the conventional analog current-mirror-based neurons. In addition, the power consumed by the proposed neuron circuit is  $0.230 \,\mu$ W per neuron, which is orders of magnitude lower than the existing voltage-domain neurons. The simulation results indicate that the proposed time-domain neuron enables the SNN inference to achieve excellent energy efficiency and high accuracy.

## ACKNOWLEDGMENT

The EDA Tool was supported by the IC Design Education Center.

## REFERENCES

- [1] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," *Proceedings of the IEEE*, vol. 86, no. 11, pp. 2278–2324, 1998.
- [2] G. Hinton *et al.*, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," *IEEE Signal processing magazine*, vol. 29, no. 6, pp. 82–97, 2012.
- [3] Y. Wu *et al.*, "Google's neural machine translation system: Bridging the gap between human and machine translation," *arXiv preprint arXiv:1609.08144*, 2016.
- [4] M. Horowitz, "1.1 computing's energy problem (and what we can do about it)," in 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC). IEEE, 2014, pp. 10–14.
- [5] J.-s. Seo et al., "A 45nm CMOS neuromorphic chip with a scalable architecture for learning in networks of spiking neurons," in 2011 IEEE Custom Integrated Circuits Conference (CICC). IEEE, 2011, pp. 1–4.
- [6] B. Rueckauer, I.-A. Lungu, Y. Hu, M. Pfeiffer, and S.-C. Liu, "Conversion of continuous-valued deep networks to efficient event-driven networks for image classification," *Frontiers in neuroscience*, vol. 11, p. 682, 2017.
- [7] H. Kim and W.-S. Choi, "Improving spiking neural network accuracy using time-based neurons," arXiv preprint arXiv:2201.01394, 2022.
- [8] G. Indiveri, E. Chicca, and R. Douglas, "A VLSI array of low-power spiking neurons and bistable synapses with spike-timing dependent plasticity," *IEEE transactions on neural networks*, vol. 17, no. 1, pp. 211–221, 2006.
- [9] S. Hwang, J. Chang, M.-H. Oh, J.-H. Lee, and B.-G. Park, "Impact of the sub-resting membrane potential on accurate inference in spiking neural networks," *Scientific reports*, vol. 10, no. 1, pp. 1–10, 2020.
- [10] S. A. Aamir *et al.*, "An accelerated LIF neuronal network array for a large-scale mixed-signal neuromorphic architecture," *IEEE Transactions on Circuits and Systems I: Regular Papers*, vol. 65, no. 12, pp. 4299–4312, 2018.
- [11] M. S. Asghar, S. Arslan, and H. Kim, "Current multiplier based synapse and neuron circuits for compact SNN chip," in 2021 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, 2021, pp. 1–4.
- [12] B. Drost, M. Talegaonkar, and P. K. Hanumolu, "Analog filter design using ring oscillator integrators," *IEEE Journal of Solid-State Circuits*, vol. 47, no. 12, pp. 3120–3129, 2012.
- [13] K. Reddy, S. Rao, R. Inti, B. Young, A. Elshazly, M. Talegaonkar, and P. K. Hanumolu, "A 16-mW 78-dB SNDR 10-MHz BW CT ΔΣ ADC using residue-cancelling VCO-based quantizer," *IEEE journal of solid-state circuits*, vol. 47, no. 12, pp. 2916–2927, 2012.
- [14] S. J. Kim, W.-S. Choi, R. Pilawa-Podgurski, and P. K. Hanumolu, "A 10-MHz 2–800-mA 0.5–1.5-V 90% peak efficiency time-based buck converter with seamless transition between PWM/PFM modes," *IEEE Journal of Solid-State Circuits*, vol. 53, no. 3, pp. 814–824, 2017.
- [15] M. Chu, B. Kim, S. Park, H. Hwang, M. Jeon, B. H. Lee, and B.-G. Lee, "Neuromorphic hardware system for visual pattern recognition with memristor array and CMOS neuron," *IEEE Transactions on Industrial Electronics*, vol. 62, no. 4, pp. 2410–2419, 2014.