In-memory computing (IMC) has been identified as a promising paradigm for hardware neural network accelerators thanks to the reduced data movement and improved parallelism. A known issue of IMC is the relatively large summation current within the memory array, which causes energy inefficiency and computing inaccuracy due to IR drop. An additional burden is the area and energy-demanding readout circuits, which limit the density and energy efficiency of computing. This work reports a hardware demonstration of IMC using 3-D crosspoint (3DXP) arrays of ovonic threshold switches and phase change memories (PCMs). We demonstrate a precise program–verify (PV) algorithm optimized for the subthreshold regime, allowing for a reduction of the operating currents by more than two orders of magnitude with respect to the conventional 3DXP technology. We experimentally demonstrate vector–vector multiplication (VVM) and feature extractions, which are key operations of convolutional neural networks (CNNs). Simulation study of LeNet5 with binary and ternary quantization, including device variability, 1/f noise, and drift, demonstrates high accuracy and low-energy inference thanks to precise programming, subthreshold operation, and careful drift compensation.
3-D Crosspoint (3DXP) Memory Arrays With Subthreshold Operation for Low-Energy, High-Accuracy Neural Network Accelerators
Carletti, F.;Farronato, M.;Hu, G. Y. C.;Lepri, N.;Pirovano, A.;Ielmini, D.
2025-01-01
Abstract
In-memory computing (IMC) has been identified as a promising paradigm for hardware neural network accelerators thanks to the reduced data movement and improved parallelism. A known issue of IMC is the relatively large summation current within the memory array, which causes energy inefficiency and computing inaccuracy due to IR drop. An additional burden is the area and energy-demanding readout circuits, which limit the density and energy efficiency of computing. This work reports a hardware demonstration of IMC using 3-D crosspoint (3DXP) arrays of ovonic threshold switches and phase change memories (PCMs). We demonstrate a precise program–verify (PV) algorithm optimized for the subthreshold regime, allowing for a reduction of the operating currents by more than two orders of magnitude with respect to the conventional 3DXP technology. We experimentally demonstrate vector–vector multiplication (VVM) and feature extractions, which are key operations of convolutional neural networks (CNNs). Simulation study of LeNet5 with binary and ternary quantization, including device variability, 1/f noise, and drift, demonstrates high accuracy and low-energy inference thanks to precise programming, subthreshold operation, and careful drift compensation.| File | Dimensione | Formato | |
|---|---|---|---|
|
2026_ted_3dxp.pdf
accesso aperto
:
Publisher’s version
Dimensione
2.43 MB
Formato
Adobe PDF
|
2.43 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


