In-memory computing (IMC) has emerged as a promising solution for artificial intelligence (AI) accelerators due to the reduced data movement and improved parallelism in the crosspoint array. However, IMC faces several limitations such as the device variations affecting the computing accuracy, the area- and energy-consuming peripheral circuitry, and the time-consuming high-voltage program-verify operations of the nonvolatile memory (NVM). In addition, the relatively large summation currents cause IR drop within the array, leading to further loss of accuracy. To overcome these issues, this work presents 1-bit quantized networks based on spin-orbit-torque magnetoresistive random access-memory (SOT-MRAM) with high resistance-area (RA) product. We provide a detailed statistical characterization of SOT-MRAM arrays and develop quantization-aware training of various neural networks. Our results indicate that SOT-MRAM enables: 1) high inference accuracy, thanks to excellent uniformity; 2) negligible input-dependent IR drop nonlinearity, thanks to high resistance; and 3) high-speed, low-power reconfiguration, thanks to fast device programming. These results support high-RA SOT-MRAM for digital-like and reconfigurable IMC accelerators of edge AI.

High-Accuracy, High-Performance In-Memory Computing With High-Resistance Spin-Orbit Torque (SOT) Magnetic Memory

Carletti, F.;Ambrosi, E.;Mannocci, P.;Farronato, M.;Ielmini, D.
2025-01-01

Abstract

In-memory computing (IMC) has emerged as a promising solution for artificial intelligence (AI) accelerators due to the reduced data movement and improved parallelism in the crosspoint array. However, IMC faces several limitations such as the device variations affecting the computing accuracy, the area- and energy-consuming peripheral circuitry, and the time-consuming high-voltage program-verify operations of the nonvolatile memory (NVM). In addition, the relatively large summation currents cause IR drop within the array, leading to further loss of accuracy. To overcome these issues, this work presents 1-bit quantized networks based on spin-orbit-torque magnetoresistive random access-memory (SOT-MRAM) with high resistance-area (RA) product. We provide a detailed statistical characterization of SOT-MRAM arrays and develop quantization-aware training of various neural networks. Our results indicate that SOT-MRAM enables: 1) high inference accuracy, thanks to excellent uniformity; 2) negligible input-dependent IR drop nonlinearity, thanks to high resistance; and 3) high-speed, low-power reconfiguration, thanks to fast device programming. These results support high-RA SOT-MRAM for digital-like and reconfigurable IMC accelerators of edge AI.
2025
Artificial intelligence (AI)
binary neural network (BNN)
deep neural network (DNN)
in-memory computing (IMC)
spin-orbit-torque magnetic random access-memory (SOT-MRAM)
ternary neural network (TNN)
File in questo prodotto:
File Dimensione Formato  
2026_ted_sot.pdf

accesso aperto

: Publisher’s version
Dimensione 2.09 MB
Formato Adobe PDF
2.09 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1304248
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact