RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required!

Andrulis, Tanner; Emer, Joel S.; Sze, Vivienne

doi:10.1145/3579371.3589062

Computer Science > Hardware Architecture

arXiv:2304.07935 (cs)

[Submitted on 17 Apr 2023]

Title:RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required!

Authors:Tanner Andrulis, Joel S. Emer, Vivienne Sze

View PDF

Abstract:Processing-In-Memory (PIM) accelerators have the potential to efficiently run Deep Neural Network (DNN) inference by reducing costly data movement and by using resistive RAM (ReRAM) for efficient analog compute. Unfortunately, overall PIM accelerator efficiency is limited by energy-intensive analog-to-digital converters (ADCs). Furthermore, existing accelerators that reduce ADC cost do so by changing DNN weights or by using low-resolution ADCs that reduce output fidelity. These strategies harm DNN accuracy and/or require costly DNN retraining to compensate.
To address these issues, we propose the RAELLA architecture. RAELLA adapts the architecture to each DNN; it lowers the resolution of computed analog values by encoding weights to produce near-zero analog values, adaptively slicing weights for each DNN layer, and dynamically slicing inputs through speculation and recovery. Low-resolution analog values allow RAELLA to both use efficient low-resolution ADCs and maintain accuracy without retraining, all while computing with fewer ADC converts.
Compared to other low-accuracy-loss PIM accelerators, RAELLA increases energy efficiency by up to 4.9$\times$ and throughput by up to 3.3$\times$. Compared to PIM accelerators that cause accuracy loss and retrain DNNs to recover, RAELLA achieves similar efficiency and throughput without expensive DNN retraining.

Comments:	16 pages; 15 figures; Accepted at ISCA 2023 (the International Symposium on Computer Architecture)
Subjects:	Hardware Architecture (cs.AR)
ACM classes:	C.1.3
Cite as:	arXiv:2304.07935 [cs.AR]
	(or arXiv:2304.07935v1 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2304.07935
Related DOI:	https://doi.org/10.1145/3579371.3589062

Submission history

From: Tanner Andrulis [view email]
[v1] Mon, 17 Apr 2023 01:13:40 UTC (4,083 KB)

Computer Science > Hardware Architecture

Title:RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required!

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:RAELLA: Reforming the Arithmetic for Efficient, Low-Resolution, and Low-Loss Analog PIM: No Retraining Required!

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators