DOI: 10.1145/3649329.3656538
research-article

FeBiM: Efficient and Compact Bayesian Inference Engine Empowered with Ferroelectric In-Memory Computing

Published: 07 November 2024

Abstract

In scenarios with limited training data or where explainability is crucial, conventional neural network-based machine learning models often face challenges. In contrast, Bayesian inference-based algorithms excel in providing interpretable predictions and reliable uncertainty estimation in these scenarios. While many state-of-the-art in-memory computing (IMC) architectures leverage emerging non-volatile memory (NVM) technologies to offer unparalleled computing capacity and energy efficiency for neural network workloads, their application in Bayesian inference is limited. This is because the core operations in Bayesian inference, i.e., cumulative multiplications of prior and likelihood probabilities, differ significantly from the multiplication-accumulation (MAC) operations common in neural networks, rendering them generally unsuitable for direct implementation in most existing IMC designs. In this paper, we propose FeBiM, an efficient and compact Bayesian inference engine powered by multi-bit ferroelectric field-effect transistor (FeFET)-based IMC. FeBiM effectively encodes the trained probabilities of a Bayesian inference model within a compact FeFET-based crossbar. It maps quantized logarithmic probabilities to discrete FeFET states. As a result, the accumulated outputs of the crossbar naturally represent the posterior probabilities, i.e., the Bayesian inference model's output given a set of observations. This approach enables efficient in-memory Bayesian inference without the need for additional calculation circuitry. As the first FeFET-based in-memory Bayesian inference engine, FeBiM achieves an impressive storage density of 26.32 Mb/mm² and a computing efficiency of 581.40 TOPS/W in a representative Bayesian classification task. These results demonstrate 10.7×/43.4× improvement in compactness/efficiency compared to the state-of-the-art hardware implementation of Bayesian inference.
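
The idea of storing quantized logarithmic probabilities so that accumulation replaces multiplication can be illustrated in software. The following is a minimal sketch, not the FeBiM design: it assumes a naive Bayes classifier, a uniform quantization of ln(p) into 8 integer levels, and a clipping floor of 1e-3. The function names, the example model, and all numbers are hypothetical and do not come from the paper.

```python
import math


def quantize_log_prob(p, levels=8, floor=1e-3):
    """Map probability p to an integer level in [0, levels - 1].

    Assumption: ln(p) is quantized uniformly over [ln(floor), 0] and
    probabilities below `floor` are clipped. This is an illustrative
    mapping, not the one used in the paper.
    """
    lo = math.log(floor)
    lp = math.log(max(p, floor))
    return round((lp - lo) / (-lo) * (levels - 1))


def classify(priors, likelihoods, observation, levels=8, floor=1e-3):
    """Return (best_class, scores) for a naive Bayes model.

    priors:      {class: P(class)}
    likelihoods: {class: [{value: P(value | class)}, ... one per feature]}
    observation: one observed value per feature
    """
    scores = {}
    for cls, prior in priors.items():
        # Each quantized level stands in for one stored cell state; the
        # running sum stands in for the accumulated output of one class.
        total = quantize_log_prob(prior, levels, floor)
        for feature_table, value in zip(likelihoods[cls], observation):
            total += quantize_log_prob(feature_table[value], levels, floor)
        scores[cls] = total
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical two-class model with two binary features (made-up numbers).
PRIORS = {"spam": 0.4, "ham": 0.6}
LIKELIHOODS = {
    "spam": [{1: 0.8, 0: 0.2}, {1: 0.7, 0: 0.3}],
    "ham": [{1: 0.1, 0: 0.9}, {1: 0.05, 0: 0.95}],
}

if __name__ == "__main__":
    for obs in [(1, 1), (0, 0)]:
        print(obs, classify(PRIORS, LIKELIHOODS, obs))
```

Because the logarithm is monotonic, the class with the largest sum of log-probability levels is also the class with the largest product of prior and likelihoods, up to quantization error. Coarser quantization (fewer levels) can merge nearby probabilities into the same level and change the winner when classes are close.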

Published In

DAC '24: Proceedings of the 61st ACM/IEEE Design Automation Conference
June 2024
2159 pages
ISBN:9798400706011
DOI:10.1145/3649329
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States

Conference

DAC '24: 61st ACM/IEEE Design Automation Conference
June 23 - 27, 2024
San Francisco, CA, USA

Acceptance Rates

Overall Acceptance Rate 1,770 of 5,499 submissions, 32%