Power-Softmax: Towards Secure LLM Inference over Encrypted Data

Zimerman, Itamar; Adir, Allon; Aharoni, Ehud; Avitan, Matan; Baruch, Moran; Drucker, Nir; Lerner, Jenny; Masalha, Ramy; Meiri, Reut; Soceanu, Omri

Computer Science > Machine Learning

arXiv:2410.09457 (cs)

[Submitted on 12 Oct 2024]

Title:Power-Softmax: Towards Secure LLM Inference over Encrypted Data

Authors:Itamar Zimerman, Allon Adir, Ehud Aharoni, Matan Avitan, Moran Baruch, Nir Drucker, Jenny Lerner, Ramy Masalha, Reut Meiri, Omri Soceanu

View PDF HTML (experimental)

Abstract:Modern cryptographic methods for implementing privacy-preserving LLMs such as Homomorphic Encryption (HE) require the LLMs to have a polynomial form. Forming such a representation is challenging because Transformers include non-polynomial components, such as Softmax and layer normalization. Previous approaches have either directly approximated pre-trained models with large-degree polynomials, which are less efficient over HE, or replaced non-polynomial components with easier-to-approximate primitives before training, e.g., Softmax with pointwise attention. The latter approach might introduce scalability challenges.
We present a new HE-friendly variant of self-attention that offers a stable form for training and is easy to approximate with polynomials for secure inference. Our work introduces the first polynomial LLMs with 32 layers and over a billion parameters, exceeding the size of previous models by more than tenfold. The resulting models demonstrate reasoning and in-context learning (ICL) capabilities comparable to standard transformers of the same size, representing a breakthrough in the field. Finally, we provide a detailed latency breakdown for each computation over encrypted data, paving the way for further optimization, and explore the differences in inductive bias between transformers relying on our HE-friendly variant and standard transformers. Our code is attached as a supplement.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
ACM classes:	F.2.2; I.2.7
Cite as:	arXiv:2410.09457 [cs.LG]
	(or arXiv:2410.09457v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.09457

Submission history

From: Itamar Zimerman [view email]
[v1] Sat, 12 Oct 2024 09:32:42 UTC (10,060 KB)

Computer Science > Machine Learning

Title:Power-Softmax: Towards Secure LLM Inference over Encrypted Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Power-Softmax: Towards Secure LLM Inference over Encrypted Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators