Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Zhang, Ruisi; Hidano, Seira; Koushanfar, Farinaz

Computer Science > Computation and Language

arXiv:2209.10505 (cs)

[Submitted on 21 Sep 2022]

Title:Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Authors:Ruisi Zhang, Seira Hidano, Farinaz Koushanfar

View PDF

Abstract:Text classification has become widely used in various natural language processing applications like sentiment analysis. Current applications often use large transformer-based language models to classify input texts. However, there is a lack of systematic study on how much private information can be inverted when publishing models. In this paper, we formulate \emph{Text Revealer} -- the first model inversion attack for text reconstruction against text classification with transformers. Our attacks faithfully reconstruct private texts included in training data with access to the target model. We leverage an external dataset and GPT-2 to generate the target domain-like fluent text, and then perturb its hidden state optimally with the feedback from the target model. Our extensive experiments demonstrate that our attacks are effective for datasets with different text lengths and can reconstruct private texts with accuracy.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2209.10505 [cs.CL]
	(or arXiv:2209.10505v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2209.10505

Submission history

From: Ruisi Zhang [view email]
[v1] Wed, 21 Sep 2022 17:05:12 UTC (144 KB)

Computer Science > Computation and Language

Title:Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators