Adversarial Text Normalization

Bitton, Joanna; Pavlova, Maya; Evtimov, Ivan

arXiv:2206.04137 (cs)

[Submitted on 8 Jun 2022]

Abstract: Text-based adversarial attacks are becoming more commonplace and accessible to general internet users. As these attacks proliferate, the need to close the gap in model robustness becomes pressing. While retraining on adversarial data may increase performance, there remains an additional class of character-level attacks on which these models falter. Moreover, retraining a model is time- and resource-intensive, creating a need for a lightweight, reusable defense. In this work, we propose the Adversarial Text Normalizer, a novel method that restores baseline performance on attacked content with low computational overhead. We evaluate the efficacy of the normalizer on two problem areas prone to adversarial attacks: hate speech and natural language inference. We find that text normalization provides a task-agnostic defense against character-level attacks and can be deployed alongside adversarial retraining solutions, which are better suited to semantic alterations.
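
To make the idea of a character-level defense concrete, here is a minimal sketch of a preprocessing step that could run before a classifier. It is not the paper's Adversarial Text Normalizer: the abstract does not describe the implementation, so the function name `normalize`, the substitution table, and the ordering of steps are all assumptions chosen for illustration. It uses only the Python standard library.

```python
import re
import unicodedata

# Hypothetical leetspeak substitutions, for illustration only.
# This table is an assumption, not taken from the paper.
SUBSTITUTIONS = {"0": "o", "1": "i", "3": "e", "4": "a", "5": "s", "@": "a", "$": "s"}

# Common zero-width characters that can be inserted to split tokens.
ZERO_WIDTH = re.compile("[\u200b\u200c\u200d\u2060\ufeff]")

# Any character repeated three or more times in a row.
REPEATS = re.compile(r"(.)\1{2,}")


def _undo_substitutions(match: re.Match) -> str:
    """Map look-alike digits/symbols to letters, but only in tokens that contain a letter."""
    token = match.group(0)
    if any(ch.isalpha() for ch in token):
        return "".join(SUBSTITUTIONS.get(ch, ch) for ch in token)
    return token  # leave pure numbers such as "2022" untouched


def normalize(text: str) -> str:
    """Crude character-level normalization (illustrative sketch, assumptions as noted above)."""
    # Fold compatibility forms (e.g. fullwidth letters) to their plain equivalents.
    text = unicodedata.normalize("NFKC", text)
    # Remove invisible characters.
    text = ZERO_WIDTH.sub("", text)
    # Decompose and drop combining marks (accents stacked on base letters).
    text = "".join(
        ch for ch in unicodedata.normalize("NFD", text) if not unicodedata.combining(ch)
    )
    text = text.lower()
    # Undo look-alike substitutions token by token, preserving whitespace.
    text = re.sub(r"\S+", _undo_substitutions, text)
    # Collapse runs of three or more identical characters down to two.
    return REPEATS.sub(r"\1\1", text)


print(normalize("H4TE sp\u200beech"))  # hate speech
```

A step like this is cheap and model-independent, which is the property the abstract emphasizes. It is also lossy: for example, it does not handle cross-script homoglyphs, and the substitution rule will misfire on tokens that legitimately mix letters and digits.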

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2206.04137 [cs.CL]
	(or arXiv:2206.04137v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2206.04137

Submission history

From: Joanna Bitton [view email]
[v1] Wed, 8 Jun 2022 19:44:03 UTC (4,582 KB)
