Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

Bhandari, Neel; Chen, Pin-Yu

doi:10.1109/ICASSP49357.2023.10094630

Computer Science > Computation and Language

arXiv:2307.12520 (cs)

[Submitted on 24 Jul 2023]

Title:Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

Authors:Neel Bhandari, Pin-Yu Chen

View PDF

Abstract:Language Models today provide a high accuracy across a large number of downstream tasks. However, they remain susceptible to adversarial attacks, particularly against those where the adversarial examples maintain considerable similarity to the original text. Given the multilingual nature of text, the effectiveness of adversarial examples across translations and how machine translations can improve the robustness of adversarial examples remain largely unexplored. In this paper, we present a comprehensive study on the robustness of current text adversarial attacks to round-trip translation. We demonstrate that 6 state-of-the-art text-based adversarial attacks do not maintain their efficacy after round-trip translation. Furthermore, we introduce an intervention-based solution to this problem, by integrating Machine Translation into the process of adversarial example generation and demonstrating increased robustness to round-trip translation. Our results indicate that finding adversarial examples robust to translation can help identify the insufficiency of language models that is common across languages, and motivate further research into multilingual adversarial attacks.

Comments:	Published at International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2307.12520 [cs.CL]
	(or arXiv:2307.12520v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2307.12520
Related DOI:	https://doi.org/10.1109/ICASSP49357.2023.10094630

Submission history

From: Neel Bhandari [view email]
[v1] Mon, 24 Jul 2023 04:29:43 UTC (989 KB)

Computer Science > Computation and Language

Title:Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators