A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

Liu, Guanxiong; Khalil, Issa; Khreishah, Abdallah; Phan, NhatHai

Computer Science > Cryptography and Security

arXiv:2109.01275 (cs)

[Submitted on 3 Sep 2021]

Title:A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

Authors:Guanxiong Liu, Issa Khalil, Abdallah Khreishah, NhatHai Phan

View PDF

Abstract:In this work, we show how to jointly exploit adversarial perturbation and model poisoning vulnerabilities to practically launch a new stealthy attack, dubbed AdvTrojan. AdvTrojan is stealthy because it can be activated only when: 1) a carefully crafted adversarial perturbation is injected into the input examples during inference, and 2) a Trojan backdoor is implanted during the training process of the model. We leverage adversarial noise in the input space to move Trojan-infected examples across the model decision boundary, making it difficult to detect. The stealthiness behavior of AdvTrojan fools the users into accidentally trust the infected model as a robust classifier against adversarial examples. AdvTrojan can be implemented by only poisoning the training data similar to conventional Trojan backdoor attacks. Our thorough analysis and extensive experiments on several benchmark datasets show that AdvTrojan can bypass existing defenses with a success rate close to 100% in most of our experimental scenarios and can be extended to attack federated learning tasks as well.

Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2109.01275 [cs.CR]
	(or arXiv:2109.01275v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2109.01275

Submission history

From: Guanxiong Liu [view email]
[v1] Fri, 3 Sep 2021 02:18:57 UTC (2,747 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CR

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Guanxiong Liu
Issa Khalil
Abdallah Khreishah
NhatHai Phan

export BibTeX citation

Computer Science > Cryptography and Security

Title:A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:A Synergetic Attack against Neural Network Classifiers combining Backdoor and Adversarial Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators