Certified Robustness against Sparse Adversarial Perturbations via Data Localization

Pal, Ambar; Vidal, René; Sulam, Jeremias

arXiv:2405.14176 (cs)

[Submitted on 23 May 2024]

Abstract: Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers with improved robustness guarantees for $\ell_2$-bounded perturbations. Yet, it is still unclear if this observation holds true for more general metrics. In this work, we extend this theory to $\ell_0$-bounded adversarial perturbations, where the attacker can modify a few pixels of the image but is unrestricted in the magnitude of perturbation, and we show necessary and sufficient conditions for the existence of $\ell_0$-robust classifiers. Theoretical certification approaches in this regime essentially employ voting over a large ensemble of classifiers. Such procedures are combinatorial and expensive or require complicated certification techniques. In contrast, a simple classifier emerges from our theory, dubbed Box-NN, which naturally incorporates the geometry of the problem and improves upon the current state-of-the-art in certified robustness against sparse attacks for the MNIST and Fashion-MNIST datasets.
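
To make the $\ell_0$ threat model in the abstract concrete, the following minimal sketch (not code from the paper, and not an implementation of Box-NN) builds a perturbation that changes only a few pixels by a large amount. The helper names `l0_norm`, `l2_norm`, and `apply_sparse_perturbation`, the all-black 784-pixel image, and the pixel range [0, 1] are assumptions chosen purely for illustration.

```python
def l0_norm(delta):
    """Count the nonzero coordinates of a perturbation (the l0 'norm')."""
    return sum(1 for d in delta if d != 0)


def l2_norm(delta):
    """Euclidean length of a perturbation, for comparison."""
    return sum(d * d for d in delta) ** 0.5


def apply_sparse_perturbation(x, changes, lo=0.0, hi=1.0):
    """Return a copy of x with the pixels listed in `changes` overwritten.

    `changes` maps pixel index -> new value. Values are clipped to the
    assumed valid pixel range [lo, hi]; the attacker is otherwise free
    to pick any magnitude.
    """
    x_adv = list(x)
    for idx, value in changes.items():
        x_adv[idx] = min(hi, max(lo, value))
    return x_adv


# Hypothetical flattened 28x28 image (MNIST-sized), all black.
x = [0.0] * 784

# The attacker flips three pixels to full intensity.
x_adv = apply_sparse_perturbation(x, {10: 1.0, 200: 1.0, 511: 1.0})
delta = [a - b for a, b in zip(x_adv, x)]

print("l0 norm of perturbation:", l0_norm(delta))
print("l2 norm of perturbation:", l2_norm(delta))
```

In this sketch the perturbation has $\ell_0$ norm 3 because exactly three pixels change, while each changed pixel moves by the full assumed range. A certificate against $\ell_0$-bounded attacks would have to hold for every such choice of pixel locations and values within the budget.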

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.14176 [cs.LG]
	(or arXiv:2405.14176v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.14176

Submission history

From: Ambar Pal
[v1] Thu, 23 May 2024 05:02:00 UTC (107 KB)
