Understanding the Detrimental Class-level Effects of Data Augmentation

Kirichenko, Polina; Ibrahim, Mark; Balestriero, Randall; Bouchacourt, Diane; Vedantam, Ramakrishna; Firooz, Hamed; Wilson, Andrew Gordon

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.01764 (cs)

[Submitted on 7 Dec 2023]

Title:Understanding the Detrimental Class-level Effects of Data Augmentation

Authors:Polina Kirichenko, Mark Ibrahim, Randall Balestriero, Diane Bouchacourt, Ramakrishna Vedantam, Hamed Firooz, Andrew Gordon Wilson

View PDF HTML (experimental)

Abstract:Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. There has been little progress in resolving class-level accuracy drops due to a limited understanding of these effects. In this work, we present a framework for understanding how DA interacts with class-level learning dynamics. Using higher-quality multi-label annotations on ImageNet, we systematically categorize the affected classes and find that the majority are inherently ambiguous, co-occur, or involve fine-grained distinctions, while DA controls the model's bias towards one of the closely related classes. While many of the previously reported performance drops are explained by multi-label annotations, our analysis of class confusions reveals other sources of accuracy degradation. We show that simple class-conditional augmentation strategies informed by our framework improve performance on the negatively affected classes.

Comments:	Neural Information Processing Systems (NeurIPS), 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2401.01764 [cs.CV]
	(or arXiv:2401.01764v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.01764

Submission history

From: Polina Kirichenko [view email]
[v1] Thu, 7 Dec 2023 18:37:43 UTC (30,601 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding the Detrimental Class-level Effects of Data Augmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding the Detrimental Class-level Effects of Data Augmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators