Robustness through Data Augmentation Loss Consistency

Huang, Tianjian; Halbe, Shaunak; Sankar, Chinnadhurai; Amini, Pooyan; Kottur, Satwik; Geramifard, Alborz; Razaviyayn, Meisam; Beirami, Ahmad

Computer Science > Machine Learning

arXiv:2110.11205 (cs)

[Submitted on 21 Oct 2021 (v1), last revised 24 Jan 2023 (this version, v3)]

Title:Robustness through Data Augmentation Loss Consistency

Authors:Tianjian Huang, Shaunak Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami

View PDF

Abstract:While deep learning through empirical risk minimization (ERM) has succeeded at achieving human-level performance at a variety of complex tasks, ERM is not robust to distribution shifts or adversarial attacks. Synthetic data augmentation followed by empirical risk minimization (DA-ERM) is a simple and widely used solution to improve robustness in ERM. In addition, consistency regularization can be applied to further improve the robustness of the model by forcing the representation of the original sample and the augmented one to be similar. However, existing consistency regularization methods are not applicable to covariant data augmentation, where the label in the augmented sample is dependent on the augmentation function. For example, dialog state covaries with named entity when we augment data with a new named entity. In this paper, we propose data augmented loss invariant regularization (DAIR), a simple form of consistency regularization that is applied directly at the loss level rather than intermediate features, making it widely applicable to both invariant and covariant data augmentation regardless of network architecture, problem setup, and task. We apply DAIR to real-world learning problems involving covariant data augmentation: robust neural task-oriented dialog state tracking and robust visual question answering. We also apply DAIR to tasks involving invariant data augmentation: robust regression, robust classification against adversarial attacks, and robust ImageNet classification under distribution shift. Our experiments show that DAIR consistently outperforms ERM and DA-ERM with little marginal computational cost and sets new state-of-the-art results in several benchmarks involving covariant data augmentation. Our code of all experiments is available at: this https URL

Comments:	40 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2110.11205 [cs.LG]
	(or arXiv:2110.11205v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2110.11205

Submission history

From: Tianjian Huang [view email]
[v1] Thu, 21 Oct 2021 15:30:40 UTC (5,671 KB)
[v2] Tue, 1 Mar 2022 00:11:52 UTC (980 KB)
[v3] Tue, 24 Jan 2023 10:55:13 UTC (1,778 KB)

Computer Science > Machine Learning

Title:Robustness through Data Augmentation Loss Consistency

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robustness through Data Augmentation Loss Consistency

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators