An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Tu, Lifu; Lalwani, Garima; Gella, Spandana; He, He

Computer Science > Computation and Language

arXiv:2007.06778 (cs)

[Submitted on 14 Jul 2020 (v1), last revised 11 Aug 2020 (this version, v3)]

Title:An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Authors:Lifu Tu, Garima Lalwani, Spandana Gella, He He

View PDF

Abstract:Recent work has shown that pre-trained language models such as BERT improve robustness to spurious correlations in the dataset. Intrigued by these results, we find that the key to their success is generalization from a small amount of counterexamples where the spurious correlations do not hold. When such minority examples are scarce, pre-trained models perform as poorly as models trained from scratch. In the case of extreme minority, we propose to use multi-task learning (MTL) to improve generalization. Our experiments on natural language inference and paraphrase identification show that MTL with the right auxiliary tasks significantly improves performance on challenging examples without hurting the in-distribution performance. Further, we show that the gain from MTL mainly comes from improved generalization from the minority examples. Our results highlight the importance of data diversity for overcoming spurious correlations.

Comments:	Accepted to TACL 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2007.06778 [cs.CL]
	(or arXiv:2007.06778v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2007.06778

Submission history

From: Lifu Tu [view email]
[v1] Tue, 14 Jul 2020 02:34:59 UTC (510 KB)
[v2] Mon, 10 Aug 2020 16:18:25 UTC (512 KB)
[v3] Tue, 11 Aug 2020 15:51:37 UTC (512 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-07

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Lifu Tu
Spandana Gella
He He

export BibTeX citation

Computer Science > Computation and Language

Title:An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators