Generating Sequences by Learning to Self-Correct

Welleck, Sean; Lu, Ximing; West, Peter; Brahman, Faeze; Shen, Tianxiao; Khashabi, Daniel; Choi, Yejin

Computer Science > Computation and Language

arXiv:2211.00053 (cs)

[Submitted on 31 Oct 2022]

Title:Generating Sequences by Learning to Self-Correct

Authors:Sean Welleck, Ximing Lu, Peter West, Faeze Brahman, Tianxiao Shen, Daniel Khashabi, Yejin Choi

View PDF

Abstract:Sequence generation applications require satisfying semantic constraints, such as ensuring that programs are correct, using certain keywords, or avoiding undesirable content. Language models, whether fine-tuned or prompted with few-shot demonstrations, frequently violate these constraints, and lack a mechanism to iteratively revise their outputs. Moreover, some powerful language models are of extreme scale or inaccessible, making it inefficient, if not infeasible, to update their parameters for task-specific adaptation. We present Self-Correction, an approach that decouples an imperfect base generator (an off-the-shelf language model or supervised sequence-to-sequence model) from a separate corrector that learns to iteratively correct imperfect generations. To train the corrector, we propose an online training procedure that can use either scalar or natural language feedback on intermediate imperfect generations. We show that Self-Correction improves upon the base generator in three diverse generation tasks - mathematical program synthesis, lexically-constrained generation, and toxicity control - even when the corrector is much smaller than the base generator.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2211.00053 [cs.CL]
	(or arXiv:2211.00053v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.00053

Submission history

From: Sean Welleck [view email]
[v1] Mon, 31 Oct 2022 18:09:51 UTC (2,006 KB)

Computer Science > Computation and Language

Title:Generating Sequences by Learning to Self-Correct

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Generating Sequences by Learning to Self-Correct

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators