FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning

Wang, Xinyi; Wieting, John; Clark, Jonathan H.

arXiv:2309.04663 (cs)

[Submitted on 9 Sep 2023 (v1), last revised 12 Sep 2023 (this version, v2)]

Abstract: Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning. Each comes with its own trade-offs in available data, model size, compute cost, ease of use, and final quality, and neither performs well across the board. In this article, we first describe the ICL and fine-tuning paradigms in a way that highlights their natural connections. Based on these connections, we propose a new learning paradigm called FIAT that fuses the best of these paradigms, enabling prompt-engineered instructions and chain-of-thought reasoning with the very largest models while also using similar methods to perform parameter updates on a modestly sized LLM with parameter-efficient tuning. We evaluate FIAT's effectiveness on a variety of multilingual tasks and observe that FIAT performs better than both ICL and fine-tuning at scales ranging from 100 to 10,000 training examples. We hope that FIAT provides a practical way of harnessing the full potential of LLMs without needing to make a hard choice between learning paradigms.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.04663 [cs.CL]
	(or arXiv:2309.04663v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.04663

Submission history

From: Xinyi Wang
[v1] Sat, 9 Sep 2023 02:43:48 UTC (7,644 KB)
[v2] Tue, 12 Sep 2023 14:34:03 UTC (7,645 KB)
