Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

Khashabi, Daniel; Lyu, Shane; Min, Sewon; Qin, Lianhui; Richardson, Kyle; Welleck, Sean; Hajishirzi, Hannaneh; Khot, Tushar; Sabharwal, Ashish; Singh, Sameer; Choi, Yejin

arXiv:2112.08348 (cs)

[Submitted on 15 Dec 2021 (v1), last revised 4 May 2022 (this version, v2)]

Abstract: Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of extracting a discrete (textual) interpretation of continuous prompts that is faithful to the problem they solve. In practice, we observe a "wayward" behavior between the task solved by continuous prompts and their nearest-neighbor discrete projections: we can find continuous prompts that solve a task while being projected to an arbitrary text (e.g., the definition of a different or even a contradictory task), while being within a very small (2%) margin of the best continuous prompt of the same size for the task. We provide intuitions behind this odd and surprising behavior, as well as extensive empirical analyses quantifying the effect of various parameters. For instance, for larger model sizes we observe higher waywardness, i.e., we can find prompts that more closely map to any arbitrary text with a smaller drop in accuracy. These findings have important implications relating to the difficulty of faithfully interpreting continuous prompts and their generalization across models and tasks, providing guidance for future progress in prompting language models.
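
The nearest-neighbor discrete projection mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `nearest_neighbor_projection`, the four-word toy vocabulary, the 2-dimensional embeddings, and the choice of Euclidean distance are all assumptions made here for illustration. Each continuous prompt vector is mapped to the vocabulary token whose embedding lies closest to it.

```python
import numpy as np


def nearest_neighbor_projection(prompt, vocab_embeddings, vocab):
    """Map each continuous prompt vector to its closest vocabulary token.

    prompt:           array of shape (prompt_len, dim)
    vocab_embeddings: array of shape (vocab_size, dim)
    vocab:            list of vocab_size token strings
    Distance is squared Euclidean (an assumption of this sketch).
    """
    # Broadcast to pairwise differences of shape (prompt_len, vocab_size, dim).
    diffs = prompt[:, None, :] - vocab_embeddings[None, :, :]
    # Squared distances of shape (prompt_len, vocab_size).
    dists = np.sum(diffs ** 2, axis=-1)
    # Index of the closest vocabulary entry for each prompt position.
    ids = np.argmin(dists, axis=1)
    return [vocab[i] for i in ids]


# Hypothetical toy vocabulary and embeddings, not taken from any real model.
vocab = ["classify", "sentiment", "translate", "French"]
vocab_embeddings = np.array(
    [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
)

# A hypothetical two-vector continuous prompt that sits near, but not on,
# the embeddings of "classify" and "sentiment".
prompt = np.array([[0.9, 0.2], [0.1, 0.8]])

tokens = nearest_neighbor_projection(prompt, vocab_embeddings, vocab)
print(tokens)
```

In this toy setting the projection only reports which tokens are nearest; it says nothing about what task the continuous vectors actually solve, which is the gap the abstract calls waywardness.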

Comments:	NAACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2112.08348 [cs.CL]
	(or arXiv:2112.08348v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2112.08348

Submission history

From: Daniel Khashabi
[v1] Wed, 15 Dec 2021 18:55:05 UTC (4,284 KB)
[v2] Wed, 4 May 2022 04:28:12 UTC (4,382 KB)
