Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability

Chung, Tsz Ting; Cui, Leyang; Liu, Lemao; Huang, Xinting; Shi, Shuming; Yeung, Dit-Yan


arXiv:2410.11786 (cs)

[Submitted on 15 Oct 2024 (v1), last revised 21 Oct 2024 (this version, v2)]




Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in a wide range of natural language processing tasks when leveraging in-context learning. To mitigate the additional computational and financial costs associated with in-context learning, several prompt compression methods have been proposed to compress the in-context learning prompts. Despite their success, these methods face challenges with transferability due to model-specific compression, or rely on external training data, such as GPT-4. In this paper, we investigate the ability of LLMs to develop a unified compression method that discretizes uninformative tokens, utilizing a self-supervised pre-training technique. By introducing a small number of parameters during the continual pre-training, the proposed Selection-p produces a probability for each input token, indicating whether to preserve or discard it. Experiments show Selection-p achieves state-of-the-art performance across numerous classification tasks, achieving compression rates of up to 10 times while experiencing only a marginal 0.8% decrease in performance. Moreover, it exhibits superior transferability to different models compared to prior work. Additionally, we further analyze how Selection-p helps maintain performance on in-context learning with long contexts.
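
Illustrative sketch (not from the paper): the abstract says Selection-p assigns each input token a probability that indicates whether to preserve or discard it, and reports compression rates of up to 10 times. The Python below shows one way such per-token probabilities could be turned into a shorter prompt. The function name `compress_tokens`, the top-k selection rule, and the example probabilities are all assumptions made for illustration; the abstract does not state how the authors threshold or rank tokens.

```python
import math


def compress_tokens(tokens, keep_probs, compression_rate=10.0):
    """Keep the highest-probability tokens, preserving their original order.

    Hypothetical helper: `keep_probs` stands in for the per-token
    probabilities a model like Selection-p would produce. The top-k rule
    is an assumption, not the paper's stated procedure.
    """
    if len(tokens) != len(keep_probs):
        raise ValueError("tokens and keep_probs must have the same length")
    if compression_rate < 1:
        raise ValueError("compression_rate must be >= 1")
    if not tokens:
        return []
    # Token budget implied by the target rate; always keep at least one token.
    budget = max(1, math.ceil(len(tokens) / compression_rate))
    # Rank positions by probability, breaking ties in favor of earlier tokens.
    ranked = sorted(range(len(tokens)), key=lambda i: (-keep_probs[i], i))
    # Restore original order so the compressed prompt still reads left to right.
    kept = sorted(ranked[:budget])
    return [tokens[i] for i in kept]


# Made-up probabilities for a toy in-context example.
tokens = ["Review", ":", "the", "movie", "was", "truly", "great", ".", "Sentiment", ":"]
probs = [0.30, 0.10, 0.05, 0.60, 0.08, 0.20, 0.95, 0.02, 0.90, 0.40]
compressed = compress_tokens(tokens, probs, compression_rate=3.0)
print(compressed)
```

Because the output is a subsequence of the original tokens rather than a learned soft prompt, a compressed prompt of this kind can be passed to any downstream model as ordinary text, which is in keeping with the transferability the abstract emphasizes.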

Comments:	14 pages, 5 figures, 10 tables, EMNLP 2024 Findings
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2410.11786 [cs.CL]
	(or arXiv:2410.11786v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.11786

Submission history

From: Tsz Ting Chung
[v1] Tue, 15 Oct 2024 17:05:25 UTC (419 KB)
[v2] Mon, 21 Oct 2024 13:11:44 UTC (419 KB)
