Unraveling the Mechanics of Learning-Based Demonstration Selection for In-Context Learning

Liu, Hui; Wang, Wenya; Sun, Hao; Tian, Chris Xing; Kong, Chenqi; Dong, Xin; Li, Haoliang

arXiv:2406.11890 (cs)

[Submitted on 14 Jun 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Abstract: Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities from few-shot demonstration exemplars. While recent learning-based demonstration selection methods have proven beneficial to ICL by choosing more useful exemplars, their underlying mechanisms are opaque, hindering efforts to address limitations such as high training costs and poor generalization across tasks. These methods generally assume that the selection process captures similarities between the exemplar and the target instance; however, it remains unknown what kinds of similarities are captured and which of them are vital to performing ICL. To investigate this question, we analyze the working mechanisms of learning-based demonstration selection methods and empirically identify two important factors related to similarity measurement: 1) The ability to integrate different levels of task-agnostic text similarities between the inputs of exemplars and test cases enhances generalization across different tasks. 2) Incorporating task-specific labels when measuring the similarities significantly improves performance on each specific task. We validate these two findings through extensive quantitative and qualitative analyses across ten datasets and various LLMs. Based on our findings, we introduce two effective yet simplified exemplar selection methods catering to task-agnostic and task-specific demands, eliminating the costly LLM inference overhead.
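
To make the first finding concrete, the following is a minimal illustrative sketch of similarity-based exemplar selection that blends two levels of task-agnostic text similarity (word overlap and character trigram overlap). It is not the method proposed in the paper: the function names, the Jaccard measure, the equal weights, and the toy data are all assumptions chosen for illustration, and a real system would more likely use learned embeddings.

```python
from typing import List, Set, Tuple


def word_set(text: str) -> Set[str]:
    """Word-level view of a text: lowercased, whitespace-split tokens."""
    return set(text.lower().split())


def char_ngrams(text: str, n: int = 3) -> Set[str]:
    """Character-level view of a text: set of character n-grams."""
    s = " ".join(text.lower().split())  # normalize case and whitespace
    return {s[i:i + n] for i in range(len(s) - n + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def combined_similarity(x: str, y: str,
                        weights: Tuple[float, float] = (0.5, 0.5)) -> float:
    """Weighted blend of word-level and character-level similarity.

    The equal weights are an arbitrary assumption, not a tuned value.
    """
    word_sim = jaccard(word_set(x), word_set(y))
    char_sim = jaccard(char_ngrams(x), char_ngrams(y))
    return weights[0] * word_sim + weights[1] * char_sim


def select_demonstrations(query: str,
                          pool: List[Tuple[str, str]],
                          k: int = 2) -> List[Tuple[str, str]]:
    """Return the k (input, label) exemplars whose inputs best match the query."""
    ranked = sorted(pool,
                    key=lambda ex: combined_similarity(query, ex[0]),
                    reverse=True)
    return ranked[:k]


# Hypothetical toy pool of (input, label) exemplars.
pool = [
    ("the movie was great", "positive"),
    ("stock prices fell sharply", "negative"),
    ("the film was great fun", "positive"),
]
top = select_demonstrations("the movie was great fun", pool, k=2)
```

Note that this sketch compares inputs only. The second finding concerns incorporating task-specific labels into the similarity measure, which this toy example does not attempt.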

Comments:	17 pages, 7 figures and 9 tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2406.11890 [cs.LG]
	(or arXiv:2406.11890v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.11890

Submission history

From: Hui Liu
[v1] Fri, 14 Jun 2024 03:34:02 UTC (6,404 KB)
[v2] Tue, 15 Oct 2024 10:53:55 UTC (7,562 KB)
