Prompt-Based Length Controlled Generation with Multiple Control Types

Jie, Renlong; Meng, Xiaojun; Shang, Lifeng; Jiang, Xin; Liu, Qun

Computer Science > Computation and Language

arXiv:2406.10278 (cs)

[Submitted on 12 Jun 2024]

Title:Prompt-Based Length Controlled Generation with Multiple Control Types

Authors:Renlong Jie, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu

View PDF HTML (experimental)

Abstract:Large language models (LLMs) have attracted great attention given their strong performance on a wide range of NLP tasks. In practice, users often expect generated texts to fall within a specific length range, making length controlled generation an important topic, especially for GPT-style models. Existing length control methods mostly focus on a simple control type of "equal to" a target length. Different from them, we propose a prompt-based method to achieve length controlled generation under different control types with high accuracy. In particular, we adopt reinforcement learning (RL) and sample filtering with the reward signal given by rule-based reward models, which enhances the length control ability of models by rewarding outputs that follow certain control instructions. In addition, we introduce a standard prompt extractor to parse arbitrary users' input into standard control instructions. Experiments show that our method significantly improves the accuracy of prompt-based length control on popular summarization datasets like CNNDM and NYT under multiple control types. Moreover, both the standard prompt extractor and RL-tuned model show strong generalization to unseen control prompt templates.

Comments:	Accepted by ACL 2024 findings. arXiv admin note: text overlap with arXiv:2308.12030
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.10278 [cs.CL]
	(or arXiv:2406.10278v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2406.10278

Submission history

From: Renlong Jie [view email]
[v1] Wed, 12 Jun 2024 01:49:54 UTC (7,666 KB)

Computer Science > Computation and Language

Title:Prompt-Based Length Controlled Generation with Multiple Control Types

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Prompt-Based Length Controlled Generation with Multiple Control Types

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators