Orchestrating LLMs with Different Personalizations

Zhou, Jin Peng; Luo, Katie Z; Gu, Jingwen; Yuan, Jason; Weinberger, Kilian Q.; Sun, Wen

arXiv:2407.04181 (cs)

[Submitted on 4 Jul 2024]

Abstract: This paper presents a novel approach to aligning large language models (LLMs) with individual human preferences, sometimes referred to as Reinforcement Learning from Personalized Human Feedback (RLPHF). Given stated preferences along multiple dimensions, such as helpfulness, conciseness, or humor, the goal is to produce, without re-training, an LLM that best adheres to this specification. Starting from specialized expert LLMs, each trained for one particular preference dimension, we propose a black-box method that merges their outputs at the per-token level. We train a lightweight Preference Control Model (PCM) that dynamically translates the preference description and current context into next-token prediction weights. By combining the expert models' outputs at the token level, our approach dynamically generates text that optimizes the given preference. Empirical tests show that our method matches or surpasses existing preference merging techniques, providing a scalable, efficient alternative to fine-tuning LLMs for individual personalization.
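
The token-level merging described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: the expert outputs are mixed as a weighted sum of next-token probabilities, the weights are non-negative and sum to one, and decoding is greedy. The function `toy_pcm_weights` is a hypothetical stand-in for the trained PCM, which in the paper conditions on the preference description and the current context; here it simply applies a softmax to hand-picked scores. The vocabulary and expert distributions are made-up toy values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def toy_pcm_weights(preference_scores):
    """Hypothetical stand-in for the PCM (assumption, not the paper's model).

    Maps one score per expert to mixing weights that sum to one.
    """
    return softmax(preference_scores)


def merge_next_token(expert_probs, weights):
    """Weighted sum of per-expert next-token distributions (assumed merge rule)."""
    vocab_size = len(expert_probs[0])
    merged = [0.0] * vocab_size
    for w, probs in zip(weights, expert_probs):
        for i, p in enumerate(probs):
            merged[i] += w * p
    return merged


# Toy vocabulary and expert outputs for a single decoding step (illustrative values).
VOCAB = ["ok", "sure", "haha"]
expert_probs = [
    [0.7, 0.2, 0.1],  # expert tuned for helpfulness
    [0.1, 0.8, 0.1],  # expert tuned for conciseness
    [0.1, 0.1, 0.8],  # expert tuned for humor
]

# The user preference leans toward humor, so the third expert gets the largest weight.
weights = toy_pcm_weights([0.0, 0.0, 2.0])
merged = merge_next_token(expert_probs, weights)

# Greedy choice of the next token from the merged distribution.
next_token = VOCAB[max(range(len(VOCAB)), key=merged.__getitem__)]
print(next_token)
```

Because only the experts' output distributions are needed at each step, a scheme of this shape treats the experts as black boxes, which is consistent with the abstract's description; the actual weight parameterization and merge rule should be taken from the paper itself.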

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.04181 [cs.AI]
	(or arXiv:2407.04181v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2407.04181

Submission history

From: Jin Peng Zhou
[v1] Thu, 4 Jul 2024 22:55:02 UTC (1,330 KB)
