Semantically Diverse Language Generation for Uncertainty Estimation in Language Models

Aichberger, Lukas; Schweighofer, Kajetan; Ielanskyi, Mykyta; Hochreiter, Sepp

arXiv:2406.04306 (cs)

[Submitted on 6 Jun 2024]

Abstract: Large language models (LLMs) can suffer from hallucinations when generating text. These hallucinations impede various applications in society and industry by making LLMs untrustworthy. Current LLMs generate text in an autoregressive fashion by predicting and appending text tokens. When an LLM is uncertain about the semantic meaning of the next tokens to generate, it is likely to start hallucinating. Thus, it has been suggested that hallucinations stem from predictive uncertainty. We introduce Semantically Diverse Language Generation (SDLG) to quantify predictive uncertainty in LLMs. SDLG steers the LLM to generate semantically diverse yet likely alternatives for an initially generated text. This approach provides a precise measure of aleatoric semantic uncertainty, detecting whether the initial text is likely to be hallucinated. Experiments on question-answering tasks demonstrate that SDLG consistently outperforms existing methods while being the most computationally efficient, setting a new standard for uncertainty estimation in LLMs.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.04306 [cs.LG]
	(or arXiv:2406.04306v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.04306

Submission history

From: Lukas Aichberger
[v1] Thu, 6 Jun 2024 17:53:34 UTC (1,697 KB)
