BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Feng, Yu; Zhou, Ben; Lin, Weidong; Roth, Dan

arXiv:2404.12494 (cs)

[Submitted on 18 Apr 2024]

Abstract: Large language models primarily rely on inductive reasoning for decision making. This results in unreliable decisions when applied to real-world tasks that often present incomplete contexts and conditions. Thus, accurate probability estimation and appropriate interpretations are required to enhance decision-making reliability. In this paper, we propose a Bayesian inference framework called BIRD for large language models. BIRD provides controllable and interpretable probability estimation for model decisions, based on abductive factors, LLM entailment, as well as learnable deductive Bayesian modeling. Experiments show that BIRD produces probability estimations that align with human judgments over 65% of the time using open-sourced Llama models, outperforming the state-of-the-art GPT-4 by 35%. We also show that BIRD can be directly used for trustworthy decision making on many real-world applications.

Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2404.12494 [cs.CL]
(or arXiv:2404.12494v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2404.12494

Submission history

From: Yu Feng
[v1] Thu, 18 Apr 2024 20:17:23 UTC (6,470 KB)
