Minimizing Factual Inconsistency and Hallucination in Large Language Models

I, Muneeswaran; Saxena, Shreya; Prasad, Siva; Prakash, M V Sai; Shankar, Advaith; V, Varun; Vaddina, Vishal; Gopalakrishnan, Saisubramaniam

arXiv:2311.13878 (cs)

[Submitted on 23 Nov 2023]

Authors: Muneeswaran I, Shreya Saxena, Siva Prasad, M V Sai Prakash, Advaith Shankar, Varun V, Vishal Vaddina, Saisubramaniam Gopalakrishnan

Abstract: Large Language Models (LLMs) are widely used in critical fields such as healthcare, education, and finance due to their remarkable proficiency in various language-related tasks. However, LLMs are prone to generating factually incorrect responses, or "hallucinations," which can lead to a loss of credibility and trust among users. To address this issue, we propose a multi-stage framework that first generates rationales, verifies them and refines the incorrect ones, and then uses them as supporting references to generate the answer. The generated rationales make the answer more transparent: together with the references to the context, they show how the model arrived at it. In this paper, we demonstrate the framework's effectiveness in improving the quality of responses to drug-related inquiries in the life sciences industry. Our framework improves on traditional Retrieval Augmented Generation (RAG) by enabling OpenAI GPT-3.5-turbo to be 14-25% more faithful and 16-22% more accurate on two datasets. Furthermore, fine-tuning on samples based on our framework improves the accuracy of smaller open-access LLMs by 33-42% and competes with RAG on commercial models.
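
The three stages named in the abstract (generate rationales, verify and refine them, answer from the verified rationales) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function `answer_with_rationales`, the `Generate` and `Verify` callables, the prompt wording, and the `max_refinements` limit are all hypothetical names and choices introduced here, and the abstract does not specify how verification or refinement is actually performed.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical interfaces; none of these names come from the paper.
Generate = Callable[[str], str]      # prompt -> model text
Verify = Callable[[str, str], bool]  # (rationale, context) -> is it supported?


@dataclass
class Answer:
    text: str
    rationales: List[str]  # the verified rationales the answer was built from


def answer_with_rationales(
    question: str,
    context: str,
    generate: Generate,
    verify: Verify,
    max_refinements: int = 2,  # assumed cap; the abstract gives no number
) -> Answer:
    # Stage 1: ask the model for rationales, one per line.
    raw = generate(
        f"Context:\n{context}\n\nQuestion: {question}\n"
        "List supporting rationales, one per line."
    )
    candidates = [line.strip() for line in raw.splitlines() if line.strip()]

    # Stage 2: verify each rationale against the context; refine failures.
    verified: List[str] = []
    for rationale in candidates:
        attempts = 0
        while not verify(rationale, context) and attempts < max_refinements:
            rationale = generate(
                f"Context:\n{context}\n\n"
                f"Rewrite this rationale so the context supports it:\n{rationale}"
            ).strip()
            attempts += 1
        if verify(rationale, context):
            verified.append(rationale)  # drop rationales that never verify

    # Stage 3: answer using only the verified rationales as references.
    references = "\n".join(f"- {r}" for r in verified)
    text = generate(
        f"Question: {question}\nUse only these rationales:\n{references}\nAnswer:"
    ).strip()
    return Answer(text=text, rationales=verified)
```

Passing the model and the verifier in as plain callables keeps the sketch independent of any particular LLM client, so a retrieval step or a stronger verifier could be substituted without changing the control flow.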

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.13878 [cs.CL]
	(or arXiv:2311.13878v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.13878

Submission history

From: Saisubramaniam Gopalakrishnan
[v1] Thu, 23 Nov 2023 09:58:39 UTC (1,570 KB)
