RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling

Deng, Jingcheng; Pang, Liang; Shen, Huawei; Cheng, Xueqi

Computer Science > Computation and Language

arXiv:2310.10567 (cs)

[Submitted on 16 Oct 2023 (v1), last revised 23 Oct 2023 (this version, v2)]

Title:RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling

Authors:Jingcheng Deng, Liang Pang, Huawei Shen, Xueqi Cheng

View PDF

Abstract:Retrieval-augmented language models show promise in addressing issues like outdated information and hallucinations in language models (LMs). However, current research faces two main problems: 1) determining what information to retrieve, and 2) effectively combining retrieved information during generation. We argue that valuable retrieved information should not only be related to the current source text but also consider the future target text, given the nature of LMs that model future tokens. Moreover, we propose that aggregation using latent variables derived from a compact latent space is more efficient than utilizing explicit raw text, which is limited by context length and susceptible to noise. Therefore, we introduce RegaVAE, a retrieval-augmented language model built upon the variational auto-encoder (VAE). It encodes the text corpus into a latent space, capturing current and future information from both source and target text. Additionally, we leverage the VAE to initialize the latent space and adopt the probabilistic form of the retrieval generation paradigm by expanding the Gaussian prior distribution into a Gaussian mixture distribution. Theoretical analysis provides an optimizable upper bound for RegaVAE. Experimental results on various datasets demonstrate significant improvements in text generation quality and hallucination removal.

Comments:	Accepted to the Findings of EMNLP 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.10567 [cs.CL]
	(or arXiv:2310.10567v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.10567

Submission history

From: Deng Jingcheng [view email]
[v1] Mon, 16 Oct 2023 16:42:01 UTC (239 KB)
[v2] Mon, 23 Oct 2023 12:16:44 UTC (239 KB)

Computer Science > Computation and Language

Title:RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators