On-The-Fly Information Retrieval Augmentation for Language Models

Wang, Hai; McAllester, David

Computer Science > Computation and Language

arXiv:2007.01528 (cs)

[Submitted on 3 Jul 2020]

Title:On-The-Fly Information Retrieval Augmentation for Language Models

Authors:Hai Wang, David McAllester

View PDF

Abstract:Here we experiment with the use of information retrieval as an augmentation for pre-trained language models. The text corpus used in information retrieval can be viewed as form of episodic memory which grows over time. By augmenting GPT 2.0 with information retrieval we achieve a zero shot 15% relative reduction in perplexity on Gigaword corpus without any re-training. We also validate our IR augmentation on an event co-reference task.

Comments:	ACL 2020 NUSE Workshop
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2007.01528 [cs.CL]
	(or arXiv:2007.01528v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2007.01528

Submission history

From: Hai Wang [view email]
[v1] Fri, 3 Jul 2020 07:31:14 UTC (126 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hai Wang
David McAllester

export BibTeX citation

Computer Science > Computation and Language

Title:On-The-Fly Information Retrieval Augmentation for Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On-The-Fly Information Retrieval Augmentation for Language Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators