Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Campos, Daniel; Magnani, Alessandro; Zhai, ChengXiang

Computer Science > Computation and Language

arXiv:2304.01016 (cs)

[Submitted on 31 Mar 2023 (v1), last revised 1 Jun 2023 (this version, v3)]

Title:Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Authors:Daniel Campos, Alessandro Magnani, ChengXiang Zhai

View PDF

Abstract:In this paper, we consider the problem of improving the inference latency of language model-based dense retrieval systems by introducing structural compression and model size asymmetry between the context and query encoders. First, we investigate the impact of pre and post-training compression on the MSMARCO, Natural Questions, TriviaQA, SQUAD, and SCIFACT, finding that asymmetry in the dual encoders in dense retrieval can lead to improved inference efficiency. Knowing this, we introduce Kullback Leibler Alignment of Embeddings (KALE), an efficient and accurate method for increasing the inference efficiency of dense retrieval methods by pruning and aligning the query encoder after training. Specifically, KALE extends traditional Knowledge Distillation after bi-encoder training, allowing for effective query encoder compression without full retraining or index generation. Using KALE and asymmetric training, we can generate models which exceed the performance of DistilBERT despite having 3x faster inference.

Comments:	SustaiNLP2023 @ ACL 2023, 8 pages, 4 figures, 30 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2304.01016 [cs.CL]
	(or arXiv:2304.01016v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.01016

Submission history

From: Daniel Campos [view email]
[v1] Fri, 31 Mar 2023 15:44:13 UTC (7,232 KB)
[v2] Mon, 17 Apr 2023 18:00:25 UTC (7,237 KB)
[v3] Thu, 1 Jun 2023 22:08:03 UTC (7,238 KB)

Computer Science > Computation and Language

Title:Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators