RepBERT: Contextualized Text Embeddings for First-Stage Retrieval

Zhan, Jingtao; Mao, Jiaxin; Liu, Yiqun; Zhang, Min; Ma, Shaoping

arXiv:2006.15498 (cs)

[Submitted on 28 Jun 2020 (v1), last revised 20 Jul 2020 (this version, v2)]

Abstract: Although exact term matching between queries and documents is the dominant method for first-stage retrieval, we propose a different approach, called RepBERT, which represents documents and queries with fixed-length contextualized embeddings. The inner products of query and document embeddings are used as relevance scores. On the MS MARCO Passage Ranking task, RepBERT achieves state-of-the-art results among all initial retrieval techniques, and its efficiency is comparable to that of bag-of-words methods.
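
The scoring step described in the abstract can be illustrated with a minimal sketch. It assumes the query and document embeddings have already been computed and have the same fixed length. The toy vectors, the document ids, and the function names `inner_product` and `retrieve` are hypothetical and are not taken from the paper or its code. The sketch scores every document exhaustively, which is only one possible way to run the search.

```python
from typing import Dict, List, Tuple

Vector = List[float]


def inner_product(a: Vector, b: Vector) -> float:
    """Relevance score: inner product of two fixed-length embeddings."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    return sum(x * y for x, y in zip(a, b))


def retrieve(query_emb: Vector,
             doc_embs: Dict[str, Vector],
             k: int) -> List[Tuple[str, float]]:
    """Score every document against the query and return the k best."""
    scored = [(doc_id, inner_product(query_emb, emb))
              for doc_id, emb in doc_embs.items()]
    # Highest inner product first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy 3-dimensional embeddings (made up for illustration only).
doc_embs = {
    "d1": [0.9, 0.1, 0.0],
    "d2": [0.2, 0.8, 0.1],
    "d3": [0.0, 0.3, 0.9],
}
query_emb = [1.0, 0.2, 0.0]

top = retrieve(query_emb, doc_embs, k=2)
for doc_id, score in top:
    print(doc_id, round(score, 2))
```

Because document embeddings do not depend on the query, they can be computed once offline, so only the query has to be encoded at search time.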

Comments:	For corresponding code and data, see this https URL
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2006.15498 [cs.IR]
	(or arXiv:2006.15498v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2006.15498

Submission history

From: Jingtao Zhan
[v1] Sun, 28 Jun 2020 03:46:32 UTC (1,220 KB)
[v2] Mon, 20 Jul 2020 12:51:00 UTC (1,221 KB)
