Understanding Hard Negatives in Noise Contrastive Estimation

Zhang, Wenzheng; Stratos, Karl

arXiv:2104.06245 (cs)

[Submitted on 13 Apr 2021]

Abstract: The choice of negative examples is important in noise contrastive estimation. Recent works find that hard negatives (the highest-scoring incorrect examples under the model) are effective in practice, but they are used without a formal justification. We develop analytical tools to understand the role of hard negatives. Specifically, we view the contrastive loss as a biased estimator of the gradient of the cross-entropy loss, and show both theoretically and empirically that setting the negative distribution to be the model distribution results in bias reduction. We also derive a general form of the score function that unifies various architectures used in text retrieval. By combining hard negatives with appropriate score functions, we obtain strong results on the challenging task of zero-shot entity linking.
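
To make the relationship between the contrastive loss and the cross-entropy loss concrete, here is a minimal sketch. It is not the authors' implementation. It assumes a single input, a toy list of model scores over 1000 candidates, and a softmax-style contrastive loss restricted to the gold candidate plus K negatives. All function names (`cross_entropy_loss`, `contrastive_loss`, `hard_negatives`, `random_negatives`) are hypothetical. The abstract's claim concerns bias in the gradient; this sketch only compares loss values, which is a simpler, related quantity.

```python
import math
import random


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v)))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def cross_entropy_loss(scores, gold):
    """Full softmax cross-entropy over every candidate."""
    return log_sum_exp(scores) - scores[gold]


def contrastive_loss(scores, gold, negatives):
    """Softmax loss restricted to the gold candidate plus the chosen negatives."""
    subset = [scores[gold]] + [scores[i] for i in negatives]
    return log_sum_exp(subset) - scores[gold]


def hard_negatives(scores, gold, k):
    """Indices of the k highest-scoring incorrect candidates."""
    wrong = [i for i in range(len(scores)) if i != gold]
    return sorted(wrong, key=lambda i: scores[i], reverse=True)[:k]


def random_negatives(scores, gold, k, rng):
    """Indices of k incorrect candidates drawn uniformly without replacement."""
    wrong = [i for i in range(len(scores)) if i != gold]
    return rng.sample(wrong, k)


rng = random.Random(0)
scores = [rng.gauss(0.0, 2.0) for _ in range(1000)]  # toy scores (assumption)
gold, k = 0, 8  # assumed gold index and number of negatives

full = cross_entropy_loss(scores, gold)
hard = contrastive_loss(scores, gold, hard_negatives(scores, gold, k))
rand = contrastive_loss(scores, gold, random_negatives(scores, gold, k, rng))
print(f"cross-entropy={full:.3f} hard={hard:.3f} random={rand:.3f}")
```

Because the restricted denominator sums over a subset of the candidates, any such contrastive loss is at most the full cross-entropy loss. Among all choices of K negatives, the K highest-scoring ones give the largest value, so the hard-negative loss is never smaller than the one computed from a uniform random sample.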

Comments:	NAACL 2021
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2104.06245 [cs.CL]
	(or arXiv:2104.06245v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.06245

Submission history

From: Karl Stratos
[v1] Tue, 13 Apr 2021 14:42:41 UTC (169 KB)
