Calibrate your listeners! Robust communication-based training for pragmatic speakers

Wang, Rose E.; White, Julia; Mu, Jesse; Goodman, Noah D.

arXiv:2110.05422 (cs)

[Submitted on 11 Oct 2021]

Abstract: To be good conversational partners, natural language processing (NLP) systems should be trained to produce contextually useful utterances. Prior work has investigated training NLP systems with communication-based objectives, where a neural listener stands in as a communication partner. However, these systems commonly suffer from semantic drift, where the learned language diverges radically from natural language. We propose a method that uses a population of neural listeners to regularize speaker training. We first show that language drift originates from the poor uncertainty calibration of a neural listener, which makes high-certainty predictions on novel sentences. We explore ensemble- and dropout-based populations of listeners and find that the former results in better uncertainty quantification. We evaluate both population-based objectives on reference games, and show that the ensemble method, with its better calibration, enables the speaker to generate pragmatic utterances while scaling to a large vocabulary and generalizing to new games and listeners.

Comments:	Findings of EMNLP 2021. Code: this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2110.05422 [cs.CL]
	(or arXiv:2110.05422v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2110.05422

Submission history

From: Rose Wang [view email]
[v1] Mon, 11 Oct 2021 17:07:38 UTC (327 KB)
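
The abstract's point about calibration can be illustrated with a toy sketch. This is not the authors' implementation: the function names and the logit values below are hypothetical, and the only technique shown is averaging the predictive distributions of several listeners. The idea is that a single listener may be highly confident on a novel utterance, while an ensemble whose members disagree yields a flatter, higher-entropy prediction.

```python
import math


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; higher means less certain."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def ensemble_predict(per_listener_logits):
    """Average the listeners' distributions over candidate targets."""
    dists = [softmax(logits) for logits in per_listener_logits]
    n = len(dists)
    return [sum(d[i] for d in dists) / n for i in range(len(dists[0]))]


# Hypothetical logits from three listeners over three candidate targets.
# On a familiar utterance the listeners agree on target 0.
familiar = [[4.0, 0.0, 0.0], [3.5, 0.2, 0.1], [4.2, -0.1, 0.3]]
# On a novel utterance each listener is confident, but about a different target.
novel = [[4.0, 0.0, 0.0], [0.0, 4.0, 0.0], [0.0, 0.0, 4.0]]

familiar_ensemble = ensemble_predict(familiar)
novel_ensemble = ensemble_predict(novel)
single_listener_entropy = max(entropy(softmax(l)) for l in novel)
```

In this toy setup the ensemble stays confident on the familiar utterance, while on the novel one its entropy exceeds that of any individual listener. A speaker rewarded by the ensemble would therefore gain less from utterances that only one listener happens to accept.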
