Integrating Pretrained Language Model for Dialogue Policy Learning

Wang, Hongru; Wang, Huimin; Wang, Zezhong; Wong, Kam-Fai

arXiv:2111.01398 (cs)

[Submitted on 2 Nov 2021]

Abstract: Reinforcement Learning (RL) has shown its potential for training a dialogue policy agent to maximize the accumulated rewards given by users. However, the reward can be very sparse, since it is usually provided only at the end of a dialog session, so reaching an acceptable dialog agent requires an unaffordable number of interactions. In contrast to the many efforts that alternate between optimizing the policy and recovering the reward, which easily get stuck in local optima and suffer from model collapse, we decompose the adversarial training into two steps: 1) we integrate a pre-trained language model as a discriminator to judge whether the current system action is good enough for the last user action (i.e., next action prediction); 2) the discriminator gives an extra local dense reward to guide the agent's exploration. The experimental results demonstrate that our method significantly improves the complete rate (~4.4%) and success rate (~8.0%) of the dialogue system.
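The second step of the abstract, adding a dense per-turn reward to the sparse end-of-session reward, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state how the two rewards are combined, so the additive form and the `weight` parameter are assumptions, and `toy_discriminator` is a hypothetical token-overlap stand-in for the pre-trained language model discriminator.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: scores in [0, 1] how well a system action
# responds to the last user action.
Discriminator = Callable[[str, str], float]


def toy_discriminator(user_action: str, system_action: str) -> float:
    """Stand-in for a pretrained-LM discriminator (an assumption, not the paper's model).

    Returns the fraction of user-action tokens that reappear in the system action.
    """
    user_tokens = set(user_action.lower().split())
    system_tokens = set(system_action.lower().split())
    if not user_tokens:
        return 0.0
    return len(user_tokens & system_tokens) / len(user_tokens)


def shaped_rewards(
    turns: List[Tuple[str, str]],
    terminal_reward: float,
    discriminator: Discriminator,
    weight: float = 0.5,
) -> List[float]:
    """Combine a dense per-turn reward with a sparse end-of-session reward.

    The additive combination and `weight` are illustrative assumptions.
    """
    rewards = [weight * discriminator(user, system) for user, system in turns]
    if rewards:
        # The sparse reward from the user arrives only at the final turn.
        rewards[-1] += terminal_reward
    return rewards


# Each turn is (last user action, current system action), written as plain strings.
turns = [
    ("request hotel area", "inform hotel area north"),
    ("request hotel price", "inform restaurant food"),
]
rewards = shaped_rewards(turns, terminal_reward=1.0, discriminator=toy_discriminator)
```

In this toy run the first turn receives a nonzero reward because the system action addresses the user's request, while the second turn receives only the terminal reward. Without the discriminator term, every turn except the last would receive zero reward.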

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2111.01398 [cs.CL]
	(or arXiv:2111.01398v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2111.01398

Submission history

From: Hongru Wang
[v1] Tue, 2 Nov 2021 07:16:03 UTC (1,016 KB)
