Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

Mendez, Jorge A.; Geramifard, Alborz; Ghavamzadeh, Mohammad; Liu, Bing

Computer Science > Computation and Language

arXiv:2207.00468 (cs)

[Submitted on 1 Jul 2022]

Title:Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

Authors:Jorge A. Mendez, Alborz Geramifard, Mohammad Ghavamzadeh, Bing Liu

View PDF

Abstract:Learning task-oriented dialog policies via reinforcement learning typically requires large amounts of interaction with users, which in practice renders such methods unusable for real-world applications. In order to reduce the data requirements, we propose to leverage data from across different dialog domains, thereby reducing the amount of data required from each given domain. In particular, we propose to learn domain-agnostic action embeddings, which capture general-purpose structure that informs the system how to act given the current dialog context, and are then specialized to a specific domain. We show how this approach is capable of learning with significantly less interaction with users, with a reduction of 35% in the number of dialogs required to learn, and to a higher level of proficiency than training separate policies for each domain on a set of simulated domains.

Comments:	Presented in the Conversational AI Workshop, NeurIPS 2019
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2207.00468 [cs.CL]
	(or arXiv:2207.00468v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2207.00468

Submission history

From: Jorge A Mendez [view email]
[v1] Fri, 1 Jul 2022 14:49:05 UTC (73 KB)

Computer Science > Computation and Language

Title:Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators