DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks

Yavuz, Semih; Rastogi, Abhinav; Chao, Guan-Lin; Hakkani-Tur, Dilek

Computer Science > Computation and Language

arXiv:1908.10731 (cs)

[Submitted on 28 Aug 2019]

Title:DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks

Authors:Semih Yavuz, Abhinav Rastogi, Guan-Lin Chao, Dilek Hakkani-Tur

View PDF

Abstract:Recent advances in neural sequence-to-sequence models have led to promising results for several language generation-based tasks, including dialogue response generation, summarization, and machine translation. However, these models are known to have several problems, especially in the context of chit-chat based dialogue systems: they tend to generate short and dull responses that are often too generic. Furthermore, these models do not ground conversational responses on knowledge and facts, resulting in turns that are not accurate, informative and engaging for the users. In this paper, we propose and experiment with a series of response generation models that aim to serve in the general scenario where in addition to the dialogue context, relevant unstructured external knowledge in the form of text is also assumed to be available for models to harness. Our proposed approach extends pointer-generator networks (See et al., 2017) by allowing the decoder to hierarchically attend and copy from external knowledge in addition to the dialogue context. We empirically show the effectiveness of the proposed model compared to several baselines including (Ghazvininejad et al., 2018; Zhang et al., 2018) through both automatic evaluation metrics and human evaluation on CONVAI2 dataset.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1908.10731 [cs.CL]
	(or arXiv:1908.10731v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1908.10731

Submission history

From: Semih Yavuz [view email]
[v1] Wed, 28 Aug 2019 14:03:44 UTC (268 KB)

Computer Science > Computation and Language

Title:DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:DeepCopy: Grounded Response Generation with Hierarchical Pointer Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators