Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Serban, Iulian Vlad; Klinger, Tim; Tesauro, Gerald; Talamadupula, Kartik; Zhou, Bowen; Bengio, Yoshua; Courville, Aaron

Computer Science > Computation and Language

arXiv:1606.00776 (cs)

[Submitted on 2 Jun 2016 (v1), last revised 14 Jun 2016 (this version, v2)]

Title:Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Authors:Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bowen Zhou, Yoshua Bengio, Aaron Courville

View PDF

Abstract:We introduce the multiresolution recurrent neural network, which extends the sequence-to-sequence framework to model natural language generation as two parallel discrete stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language tokens. There are many ways to estimate or learn the high-level coarse tokens, but we argue that a simple extraction procedure is sufficient to capture a wealth of high-level discourse semantics. Such procedure allows training the multiresolution recurrent neural network by maximizing the exact joint log-likelihood over both sequences. In contrast to the standard log- likelihood objective w.r.t. natural language tokens (word perplexity), optimizing the joint log-likelihood biases the model towards modeling high-level abstractions. We apply the proposed model to the task of dialogue response generation in two challenging domains: the Ubuntu technical support domain, and Twitter conversations. On Ubuntu, the model outperforms competing approaches by a substantial margin, achieving state-of-the-art results according to both automatic evaluation metrics and a human evaluation study. On Twitter, the model appears to generate more relevant and on-topic responses according to automatic evaluation metrics. Finally, our experiments demonstrate that the proposed model is more adept at overcoming the sparsity of natural language and is better able to capture long-term structure.

Comments:	21 pages, 2 figures, 10 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
ACM classes:	I.5.1; I.2.7
Cite as:	arXiv:1606.00776 [cs.CL]
	(or arXiv:1606.00776v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1606.00776

Submission history

From: Iulian Vlad Serban [view email]
[v1] Thu, 2 Jun 2016 17:37:31 UTC (1,749 KB)
[v2] Tue, 14 Jun 2016 02:01:16 UTC (1,750 KB)

Computer Science > Computation and Language

Title:Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators