Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks

Tiedemann, Jörg; Scherrer, Yves

Computer Science > Computation and Language

arXiv:1808.06826 (cs)

[Submitted on 21 Aug 2018 (v1), last revised 3 May 2019 (this version, v2)]

Title:Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks

Authors:Jörg Tiedemann, Yves Scherrer

View PDF

Abstract:In this paper, we investigate whether multilingual neural translation models learn stronger semantic abstractions of sentences than bilingual ones. We test this hypotheses by measuring the perplexity of such models when applied to paraphrases of the source language. The intuition is that an encoder produces better representations if a decoder is capable of recognizing synonymous sentences in the same language even though the model is never trained for that task. In our setup, we add 16 different auxiliary languages to a bidirectional bilingual baseline model (English-French) and test it with in-domain and out-of-domain paraphrases in English. The results show that the perplexity is significantly reduced in each of the cases, indicating that meaning can be grounded in translation. This is further supported by a study on paraphrase generation that we also include at the end of the paper.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1808.06826 [cs.CL]
	(or arXiv:1808.06826v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1808.06826

Submission history

From: Jörg Tiedemann [view email]
[v1] Tue, 21 Aug 2018 10:07:18 UTC (150 KB)
[v2] Fri, 3 May 2019 09:06:57 UTC (39 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jörg Tiedemann
Yves Scherrer

export BibTeX citation

Computer Science > Computation and Language

Title:Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators