Translating Pro-Drop Languages with Reconstruction Models

Wang, Longyue; Tu, Zhaopeng; Shi, Shuming; Zhang, Tong; Graham, Yvette; Liu, Qun

Computer Science > Computation and Language

arXiv:1801.03257 (cs)

[Submitted on 10 Jan 2018]

Title:Translating Pro-Drop Languages with Reconstruction Models

Authors:Longyue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu

View PDF

Abstract:Pronouns are frequently omitted in pro-drop languages, such as Chinese, generally leading to significant challenges with respect to the production of complete translations. To date, very little attention has been paid to the dropped pronoun (DP) problem within neural machine translation (NMT). In this work, we propose a novel reconstruction-based approach to alleviating DP translation problems for NMT models. Firstly, DPs within all source sentences are automatically annotated with parallel information extracted from the bilingual training corpus. Next, the annotated source sentence is reconstructed from hidden representations in the NMT model. With auxiliary training objectives, in terms of reconstruction scores, the parameters associated with the NMT model are guided to produce enhanced hidden representations that are encouraged as much as possible to embed annotated DP information. Experimental results on both Chinese-English and Japanese-English dialogue translation tasks show that the proposed approach significantly and consistently improves translation performance over a strong NMT baseline, which is directly built on the training data annotated with DPs.

Comments:	Accepted by AAAI-18
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1801.03257 [cs.CL]
	(or arXiv:1801.03257v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1801.03257

Submission history

From: Longyue Wang [view email]
[v1] Wed, 10 Jan 2018 07:53:22 UTC (116 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Longyue Wang
Zhaopeng Tu
Shuming Shi
Tong Zhang
Yvette Graham

…

export BibTeX citation

Computer Science > Computation and Language

Title:Translating Pro-Drop Languages with Reconstruction Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Translating Pro-Drop Languages with Reconstruction Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators