Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation

Oka, Yui; Sudoh, Katsuhito; Nakamura, Satoshi

Computer Science > Computation and Language

arXiv:2107.13689 (cs)

[Submitted on 29 Jul 2021]

Title:Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation

Authors:Yui Oka, Katsuhito Sudoh, Satoshi Nakamura

View PDF

Abstract:Non-autoregressive neural machine translation (NAT) usually employs sequence-level knowledge distillation using autoregressive neural machine translation (AT) as its teacher model. However, a NAT model often outputs shorter sentences than an AT model. In this work, we propose sequence-level knowledge distillation (SKD) using perturbed length-aware positional encoding and apply it to a student model, the Levenshtein Transformer. Our method outperformed a standard Levenshtein Transformer by 2.5 points in bilingual evaluation understudy (BLEU) at maximum in a WMT14 German to English translation. The NAT model output longer sentences than the baseline NAT models.

Comments:	5 pages, 1 figures. Will be presented at ACL SRW 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2107.13689 [cs.CL]
	(or arXiv:2107.13689v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2107.13689

Submission history

From: Yui Oka [view email]
[v1] Thu, 29 Jul 2021 00:51:44 UTC (58 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Katsuhito Sudoh
Satoshi Nakamura

export BibTeX citation

Computer Science > Computation and Language

Title:Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators