Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation

Guo, Junliang; Tan, Xu; Xu, Linli; Qin, Tao; Chen, Enhong; Liu, Tie-Yan

arXiv:1911.08717 (cs)

[Submitted on 20 Nov 2019 (v1), last revised 21 Nov 2019 (this version, v2)]

Abstract: Non-autoregressive translation (NAT) models remove the dependence on previous target tokens and generate all target tokens in parallel, which speeds up inference significantly but at the cost of lower translation accuracy than autoregressive translation (AT) models. Since AT models are more accurate and easier to train than NAT models, and the two share the same model configuration, a natural way to improve the accuracy of NAT models is to transfer a well-trained AT model to an NAT model through fine-tuning. However, because AT and NAT models differ greatly in training strategy, straightforward fine-tuning does not work well. In this work, we introduce curriculum learning into fine-tuning for NAT. Specifically, we design a curriculum for the fine-tuning process that progressively switches training from autoregressive generation to non-autoregressive generation. Experiments on four benchmark translation datasets show that the proposed method improves translation accuracy by more than 1 BLEU point over previous NAT baselines and speeds up inference by more than 10 times over AT baselines.
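
As a rough illustration of what "progressively switching" from autoregressive to non-autoregressive training could look like, the sketch below mixes two kinds of decoder input according to a schedule. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not specify the schedule or the substitution rule, so the linear pacing function, the per-position random substitution, and the names `nat_rate` and `mix_decoder_inputs` are all hypothetical.

```python
import random


def nat_rate(step: int, total_steps: int) -> float:
    """Assumed linear pacing: fraction of decoder positions that are
    trained NAT-style at fine-tuning step `step` (0.0 = fully AT, 1.0 = fully NAT)."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    return min(1.0, max(0.0, step / total_steps))


def mix_decoder_inputs(at_inputs, nat_inputs, rate, rng):
    """For each position, use the NAT-style input with probability `rate`,
    otherwise keep the AT-style (teacher-forced) input."""
    if len(at_inputs) != len(nat_inputs):
        raise ValueError("input sequences must have the same length")
    return [n if rng.random() < rate else a
            for a, n in zip(at_inputs, nat_inputs)]


# Illustrative placeholders only: AT inputs are the shifted target tokens,
# NAT inputs stand in for target-independent inputs.
AT_INPUTS = ["<bos>", "the", "cat", "sat"]
NAT_INPUTS = ["<n0>", "<n1>", "<n2>", "<n3>"]
```

Calling `mix_decoder_inputs(AT_INPUTS, NAT_INPUTS, nat_rate(step, total_steps), random.Random(0))` at increasing values of `step` yields inputs that start fully autoregressive and end fully non-autoregressive. Other pacing functions could be swapped in for `nat_rate` without changing the mixing step.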

Comments:	AAAI 2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.08717 [cs.LG]
	(or arXiv:1911.08717v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.08717

Submission history

From: Junliang Guo
[v1] Wed, 20 Nov 2019 05:48:31 UTC (543 KB)
[v2] Thu, 21 Nov 2019 09:43:45 UTC (543 KB)
