Fuzzy Alignments in Directed Acyclic Graph for Non-Autoregressive Machine Translation

Ma, Zhengrui; Shao, Chenze; Gui, Shangtong; Zhang, Min; Feng, Yang

arXiv:2303.06662 (cs)

[Submitted on 12 Mar 2023 (v1), last revised 17 Jul 2023 (this version, v2)]

Abstract: Non-autoregressive translation (NAT) reduces decoding latency but suffers from performance degradation due to the multi-modality problem. Recently, the directed acyclic graph structure has achieved great success in NAT; it tackles the multi-modality problem by introducing dependencies between vertices. However, training it with the negative log-likelihood loss implicitly requires a strict alignment between reference tokens and vertices, weakening its ability to handle multiple translation modalities. In this paper, we hold the view that all paths in the graph are fuzzily aligned with the reference sentence. We do not require an exact alignment but train the model to maximize a fuzzy alignment score between the graph and the reference, which takes the translations captured in all modalities into account. Extensive experiments on major WMT benchmarks show that our method substantially improves translation performance and increases prediction confidence, setting a new state of the art for NAT on the raw training data.

Comments:	ICLR 2023
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7
Cite as:	arXiv:2303.06662 [cs.CL]
	(or arXiv:2303.06662v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.06662

Submission history

From: Zhengrui Ma
[v1] Sun, 12 Mar 2023 13:51:38 UTC (152 KB)
[v2] Mon, 17 Jul 2023 07:21:50 UTC (152 KB)
