Viewing Knowledge Transfer in Multilingual Machine Translation Through a Representational Lens

Stap, David; Niculae, Vlad; Monz, Christof

Computer Science > Computation and Language

arXiv:2305.11550v3 (cs)

[Submitted on 19 May 2023 (v1), last revised 4 Dec 2023 (this version, v3)]

Title:Viewing Knowledge Transfer in Multilingual Machine Translation Through a Representational Lens

Authors:David Stap, Vlad Niculae, Christof Monz

View PDF HTML (experimental)

Abstract:We argue that translation quality alone is not a sufficient metric for measuring knowledge transfer in multilingual neural machine translation. To support this claim, we introduce Representational Transfer Potential (RTP), which measures representational similarities between languages. We show that RTP can measure both positive and negative transfer (interference), and find that RTP is strongly correlated with changes in translation quality, indicating that transfer does occur. Furthermore, we investigate data and language characteristics that are relevant for transfer, and find that multi-parallel overlap is an important yet under-explored feature. Based on this, we develop a novel training scheme, which uses an auxiliary similarity loss that encourages representations to be more invariant across languages by taking advantage of multi-parallel data. We show that our method yields increased translation quality for low- and mid-resource languages across multiple data and model setups.

Comments:	Accepted to EMNLP 2023 Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.11550 [cs.CL]
	(or arXiv:2305.11550v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.11550

Submission history

From: David Stap [view email]
[v1] Fri, 19 May 2023 09:36:48 UTC (343 KB)
[v2] Mon, 23 Oct 2023 07:29:13 UTC (410 KB)
[v3] Mon, 4 Dec 2023 10:15:37 UTC (390 KB)

Computer Science > Computation and Language

Title:Viewing Knowledge Transfer in Multilingual Machine Translation Through a Representational Lens

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Viewing Knowledge Transfer in Multilingual Machine Translation Through a Representational Lens

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators