A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Du, Yali; Sun, Hui; Li, Ming

Computer Science > Software Engineering

arXiv:2408.14515 (cs)

[Submitted on 25 Aug 2024 (v1), last revised 13 Sep 2024 (this version, v2)]

Title:A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Authors:Yali Du, Hui Sun, Ming Li

View PDF HTML (experimental)

Abstract:Programs implemented in various programming languages form the foundation of software applications. To alleviate the burden of program migration and facilitate the development of software systems, automated program translation across languages has garnered significant attention. Previous approaches primarily focus on pairwise translation paradigms, learning translation between pairs of languages using bilingual parallel data. However, parallel data is difficult to collect for some language pairs, and the distribution of program semantics across languages can shift, posing challenges for pairwise program translation. In this paper, we argue that jointly learning a unified model to translate code across multiple programming languages is superior to separately learning from bilingual parallel data. We propose Variational Interaction for Multilingual Program Translation~(VIM-PT), a disentanglement-based generative approach that jointly trains a unified model for multilingual program translation across multiple languages. VIM-PT disentangles code into language-shared and language-specific features, using variational inference and interaction information with a novel lower bound, then achieves program translation through conditional generation. VIM-PT demonstrates four advantages: 1) captures language-shared information more accurately from various implementations and improves the quality of multilingual program translation, 2) mines and leverages the capability of non-parallel data, 3) addresses the distribution shift of program semantics across languages, 4) and serves as a unified model, reducing deployment complexity.

Comments:	Accepted by the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024)
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Programming Languages (cs.PL)
Cite as:	arXiv:2408.14515 [cs.SE]
	(or arXiv:2408.14515v2 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2408.14515

Submission history

From: Yali Du [view email]
[v1] Sun, 25 Aug 2024 11:33:52 UTC (613 KB)
[v2] Fri, 13 Sep 2024 04:25:37 UTC (614 KB)

Computer Science > Software Engineering

Title:A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators