Graph Optimal Transport for Cross-Domain Alignment

Chen, Liqun; Gan, Zhe; Cheng, Yu; Li, Linjie; Carin, Lawrence; Liu, Jingjing

arXiv:2006.14744 (cs)

[Submitted on 26 Jun 2020 (v1), last revised 24 Jul 2020 (this version, v3)]

Abstract: Cross-domain alignment between two sets of entities (e.g., objects in an image, words in a sentence) is fundamental to both computer vision and natural language processing. Existing methods mainly focus on designing advanced attention mechanisms to simulate soft alignment, with no training signals to explicitly encourage alignment. The learned attention matrices are also dense and lack interpretability. We propose Graph Optimal Transport (GOT), a principled framework that builds on recent advances in Optimal Transport (OT). In GOT, cross-domain alignment is formulated as a graph matching problem, with the entities represented as a dynamically constructed graph. Two types of OT distances are considered: (i) Wasserstein distance (WD) for node (entity) matching; and (ii) Gromov-Wasserstein distance (GWD) for edge (structure) matching. Both WD and GWD can be incorporated into existing neural network models, effectively acting as a drop-in regularizer. The inferred transport plan also yields sparse and self-normalized alignment, enhancing the interpretability of the learned model. Experiments show that GOT consistently outperforms baselines across a wide range of tasks, including image-text retrieval, visual question answering, image captioning, machine translation, and text summarization.
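To make the two distances in the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes cosine distance as the cost, uniform weights over entities, and an entropic Sinkhorn iteration as the approximate OT solver; none of these choices is confirmed by the abstract. The function names (`cosine_cost`, `sinkhorn_plan`, `wasserstein_term`, `gromov_wasserstein_term`) and the parameters `eps` and `n_iters` are hypothetical. The Gromov-Wasserstein function only evaluates the structure discrepancy under a given plan; it does not solve the GW optimization itself.

```python
import numpy as np

def cosine_cost(x, y):
    """Pairwise cosine distance: C[i, j] = 1 - cos(x_i, y_j)."""
    xn = x / np.linalg.norm(x, axis=1, keepdims=True)
    yn = y / np.linalg.norm(y, axis=1, keepdims=True)
    return 1.0 - xn @ yn.T

def sinkhorn_plan(cost, eps=0.1, n_iters=200):
    """Entropic OT plan between uniform marginals (assumed solver)."""
    n, m = cost.shape
    mu = np.full(n, 1.0 / n)   # uniform weight per source entity (assumption)
    nu = np.full(m, 1.0 / m)   # uniform weight per target entity (assumption)
    K = np.exp(-cost / eps)    # Gibbs kernel
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iters):
        u = mu / (K @ v)       # rescale rows towards mu
        v = nu / (K.T @ u)     # rescale columns towards nu
    return u[:, None] * K * v[None, :]

def wasserstein_term(x, y, eps=0.1, n_iters=200):
    """Node-matching cost <T, C> under the Sinkhorn plan T."""
    cost = cosine_cost(x, y)
    plan = sinkhorn_plan(cost, eps, n_iters)
    return float(np.sum(plan * cost)), plan

def gromov_wasserstein_term(x, y, plan):
    """Edge-matching discrepancy for a fixed plan T:
    sum over i, j, k, l of (Cx[i, k] - Cy[j, l])^2 * T[i, j] * T[k, l]."""
    cx = cosine_cost(x, x)     # intra-domain structure of the first set
    cy = cosine_cost(y, y)     # intra-domain structure of the second set
    diff = cx[:, None, :, None] - cy[None, :, None, :]  # shape (n, m, n, m)
    return float(np.einsum("ij,ijkl,kl->", plan, diff ** 2, plan))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image_regions = rng.normal(size=(4, 8))  # stand-in for 4 object embeddings
    words = rng.normal(size=(6, 8))          # stand-in for 6 word embeddings
    wd, plan = wasserstein_term(image_regions, words)
    gwd = gromov_wasserstein_term(image_regions, words, plan)
    print("WD term:", wd, "GWD term:", gwd)
```

Under these assumptions, the "drop-in regularizer" reading of the abstract would amount to adding a weighted sum of the two terms to a model's task loss, with `plan` serving as the alignment matrix to inspect. The 4-D `diff` tensor is only practical for small entity sets; it is used here for readability rather than efficiency.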

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2006.14744 [cs.CL]
	(or arXiv:2006.14744v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2006.14744
Journal reference:	ICML 2020

Submission history

From: Liqun Chen [view email]
[v1] Fri, 26 Jun 2020 01:14:23 UTC (5,093 KB)
[v2] Mon, 29 Jun 2020 15:58:36 UTC (16,662 KB)
[v3] Fri, 24 Jul 2020 20:04:49 UTC (16,664 KB)
