RotoGrad: Gradient Homogenization in Multitask Learning

Javaloy, Adrián; Valera, Isabel

Computer Science > Machine Learning

arXiv:2103.02631 (cs)

[Submitted on 3 Mar 2021 (v1), last revised 16 Feb 2022 (this version, v3)]

Abstract: Multitask learning is being increasingly adopted in application domains such as computer vision and reinforcement learning. However, optimally exploiting its advantages remains a major challenge due to the effect of negative transfer. Previous works have traced this issue to disparities in gradient magnitudes and directions across tasks when optimizing the shared network parameters. While recent work has acknowledged that negative transfer is a two-fold problem, existing approaches fall short because they either focus only on homogenizing the gradient magnitude across tasks or greedily change the gradient directions, overlooking future conflicts. In this work, we introduce RotoGrad, an algorithm that tackles negative transfer as a whole: it jointly homogenizes gradient magnitudes and directions, while ensuring training convergence. We show that RotoGrad outperforms competing methods in complex problems, including multi-label classification in CelebA and computer vision tasks in the NYUv2 dataset. A PyTorch implementation can be found at this https URL.
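
The two-fold nature of the problem described in the abstract can be illustrated with a small sketch. This is not the RotoGrad algorithm or its PyTorch implementation. It only measures the two kinds of disparity the abstract names (magnitude and direction) on two hypothetical task gradients, and shows that rescaling alone leaves the direction conflict unchanged. The gradient values and all function names are assumptions made for illustration.

```python
import math


def norm(g):
    """Euclidean norm of a gradient given as a list of floats."""
    return math.sqrt(sum(x * x for x in g))


def cosine_similarity(g1, g2):
    """Cosine of the angle between two gradients; negative means conflict."""
    dot = sum(a * b for a, b in zip(g1, g2))
    return dot / (norm(g1) * norm(g2))


def rescale_to_mean_norm(grads):
    """Magnitude homogenization only: give every gradient the mean norm."""
    target = sum(norm(g) for g in grads) / len(grads)
    return [[x * target / norm(g) for x in g] for g in grads]


# Hypothetical gradients of two task losses w.r.t. the shared parameters.
grad_task_a = [3.0, 4.0]   # large magnitude
grad_task_b = [-0.6, 0.0]  # small magnitude, partly opposing task A

print("norm ratio:", norm(grad_task_a) / norm(grad_task_b))
print("cosine before:", cosine_similarity(grad_task_a, grad_task_b))

scaled = rescale_to_mean_norm([grad_task_a, grad_task_b])
print("norms after:", [norm(g) for g in scaled])
print("cosine after:", cosine_similarity(scaled[0], scaled[1]))
```

In this toy case the rescaled gradients have equal norms, yet their cosine similarity stays negative, which is the gap the abstract attributes to methods that address magnitudes alone.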

Comments:	Spotlight at ICLR 2022. 24 pages, 9 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2103.02631 [cs.LG]
	(or arXiv:2103.02631v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.02631

Submission history

From: Adrián Javaloy
[v1] Wed, 3 Mar 2021 19:03:52 UTC (1,487 KB)
[v2] Wed, 6 Oct 2021 20:15:38 UTC (2,189 KB)
[v3] Wed, 16 Feb 2022 11:20:05 UTC (2,191 KB)