Augmentations in Graph Contrastive Learning: Current Methodological Flaws & Towards Better Practices

Trivedi, Puja; Lubana, Ekdeep Singh; Yan, Yujun; Yang, Yaoqing; Koutra, Danai

arXiv:2111.03220 (cs)

[Submitted on 5 Nov 2021 (v1), last revised 11 Mar 2022 (this version, v2)]

Abstract: Unsupervised graph representation learning is critical to a wide range of applications where labels may be scarce or expensive to procure. Contrastive learning (CL) is an increasingly popular paradigm for such settings and the state of the art in unsupervised visual representation learning. Recent work attributes the success of visual CL to the use of task-relevant augmentations and large, diverse datasets. Interestingly, graph CL frameworks report strong performance despite using orders of magnitude smaller datasets and employing domain-agnostic graph augmentations (DAGAs). Motivated by this discrepancy, we probe the quality of representations learnt by popular graph CL frameworks using DAGAs. We find that DAGAs can destroy task-relevant information and harm the model's ability to learn discriminative representations. On small benchmark datasets, we show the inductive bias of graph neural networks can significantly compensate for this weak discriminability. Based on our findings, we propose several sanity checks that enable practitioners to quickly assess the quality of their model's learned representations. We further propose a broad strategy for designing task-aware augmentations that are amenable to graph CL and demonstrate its efficacy on two large-scale, complex graph applications. For example, in graph-based document classification, we show task-aware augmentations improve accuracy by up to 20%.

Comments:	8 pages, 4 figures, Accepted WebConf 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2111.03220 [cs.LG]
	(or arXiv:2111.03220v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.03220

Submission history

From: Puja Trivedi
[v1] Fri, 5 Nov 2021 02:15:01 UTC (13,021 KB)
[v2] Fri, 11 Mar 2022 20:14:26 UTC (17,790 KB)
