Understanding Negative Sampling in Graph Representation Learning

Yang, Zhen; Ding, Ming; Zhou, Chang; Yang, Hongxia; Zhou, Jingren; Tang, Jie

Computer Science > Machine Learning

arXiv:2005.09863 (cs)

[Submitted on 20 May 2020 (v1), last revised 25 Jun 2020 (this version, v2)]

Title:Understanding Negative Sampling in Graph Representation Learning

Authors:Zhen Yang, Ming Ding, Chang Zhou, Hongxia Yang, Jingren Zhou, Jie Tang

View PDF

Abstract:Graph representation learning has been extensively studied in recent years. Despite its potential in generating continuous embeddings for various networks, both the effectiveness and efficiency to infer high-quality representations toward large corpus of nodes are still challenging. Sampling is a critical point to achieve the performance goals. Prior arts usually focus on sampling positive node pairs, while the strategy for negative sampling is left insufficiently explored. To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance. To the best of our knowledge, we are the first to derive the theory and quantify that the negative sampling distribution should be positively but sub-linearly correlated to their positive sampling distribution. With the guidance of the theory, we propose MCNS, approximating the positive distribution with self-contrast approximation and accelerating negative sampling by Metropolis-Hastings. We evaluate our method on 5 datasets that cover extensive downstream graph learning tasks, including link prediction, node classification and personalized recommendation, on a total of 19 experimental settings. These relatively comprehensive experimental results demonstrate its robustness and superiorities.

Comments:	KDD 2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2005.09863 [cs.LG]
	(or arXiv:2005.09863v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2005.09863

Submission history

From: Chang Zhou [view email]
[v1] Wed, 20 May 2020 06:25:21 UTC (6,295 KB)
[v2] Thu, 25 Jun 2020 04:10:30 UTC (2,299 KB)

Computer Science > Machine Learning

Title:Understanding Negative Sampling in Graph Representation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding Negative Sampling in Graph Representation Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators