ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Chen, Ling-Hao; Zhang, Yuanshuo; Huang, Taohua; Su, Liangcai; Lin, Zeyi; Xiao, Xi; Xia, Xiaobo; Liu, Tongliang

Computer Science > Machine Learning

arXiv:2312.08852 (cs)

[Submitted on 13 Dec 2023 (v1), last revised 8 Mar 2024 (this version, v2)]

Title:ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Authors:Ling-Hao Chen, Yuanshuo Zhang, Taohua Huang, Liangcai Su, Zeyi Lin, Xi Xiao, Xiaobo Xia, Tongliang Liu

View PDF HTML (experimental)

Abstract:Deep learning has achieved remarkable success in graph-related tasks, yet this accomplishment heavily relies on large-scale high-quality annotated datasets. However, acquiring such datasets can be cost-prohibitive, leading to the practical use of labels obtained from economically efficient sources such as web searches and user tags. Unfortunately, these labels often come with noise, compromising the generalization performance of deep networks. To tackle this challenge and enhance the robustness of deep learning models against label noise in graph-based tasks, we propose a method called ERASE (Error-Resilient representation learning on graphs for lAbel noiSe tolerancE). The core idea of ERASE is to learn representations with error tolerance by maximizing coding rate reduction. Particularly, we introduce a decoupled label propagation method for learning representations. Before training, noisy labels are pre-corrected through structural denoising. During training, ERASE combines prototype pseudo-labels with propagated denoised labels and updates representations with error resilience, which significantly improves the generalization performance in node classification. The proposed method allows us to more effectively withstand errors caused by mislabeled nodes, thereby strengthening the robustness of deep networks in handling noisy graph data. Extensive experimental results show that our method can outperform multiple baselines with clear margins in broad noise levels and enjoy great scalability. Codes are released at this https URL.

Comments:	24 pages, 14 figures, 15 tables and a project page at this https URL
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2312.08852 [cs.LG]
	(or arXiv:2312.08852v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.08852

Submission history

From: Yuanshuo Zhang [view email]
[v1] Wed, 13 Dec 2023 17:59:07 UTC (28,981 KB)
[v2] Fri, 8 Mar 2024 12:29:44 UTC (28,974 KB)

Computer Science > Machine Learning

Title:ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators