Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark

Qian, Xiaowei; Guo, Zhimeng; Li, Jialiang; Mao, Haitao; Li, Bingheng; Wang, Suhang; Ma, Yao

Computer Science > Machine Learning

arXiv:2403.06017 (cs)

[Submitted on 9 Mar 2024 (v1), last revised 18 Jun 2024 (this version, v2)]

Title:Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark

Authors:Xiaowei Qian, Zhimeng Guo, Jialiang Li, Haitao Mao, Bingheng Li, Suhang Wang, Yao Ma

View PDF HTML (experimental)

Abstract:Fair graph learning plays a pivotal role in numerous practical applications. Recently, many fair graph learning methods have been proposed; however, their evaluation often relies on poorly constructed semi-synthetic datasets or substandard real-world datasets. In such cases, even a basic Multilayer Perceptron (MLP) can outperform Graph Neural Networks (GNNs) in both utility and fairness. In this work, we illustrate that many datasets fail to provide meaningful information in the edges, which may challenge the necessity of using graph structures in these problems. To address these issues, we develop and introduce a collection of synthetic, semi-synthetic, and real-world datasets that fulfill a broad spectrum of requirements. These datasets are thoughtfully designed to include relevant graph structures and bias information crucial for the fair evaluation of models. The proposed synthetic and semi-synthetic datasets offer the flexibility to create data with controllable bias parameters, thereby enabling the generation of desired datasets with user-defined bias values with ease. Moreover, we conduct systematic evaluations of these proposed datasets and establish a unified evaluation approach for fair graph learning models. Our extensive experimental results with fair graph learning methods across our datasets demonstrate their effectiveness in benchmarking the performance of these methods. Our datasets and the code for reproducing our experiments are available at this https URL.

Comments:	KDD ADS 2024
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY)
Cite as:	arXiv:2403.06017 [cs.LG]
	(or arXiv:2403.06017v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.06017

Submission history

From: Xiaowei Qian [view email]
[v1] Sat, 9 Mar 2024 21:33:26 UTC (74 KB)
[v2] Tue, 18 Jun 2024 03:55:04 UTC (74 KB)

Computer Science > Machine Learning

Title:Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New Benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators