On the Use of Unrealistic Predictions in Hundreds of Papers Evaluating Graph Representations

Lin, Li-Chung; Liu, Cheng-Hung; Chen, Chih-Ming; Hsu, Kai-Chin; Wu, I-Feng; Tsai, Ming-Feng; Lin, Chih-Jen


arXiv:2112.04274 (cs)

[Submitted on 8 Dec 2021 (v1), last revised 13 Dec 2021 (this version, v3)]



Abstract: Prediction using the ground truth sounds like an oxymoron in machine learning. However, such an unrealistic setting has been used in hundreds, if not thousands, of papers in the area of finding graph representations. To evaluate the multi-label problem of node classification using the obtained representations, many works assume in the prediction stage that the number of labels of each test instance is known. In practice such ground-truth information is rarely available, yet we point out that this inappropriate setting is now ubiquitous in this research area. We investigate in detail why the situation occurs. Our analysis indicates that with unrealistic information, the performance is likely over-estimated. To see why suitable predictions were not used, we identify difficulties in applying some multi-label techniques. For use in future studies, we propose simple and effective settings that do not use practically unknown information. Finally, we take this chance to conduct a fair and serious comparison of major graph-representation learning methods on multi-label node classification.
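To make the setting described in the abstract concrete, the following is a minimal sketch, not the authors' code. It contrasts a prediction rule that reads each test instance's true number of labels with one that uses only the decision scores. The function names, the fixed threshold of 0, and the toy scores are all assumptions made for illustration. The thresholding rule is a generic alternative and is not necessarily the setting the paper proposes.

```python
import numpy as np

def predict_with_known_label_count(scores, y_true):
    """Unrealistic rule: for each instance, predict the top-k scored labels,
    where k is read from the ground truth of that same test instance."""
    preds = np.zeros_like(y_true)
    for i, row in enumerate(scores):
        k = int(y_true[i].sum())        # ground-truth information used at prediction time
        top_k = np.argsort(-row)[:k]    # indices of the k largest scores
        preds[i, top_k] = 1
    return preds

def predict_with_threshold(scores, threshold=0.0):
    """Rule that needs no test-time ground truth: predict every label
    whose decision score exceeds a fixed threshold (hypothetical choice)."""
    return (scores > threshold).astype(int)

def micro_f1(y_true, y_pred):
    """Micro-averaged F1 over all (instance, label) pairs."""
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

# Toy decision scores for 3 test nodes and 3 labels (made-up numbers,
# not results from the paper).
scores = np.array([
    [0.9, -0.2, -0.4],
    [-0.1, -0.3, -0.8],
    [0.7, 0.6, 0.2],
])
y_true = np.array([
    [1, 1, 0],
    [1, 0, 0],
    [0, 1, 0],
])

f1_known_count = micro_f1(y_true, predict_with_known_label_count(scores, y_true))
f1_threshold = micro_f1(y_true, predict_with_threshold(scores))
print(f1_known_count, f1_threshold)
```

On this toy input the rule that reads the label count scores higher than the threshold rule, even though both use the same decision scores. This is the kind of gap the abstract has in mind when it says performance is likely over-estimated. A single hand-made example shows only the mechanism, not its size on real data.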

Comments:	Accepted by AAAI 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.04274 [cs.LG]
	(or arXiv:2112.04274v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.04274

Submission history

From: Cheng-Hung Liu
[v1] Wed, 8 Dec 2021 13:15:48 UTC (33 KB)
[v2] Thu, 9 Dec 2021 11:05:40 UTC (33 KB)
[v3] Mon, 13 Dec 2021 08:39:22 UTC (401 KB)
