Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study

Authors:

Farahnaz Akrami,

Mohammed Samiul Saeef,

Qingheng Zhang,

Wei Hu,

Chengkai Li

SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data

Pages 1995 - 2010

https://doi.org/10.1145/3318464.3380599

Published: 31 May 2020

Abstract

In the active research area of employing embedding models for knowledge graph completion, particularly for the task of link prediction, most prior studies used two benchmark datasets, FB15k and WN18, to evaluate such models. Most triples in these and other datasets used in such studies belong to reverse and duplicate relations, which exhibit high data redundancy due to semantic duplication, correlation, or data incompleteness. This is a case of excessive data leakage: a model is trained using features that would otherwise not be available when the model is applied for real prediction. There are also Cartesian product relations, for which every triple formed by the Cartesian product of applicable subjects and objects is a true fact. Link prediction on the aforementioned relations is easy and can be achieved with even better accuracy using straightforward rules instead of sophisticated embedding models. A more fundamental defect of these models is that the link prediction scenario, given such data, does not exist in the real world. This paper is the first systematic study with the main objective of assessing the true effectiveness of embedding models when the unrealistic triples are removed. Our experimental results show that these models are much less accurate than previously perceived. Their poor accuracy renders link prediction a task without a truly effective automated solution. Hence, we call for re-investigation of possible effective approaches.
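To make the redundancy described in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes triples are plain `(subject, relation, object)` tuples; the function names, the toy relation names, and the overlap threshold of 0.8 are all hypothetical choices made for illustration. It shows one way reverse relation pairs and Cartesian product relations could be detected, and how a straightforward rule can answer a link prediction query by flipping a training triple.

```python
from collections import defaultdict


def pairs_by_relation(triples):
    """Group the (subject, object) pairs of each relation."""
    pairs = defaultdict(set)
    for s, r, o in triples:
        pairs[r].add((s, o))
    return pairs


def find_reverse_relations(triples, threshold=0.8):
    """Return relation pairs (r1, r2) whose pairs largely mirror each other.

    The threshold is an assumed cut-off, not a value taken from the paper:
    at least that fraction of each relation's pairs must appear flipped
    in the other relation.
    """
    pairs = pairs_by_relation(triples)
    relations = sorted(pairs)
    found = []
    for i, r1 in enumerate(relations):
        for r2 in relations[i + 1:]:
            flipped = {(o, s) for s, o in pairs[r2]}
            shared = len(pairs[r1] & flipped)
            if (shared / len(pairs[r1]) >= threshold
                    and shared / len(pairs[r2]) >= threshold):
                found.append((r1, r2))
    return found


def cartesian_coverage(pairs):
    """Fraction of the subject x object grid covered by a relation's pairs.

    A value of 1.0 means every subject is linked to every object,
    i.e. the relation behaves like a Cartesian product relation.
    """
    subjects = {s for s, _ in pairs}
    objects = {o for _, o in pairs}
    return len(pairs) / (len(subjects) * len(objects))


def predict_by_reverse_rule(train, reverse_of, subject, relation):
    """Answer (subject, relation, ?) by flipping triples of the reverse relation."""
    rev = reverse_of.get(relation)
    return {s for s, r, o in train if r == rev and o == subject}
```

For example, if the training set contains `("erin", "child_of", "frank")` and `child_of` has been identified as the reverse of `parent_of`, then `predict_by_reverse_rule(train, {"parent_of": "child_of"}, "frank", "parent_of")` recovers `erin` without any learned embedding, which is the kind of leakage the abstract describes.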

Supplementary Material

Source Code (3318464.3380599_source_code.zip), 230.84 MB

Read me (3318464.3380599_readme.pdf), 23.66 KB

