DOI: 10.1145/3485447.3511938
research-article

Trustworthy Knowledge Graph Completion Based on Multi-sourced Noisy Data

Published: 25 April 2022

    Abstract

    Knowledge graphs (KGs) have become a valuable asset for many AI applications. Although some KGs contain plenty of facts, they are widely acknowledged to be incomplete. To address this issue, many KG completion methods have been proposed. Among them, open KG completion methods leverage the Web to find missing facts. However, noisy data collected from diverse sources may damage the completion accuracy. In this paper, we propose a new trustworthy method that exploits facts for a KG based on multi-sourced noisy data and existing facts in the KG. Specifically, we introduce a graph neural network with a holistic scoring function to judge the plausibility of facts with various value types. We design value alignment networks to resolve the heterogeneity between values and map them to entities, even those outside the KG. Furthermore, we present a truth inference model that incorporates data source qualities into the fact scoring function, and design a semi-supervised learning approach to infer the truths from heterogeneous values. We conduct extensive experiments to compare our method with state-of-the-art methods. The results show that our method achieves superior accuracy not only in completing missing facts but also in discovering new facts.
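To make the idea of weighing candidate facts by data source quality concrete, the following is a minimal sketch of a generic iterative weighted-voting scheme for truth inference. It is not the model proposed in the paper, which relies on a graph neural network scoring function and semi-supervised learning. The function name `infer_truths`, the `prior` and `n_iters` parameters, the smoothing rule, and the toy source and fact names are all assumptions made for illustration.

```python
from collections import defaultdict


def infer_truths(claims, n_iters=10, prior=0.8):
    """Toy truth inference by quality-weighted voting (illustrative only).

    claims: list of (source, fact_key, value) triples.
    Returns (truths, quality): the top-scoring value per fact_key and an
    estimated quality in (0, 1) for each source.
    """
    sources = {s for s, _, _ in claims}
    # Assumption: every source starts with the same prior quality.
    quality = {s: prior for s in sources}
    truths = {}
    for _ in range(n_iters):
        # Score each candidate value by the summed quality of its sources.
        scores = defaultdict(lambda: defaultdict(float))
        for s, key, value in claims:
            scores[key][value] += quality[s]
        truths = {key: max(vals, key=vals.get) for key, vals in scores.items()}
        # Re-estimate each source's quality as its smoothed agreement rate
        # with the currently inferred truths.
        hits = defaultdict(int)
        total = defaultdict(int)
        for s, key, value in claims:
            total[s] += 1
            hits[s] += int(truths[key] == value)
        quality = {s: (hits[s] + 1) / (total[s] + 2) for s in sources}
    return truths, quality


# Hypothetical claims: sources A and B agree on f1 and f2, while C disagrees.
# On f3 only A and C make a claim, so the estimated qualities break the tie.
claims = [
    ("A", "f1", "x"), ("B", "f1", "x"), ("C", "f1", "y"),
    ("A", "f2", "p"), ("B", "f2", "p"), ("C", "f2", "q"),
    ("C", "f3", "v"), ("A", "f3", "u"),
]
truths, quality = infer_truths(claims)
print(truths)
print(quality)
```

In this toy setting, a source that is outvoted on well-covered facts loses weight, so its claim also loses on a sparsely covered fact where plain majority voting would tie.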

          Published In

          WWW '22: Proceedings of the ACM Web Conference 2022
          April 2022
          3764 pages
          ISBN: 9781450390965
          DOI: 10.1145/3485447
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Author Tags

          1. Knowledge graph completion
          2. noisy data
          3. truth inference

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          WWW '22
          WWW '22: The ACM Web Conference 2022
          April 25 - 29, 2022
          Virtual Event, Lyon, France

          Acceptance Rates

          Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
