Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3360901.3364443acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Ranking Knowledge Graphs By Capturing Knowledge about Languages and Labels

Published: 23 September 2019 Publication History

Abstract

Capturing knowledge about the mulitilinguality of a knowledge graph is of supreme importance to understand its applicability across multiple languages. Several metrics have been proposed for describing mulitilinguality at the level of a whole knowledge graph. Albeit enabling the understanding of the ecosystem of knowledge graphs in terms of the utilized languages, they are unable to capture a fine-grained description of the languages in which the different entities and properties of the knowledge graph are represented. This lack of representation prevents the comparison of existing knowledge graphs in order to decide which are the most appropriate for a multilingual application.
In this work, we approach the problem of ranking knowledge graphs based on their language features and propose LINGVO, a framework able to capture mulitilinguality at different levels of granularity. Grounded in knowledge graph descriptions, LINGVO is, additionally, able to solve the problem of ranking knowledge graphs according to a degree of mulitilinguality of the represented entities. We have empirically studied the effectiveness of LINGVO in a benchmark of queries to be executed against existing knowledge graphs. The observed results provide evidence that LINGVO captures the mulitilinguality of the studied knowledge graphs similarly than a crowd-sourced gold standard.

References

[1]
Jeremy Debattista, Sören Auer, and Christoph Lange. 2016. Luzzu - A Methodology and Framework for Linked Data Quality Assessment. J. Data and Information Quality 8, 1 (2016), 4:1--4:32.
[2]
Dennis Diefenbach, Vanessa López, Kamal Deep Singh, and Pierre Maret. 2018. Core Techniques of Question Answering Systems over Knowledge Bases: A Survey. Knowl. Inf. Syst. 55, 3 (2018), 529--569. https://doi.org/10.1007/ s10115-017--1100-y
[3]
Basil Ell, Denny Vrandecic, and Elena Paslaru Bontas Simperl. 2011. Labels in the Web of Data. In The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, October 23--27, 2011, Proceedings, Part I. 162--176.
[4]
Kemele M. Endris, Philipp D. Rohde, Maria-Esther Vidal, and Sören Auer. 2019. Ontario: Federated Query Processing Against a Semantic Data Lake. In Database and Expert Systems Applications, DEXA 2019, Linz, Austria, Proceedings, Part I. 379--395.
[5]
Asunción Gómez-Pérez, Daniel Vila-Suero, Elena Montiel-Ponsoda, Jorge Gracia, and Guadalupe Aguado de Cea. 2013. Guidelines for Multilingual Linked Data. In 3rd International Conference on Web Intelligence, Mining and Semantics, WIMS '13, Madrid, Spain, June 12--14, 2013. 3.
[6]
Jorge Gracia, Elena Montiel-Ponsoda, Philipp Cimiano, Asunción Gómez-Pérez, Paul Buitelaar, and John P. McCrae. 2012. Challenges for the Multilingual Web of Data. J. Web Sem. 11 (2012), 63--71.
[7]
Oktie Hassanzadeh and Mariano P. Consens. 2009. Linked Movie Data Base. In Proceedings of the WWW2009 Workshop on Linked Data on the Web, LDOW 2009, Madrid, Spain, April 20, 2009.
[8]
Lucie-Aimée Kaffee, Alessandro Piscopo, Pavlos Vougiouklis, Elena Simperl, Leslie Carr, and Lydia Pintscher. 2017. A Glimpse into Babel: An Analysis of Multilinguality in Wikidata. In Proceedings of the 13th International Symposium on Open Collaboration, OpenSym 2017, Galway, Ireland, August 23--25, 2017. 14:1--14:5.
[9]
Lucie-Aimée Kaffee and Elena Simperl. 2018. The Human Face of the Web of Data: A Cross-sectional Study of Labels. In Proceedings of the 14th International Conference on Semantic Systems, SEMANTICS 2018, Vienna, Austria, September 10--13, 2018. 66--77.
[10]
Lucie-Aimée Kaffee, Alessandro Piscopo, Pavlos Vougiouklis, Elena Simperl, Leslie Carr, and Lydia Pintscher. 2017. A Glimpse into Babel: An Analysis of Multilinguality in Wikidata. In Proceedings of the 13th International Symposium on Open Collaboration. ACM, 14.
[11]
Denys Katerenchuk and Andrew Rosenberg. 2016. RankDCG: Rank-OOrdering Evaluation Measure. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA).
[12]
Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick Van Kleef, Sören Auer, et al. 2015. DBpedia - A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. Semantic Web 6, 2 (2015), 167--195.
[13]
Farzaneh Mahdisoltani, Joanna Biega, and Fabian M. Suchanek. 2015. Yago3: A Knowledge Base from Multilingual Wikipedias. In CIDR 2015, Seventh Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4--7, 2015, Online Proceedings.
[14]
Michael Schmidt, Michael Meier, and Georg Lausen. 2010. Foundations of SPARQL query optimization. In Proceedings of the 13th International Conference on Database Theory. ACM, 4--33.
[15]
Aaron Swartz. 2002. MusicBrainz: A Semantic Web Service. IEEE Intelligent Systems 17, 1 (2002), 76--77.
[16]
Ricardo Usbeck, Ria Hari Gusmita, Axel-Cyrille Ngonga Ngomo, and Muhammad Saleem. 2018. 9th Challenge on Question Answering over Linked Data (QALD-9) (invited paper). In Joint proceedings of the 4thWorkshop on Semantic Deep Learning (SemDeep-4) and NLIWoD4; and 9th Question Answering over Linked Data challenge (QALD-9) co-located with 17th International SemanticWeb Conference (ISWC 2018), Monterey, California, United States of America.
[17]
Amrapali Zaveri, Anisa Rula, Andrea Maurino, Ricardo Pietrobon, Jens Lehmann, and Sören Auer. 2016. Quality assessment for Linked Data: A Survey. Semantic Web 7, 1 (2016), 63--93.

Cited By

View all
  • (2024)Tracing the Impact of Bias in Link PredictionProceedings of the 39th ACM/SIGAPP Symposium on Applied Computing10.1145/3605098.3635912(1626-1633)Online publication date: 8-Apr-2024
  • (2022)Compositional Generalization in Multilingual Semantic Parsing over WikidataTransactions of the Association for Computational Linguistics10.1162/tacl_a_0049910(937-955)Online publication date: 7-Sep-2022
  • (2022)Knowledge Graph Question Answering Datasets and Their GeneralizabilityProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531751(3209-3218)Online publication date: 6-Jul-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture
September 2019
281 pages
ISBN:9781450370080
DOI:10.1145/3360901
  • General Chairs:
  • Mayank Kejriwal,
  • Pedro Szekely,
  • Program Chair:
  • Raphaël Troncy
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 September 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. knowledge graph
  2. multilinguality
  3. question answering
  4. ranking

Qualifiers

  • Research-article

Funding Sources

Conference

K-CAP '19
Sponsor:
K-CAP '19: Knowledge Capture Conference
November 19 - 21, 2019
CA, Marina Del Rey, USA

Acceptance Rates

Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Tracing the Impact of Bias in Link PredictionProceedings of the 39th ACM/SIGAPP Symposium on Applied Computing10.1145/3605098.3635912(1626-1633)Online publication date: 8-Apr-2024
  • (2022)Compositional Generalization in Multilingual Semantic Parsing over WikidataTransactions of the Association for Computational Linguistics10.1162/tacl_a_0049910(937-955)Online publication date: 7-Sep-2022
  • (2022)Knowledge Graph Question Answering Datasets and Their GeneralizabilityProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531751(3209-3218)Online publication date: 6-Jul-2022

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media