Semantic Web - Volume Pre-press, issue Pre-press - Journals

Show:

results per page

Background knowledge in ontology matching: A survey

Authors: Portisch, Jan | Hladik, Michael | Paulheim, Heiko

Article Type: Research Article

Abstract: Ontology matching is an integral part for establishing semantic interoperability. One of the main challenges within the ontology matching operation is semantic heterogeneity, i.e. modeling differences between the two ontologies that are to be integrated. The semantics within most ontologies or schemas are, however, typically incomplete because they are designed within a certain context which is not explicitly modeled. Therefore, external background knowledge plays a major role in the task of (semi-) automated ontology and schema matching. In this survey, we introduce the reader to the general ontology matching problem. We review the background knowledge sources as well as …the approaches applied to make use of external knowledge. Our survey covers all ontology matching systems that have been presented within the years 2004–2021 at a well-known ontology matching competition together with systematically selected publications in the research field. We present a classification system for external background knowledge, concept linking strategies, as well as for background knowledge exploitation approaches. We provide extensive examples and classify all ontology matching systems under review in a resource/strategy matrix obtained by coalescing the two classification systems. Lastly, we outline interesting and yet underexplored research directions of applying external knowledge within the ontology matching process. Show more

Keywords: Ontology matching, schema matching, background knowledge, data integration, semantic integration, knowledge graphs, ontologies

DOI: 10.3233/SW-223085

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-55, 2022

Get PDF

MTab4D: Semantic annotation of tabular data with DBpedia

Authors: Nguyen, Phuc | Kertkeidkachorn, Natthawut | Ichise, Ryutaro | Takeda, Hideaki

Article Type: Research Article

Abstract: Semantic annotation of tabular data is the process of matching table elements with knowledge graphs. As a result, the table contents could be interpreted or inferred using knowledge graph concepts, enabling them to be useful in downstream applications such as data analytics and management. Nevertheless, semantic annotation tasks are challenging due to insufficient tabular data descriptions, heterogeneous schema, and vocabulary issues. This paper presents an automatic semantic annotation system for tabular data, called MTab4D, to generate annotations with DBpedia in three annotation tasks: 1) matching table cells to entities, 2) matching columns to entity types, and 3) matching pairs of …columns to properties. In particular, we propose an annotation pipeline that combines multiple matching signals from different table elements to address schema heterogeneity, data ambiguity, and noisiness. Additionally, this paper provides insightful analysis and extra resources on benchmarking semantic annotation with knowledge graphs. Experimental results on the original and adapted datasets of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab 2019) show that our system achieves an impressive performance for the three annotation tasks. MTab4D’s repository is publicly available at https://github.com/phucty/mtab4dbpedia . Show more

Keywords: Table annotation, knowledge graph, DBpedia, semantic table interpretation

DOI: 10.3233/SW-223098

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-25, 2022

Get PDF

Is neuro-symbolic AI meeting its promises in natural language processing? A structured review

Authors: Hamilton, Kyle | Nayak, Aparna | Božić, Bojan | Longo, Luca

Article Type: Research Article

Abstract: Advocates for Neuro-Symbolic Artificial Intelligence (NeSy) assert that combining deep learning with symbolic reasoning will lead to stronger AI than either paradigm on its own. As successful as deep learning has been, it is generally accepted that even our best deep learning systems are not very good at abstract reasoning. And since reasoning is inextricably linked to language, it makes intuitive sense that Natural Language Processing (NLP), would be a particularly well-suited candidate for NeSy. We conduct a structured review of studies implementing NeSy for NLP, with the aim of answering the question of whether NeSy is indeed meeting its …promises: reasoning, out-of-distribution generalization, interpretability, learning and reasoning from small data, and transferability to new domains. We examine the impact of knowledge representation, such as rules and semantic networks, language structure and relational structure, and whether implicit or explicit reasoning contributes to higher promise scores. We find that systems where logic is compiled into the neural network lead to the most NeSy goals being satisfied, while other factors such as knowledge representation, or type of neural architecture do not exhibit a clear correlation with goals being met. We find many discrepancies in how reasoning is defined, specifically in relation to human level reasoning, which impact decisions about model architectures and drive conclusions which are not always consistent across studies. Hence we advocate for a more methodical approach to the application of theories of human reasoning as well as the development of appropriate benchmarks, which we hope can lead to a better understanding of progress in the field. We make our data and code available on github for further analysis.1 1 https://github.com/kyleiwaniec/neuro-symbolic-ai-systematic-review https://github.com/kyleiwaniec/neuro-symbolic-ai-systematic-review Show more

Keywords: Neuro-symbolic artificial intelligence, natural language processing, deep learning, knowledge representation & reasoning, structured review

DOI: 10.3233/SW-223228

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-42, 2022

Get PDF

Understanding the structure of knowledge graphs with ABSTAT profiles

Authors: Spahiu, Blerina | Palmonari, Matteo | Alva Principe, Renzo Arturo | Rula, Anisa

Article Type: Research Article

Abstract: While there has been a trend in the last decades for publishing large-scale and highly-interconnected Knowledge Graphs (KGs), their users often get overwhelmed by the task of understanding their content as a result of their size and complexity. Data profiling approaches have been proposed to summarize large KGs into concise and meaningful representations, so that they can be better explored, processed, and managed. Profiles based on schema patterns represent each triple in a KG with its schema-level counterpart, thus covering the entire KG with profiles of considerable size. In this paper, we provide empirical evidence that profiles based on schema …patterns, if explored with suitable mechanisms, can be useful to help users understand the content of big and complex KGs. ABSTAT provides concise pattern-based profiles and comes with faceted interfaces for profile exploration. Using this tool we present a user study based on query completion tasks. We demonstrate that users who look at ABSTAT profiles formulate their queries better and faster than users browsing the ontology of the KGs. The latter is a pretty strong baseline considering that many KGs do not even come with a specific ontology to be explored by the users. To the best of our knowledge, this is the first attempt to investigate the impact of profiling techniques on tasks related to knowledge graph understanding with a user study. Show more

Keywords: Data understanding, data profiling, summarization, rdf, knowledge graph

DOI: 10.3233/SW-223181

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-27, 2023

Get PDF

ImageSchemaNet: A framester graph for embodied commonsense knowledge

Authors: De Giorgis, Stefano | Gangemi, Aldo | Gromann, Dagmar

Article Type: Research Article

Abstract: Commonsense knowledge is a broad and challenging area of research which investigates our understanding of the world as well as human assumptions about reality. Deriving directly from the subjective perception of the external world, it is intrinsically intertwined with embodied cognition. Commonsense reasoning is linked to human sense-making, pattern recognition and knowledge framing abilities. This work presents a new resource that formalizes the cognitive theory of image schemas. Image schemas are dynamic conceptual building blocks originating from our sensorimotor interactions with the physical world, and enable our sense-making cognitive activity to assign coherence and structure to entities, events and situations …we experience everyday. ImageSchemaNet is an ontology that aligns pre-existing resources, such as FrameNet, VerbNet, WordNet and MetaNet from the Framester hub, to image schema theory. This article describes an empirical application of ImageSchemaNet, combined with semantic parsers, on the task of annotating natural language sentences with image schemas. Show more

Keywords: Image schemas, cognitive semantics, frame semantics, commonsense reasoning

DOI: 10.3233/SW-223084

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-25, 2022

Get PDF

Food process ontology requirements

Article Type: Research Article

Abstract: People often value the sensual, celebratory, and health aspects of food, but behind this experience exists many other value-laden agricultural production, distribution, manufacturing, and physiological processes that support or undermine a healthy population and a sustainable future. The complexity of such processes is evident in both every-day food preparation of recipes and in industrial food manufacturing, packaging and storage, each of which depends critically on human or machine agents, chemical or organismal ingredient references, and the explicit instructions and implicit procedures held in formulations or recipes. An integrated ontology landscape does not yet exist to cover all the entities at …work in this farm to fork journey. It seems necessary to construct such a vision by reusing expert-curated fit-to-purpose ontology subdomains and their relationship, material, and more abstract organization and role entities. The challenge is to make this merger be, by analogy, one language, rather than nouns and verbs from a dozen or more dialects which cannot be used directly in statements about some aspect of the farm to fork journey without expensive translation or substantial dialect education in order to understand a particular text or domain of knowledge. This work focuses on the ontology components – object and data properties and annotations – needed to model food processes or more general process modelling within the context of the Open Biological and Biomedical Ontology Foundry and congruent ontologies. Ideally these components can be brought together in a general process ontology that can be specialized not only for the food domain but for carrying out other protocols as well. Many operations involved in food identification, preparation, transportation and storage – shaking, boiling, mixing, freezing, labeling, shipping – are actually common to activities from manufacturing and laboratory work to local or home food preparation. Show more

Keywords: Ontology, food processing, recipe, process modelling, OBO Foundry

DOI: 10.3233/SW-223096

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-32, 2022

Get PDF

Searching for explanations of black-box classifiers in the space of semantic queries

Authors: Liartis, Jason | Dervakos, Edmund | Menis-Mastromichalakis, Orfeas | Chortaras, Alexandros | Stamou, Giorgos

Article Type: Research Article

Abstract: Deep learning models have achieved impressive performance in various tasks, but they are usually opaque with regards to their inner complex operation, obfuscating the reasons for which they make decisions. This opacity raises ethical and legal concerns regarding the real-life use of such models, especially in critical domains such as in medicine, and has led to the emergence of the eXplainable Artificial Intelligence (XAI) field of research, which aims to make the operation of opaque AI systems more comprehensible to humans. The problem of explaining a black-box classifier is often approached by feeding it data and observing its behaviour. In …this work, we feed the classifier with data that are part of a knowledge graph, and describe the behaviour with rules that are expressed in the terminology of the knowledge graph, that is understandable by humans. We first theoretically investigate the problem to provide guarantees for the extracted rules and then we investigate the relation of “explanation rules for a specific class” with “semantic queries collecting from the knowledge graph the instances classified by the black-box classifier to this specific class”. Thus we approach the problem of extracting explanation rules as a semantic query reverse engineering problem. We develop algorithms for solving this inverse problem as a heuristic search in the space of semantic queries and we evaluate the proposed algorithms on four simulated use-cases and discuss the results. Show more

Keywords: Explainable AI (XAI), opaque machine learning classifiers, knowledge graphs, description logics, semantic query answering, reverse query answering, post-hoc explainability, explanation rules

DOI: 10.3233/SW-233469

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-42, 2023

Get PDF

What is in your cookie box? Explaining ingredients of web cookies with knowledge graphs

Article Type: Research Article

Abstract: The General Data Protection Regulation (GDPR) has imposed strict requirements for data sharing, one of which is informed consent. A common way to request consent online is via cookies. However, commonly, users accept online cookies being unaware of the meaning of the given consent and the following implications. Once consent is given, the cookie “disappears”, and one forgets that consent was given in the first place. Retrieving cookies and consent logs becomes challenging, as most information is stored in the specific Internet browser’s logs. To make users aware of the data sharing implied by cookie consent and to support transparency and …traceability within systems, we present a knowledge graph (KG) based tool for personalised cookie consent information visualisation. The KG is based on the OntoCookie ontology, which models cookies in a machine-readable format and supports data interpretability across domains. Evaluation results confirm that the users’ comprehension of the data shared through cookies is vague and insufficient. Furthermore, our work has resulted in an increase of 47.5% in the users’ willingness to be cautious when viewing cookie banners before giving consent. These and other evaluation results confirm that our cookie data visualisation approach and tool help to increase users’ awareness of cookies and data sharing. Show more

Keywords: Cookies, consent, GDPR, ontology, knowledge graph, data sharing, comprehension

DOI: 10.3233/SW-233435

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-17, 2023

Get PDF

Psychiq and Wwwyzzerdd: Wikidata completion using Wikipedia

Authors: Erenrich, Daniel

Article Type: Research Article

Abstract: Despite its size, Wikidata remains incomplete and inaccurate in many areas. Hundreds of thousands of articles on English Wikipedia have zero or limited meaningful structure on Wikidata. Much work has been done in the literature to partially or fully automate the process of completing knowledge graphs, but little of it has been practically applied to Wikidata. This paper presents two interconnected practical approaches to speeding up the Wikidata completion task. The first is Wwwyzzerdd, a browser extension that allows users to quickly import statements from Wikipedia to Wikidata. Wwwyzzerdd has been used to make over 100 thousand edits to Wikidata. …The second is Psychiq, a new model for predicting instance and subclass statements based on English Wikipedia articles. Psychiq’s performance and characteristics make it well suited to solving a variety of problems for the Wikidata community. One initial use is integrating the Psychiq model into the Wwwyzzerdd browser extension. Show more

Keywords: Wikidata, Wikipedia, browser extension, knowledge graph completion

DOI: 10.3233/SW-233450

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-14, 2023

Get PDF

ProVe: A pipeline for automated provenance verification of knowledge graphs against textual sources

Authors: Amaral, Gabriel | Rodrigues, Odinaldo | Simperl, Elena

Article Type: Research Article

Abstract: Knowledge Graphs are repositories of information that gather data from a multitude of domains and sources in the form of semantic triples, serving as a source of structured data for various crucial applications in the modern web landscape, from Wikipedia infoboxes to search engines. Such graphs mainly serve as secondary sources of information and depend on well-documented and verifiable provenance to ensure their trustworthiness and usability. However, their ability to systematically assess and assure the quality of this provenance, most crucially whether it properly supports the graph’s information, relies mainly on manual processes that do not scale with size. ProVe …aims at remedying this, consisting of a pipelined approach that automatically verifies whether a Knowledge Graph triple is supported by text extracted from its documented provenance. ProVe is intended to assist information curators and consists of four main steps involving rule-based methods and machine learning models: text extraction, triple verbalisation, sentence selection, and claim verification. ProVe is evaluated on a Wikidata dataset, achieving promising results overall and excellent performance on the binary classification task of detecting support from provenance, with 87.5 % accuracy and 82.9 % F1-macro on text-rich sources. The evaluation data and scripts used in this paper are available in GitHub and Figshare. Show more

Keywords: Fact verification, data verbalisation, knowledge graphs

DOI: 10.3233/SW-233467

Citation: Semantic Web, vol. Pre-press, no. Pre-press, pp. 1-34, 2023

Get PDF

Display: 10 | 50 | 100 items per page

Semantic Web - Volume Pre-press, issue Pre-press

Background knowledge in ontology matching: A survey

MTab4D: Semantic annotation of tabular data with DBpedia

Is neuro-symbolic AI meeting its promises in natural language processing? A structured review

Understanding the structure of knowledge graphs with ABSTAT profiles

ImageSchemaNet: A framester graph for embodied commonsense knowledge

Food process ontology requirements

Searching for explanations of black-box classifiers in the space of semantic queries

What is in your cookie box? Explaining ingredients of web cookies with knowledge graphs

Psychiq and Wwwyzzerdd: Wikidata completion using Wikipedia

ProVe: A pipeline for automated provenance verification of knowledge graphs against textual sources

North America

Europe

Asia