DOI: 10.1145/2660517.2660538
research-article

DataID: towards semantically rich metadata for complex datasets

Published: 04 September 2014

    Abstract

    The constantly growing number of Linked Open Data (LOD) datasets creates a need for rich metadata descriptions that enable users to discover, understand and process the available data. This metadata is often created, maintained and stored in diverse data repositories with disparate data models, which are often unable to provide the metadata necessary to process the described datasets automatically. This paper proposes DataID, a best practice for LOD dataset descriptions that uses RDF files hosted together with the datasets, under the same domain. We describe the data model, which is based on the widely used DCAT and VoID vocabularies, as well as supporting tools to create and publish DataIDs and use cases that show the benefits of providing semantically rich metadata for complex datasets. As a proof of concept, we generated a DataID for the DBpedia dataset, which we present in the paper.
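
    To make the idea of an RDF description built from DCAT and VoID terms more concrete, the following is a minimal sketch of how such a file might be generated. It is not the DataID vocabulary or the authors' tooling: the helper `describe_dataset`, the `example.org` URLs, the file name `dataid.ttl` and the triple count are all hypothetical, and only well-known DCAT, VoID and Dublin Core terms are used.

```python
# Namespaces of the vocabularies named in the abstract (DCAT, VoID),
# plus Dublin Core terms for title and license.
PREFIXES = {
    "dcat": "http://www.w3.org/ns/dcat#",
    "void": "http://rdfs.org/ns/void#",
    "dct": "http://purl.org/dc/terms/",
}


def escape_literal(text):
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def describe_dataset(base, title, license_uri, triples, dump_url, media_type):
    """Return a Turtle string describing one dataset with one distribution.

    Hypothetical helper: it only illustrates combining DCAT and VoID terms
    and does not reproduce the DataID data model from the paper.
    """
    lines = [f"@prefix {prefix}: <{ns}> ." for prefix, ns in PREFIXES.items()]
    lines.append("")
    lines += [
        # The same resource is typed with both vocabularies.
        f"<{base}#dataset> a dcat:Dataset, void:Dataset ;",
        f'    dct:title "{escape_literal(title)}" ;',
        f"    dct:license <{license_uri}> ;",
        f"    void:triples {int(triples)} ;",
        f"    dcat:distribution <{base}#dump> .",
        "",
        # One downloadable distribution of the dataset.
        f"<{base}#dump> a dcat:Distribution ;",
        f"    dcat:downloadURL <{dump_url}> ;",
        f'    dcat:mediaType "{escape_literal(media_type)}" .',
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # All values below are made-up placeholders, not figures from the paper.
    print(describe_dataset(
        base="http://example.org/dataid.ttl",
        title="Example dataset",
        license_uri="http://creativecommons.org/licenses/by/4.0/",
        triples=1000,
        dump_url="http://example.org/dump.nt",
        media_type="application/n-triples",
    ))
```

    Serving the resulting file from the same domain as the data dump, as the abstract suggests, would let a client find both the description and the distribution in one place.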

    Published In

    SEM '14: Proceedings of the 10th International Conference on Semantic Systems
    September 2014
    161 pages
    ISBN: 9781450329279
    DOI: 10.1145/2660517
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    • St. Pölten University: St. Pölten University of Applied Sciences, Austria
    • University of Potsdam: University of Potsdam
    • PoolParty: PoolParty (Semantic Web Company GmbH)
    • University of Vienna: University of Vienna
    • Wolters Kluwer: Wolters Kluwer, Germany
    • Semantic Web Company: Semantic Web Company
    • STII: STI International
    • DBpedia Association: DBpedia Association

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. dbpedia
    2. dcat
    3. documentation
    4. metadata
    5. provenance
    6. void

    Qualifiers

    • Research-article

    Conference

    SEM '14

    Acceptance Rates

    SEM '14 paper acceptance rate: 22 of 59 submissions (37%)
    Overall acceptance rate: 22 of 59 submissions (37%)

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (last 12 months): 17
    • Downloads (last 6 weeks): 7

    Citations

    Cited By

    • (2024) The Big Data Value Chain for the Provision of AI-Enabled Energy Analytics Services. Machine Learning Applications for Intelligent Energy Management, pp. 29-51. DOI: 10.1007/978-3-031-47909-0_2. Online publication date: 28-Jan-2024
    • (2022) A New Approach for Assessing Metadata Completeness in Open Data Portals. International Journal of Electronic Government Research 18:1, pp. 1-20. DOI: 10.4018/IJEGR.313636. Online publication date: 1-Jan-2022
    • (2022) Paving the way for enriched metadata of linguistic linked data. Semantic Web 13:6, pp. 1133-1157. DOI: 10.3233/SW-222994. Online publication date: 26-Sep-2022
    • (2019) Benchmarking question answering systems. Semantic Web 10:2, pp. 293-304. DOI: 10.3233/SW-180312. Online publication date: 21-Jan-2019
    • (2018) GERBIL – Benchmarking Named Entity Recognition and Linking consistently. Semantic Web 9:5, pp. 605-625. DOI: 10.3233/SW-170286. Online publication date: 1-Jan-2018
    • (2018) Wikidata through the eyes of DBpedia. Semantic Web 9:4, pp. 493-503. DOI: 10.3233/SW-170277. Online publication date: 1-Jan-2018
    • (2018) Linked Web APIs dataset. Semantic Web 9:4, pp. 381-391. DOI: 10.3233/SW-170259. Online publication date: 29-Jun-2018
    • (2017) IDOL. Proceedings of the 13th International Conference on Semantic Systems, pp. 49-56. DOI: 10.1145/3132218.3132238. Online publication date: 11-Sep-2017
    • (2017) Linked Thesauri Quality Assessment and Documentation for Big Data Discovery. 2017 International Conference on High Performance Computing & Simulation (HPCS), pp. 37-44. DOI: 10.1109/HPCS.2017.16. Online publication date: Jul-2017
    • (2017) A Conceptual Building-Block and Practical OpenStreetMap-Interface for Sharing References to Hydrologic Features. Advances in Human Factors, Sustainable Urban Planning and Infrastructure, pp. 137-148. DOI: 10.1007/978-3-319-60450-3_14. Online publication date: 13-Jun-2017
