DOI: 10.1145/2539150.2539240
research-article

SPARQL Endpoint Metrics for Quality-Aware Linked Data Consumption

Published: 02 December 2013

Abstract

    In recent years, dozens of publicly accessible Linked Data repositories containing vast amounts of knowledge presented in the Resource Description Framework (RDF) format have been set up worldwide. By utilizing the SPARQL query language, users can consume, integrate, and present data from a federation of sources for different application scenarios. However, several challenges arise for distributed query processing across multiple SPARQL endpoints, such as devising suitable query optimization or result caching strategies.
    For implementing these techniques, one crucial aspect lies in determining appropriate endpoint features. In this work, we introduce several metrics that enable universal and fine-grained characterization of arbitrary Linked Data repositories. We present comprehensive approaches for deriving these metrics and validate them through extensive evaluation on real-world SPARQL endpoints. Finally, we discuss possible implications of our findings for data consumers.

        Published In

        IIWAS '13: Proceedings of International Conference on Information Integration and Web-based Applications & Services
        December 2013
        753 pages
        ISBN:9781450321136
        DOI:10.1145/2539150
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        In-Cooperation

        • @WAS: International Organization of Information Integration and Web-based Applications and Services

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. Distributed query processing
        2. Linked Data
        3. Quality of Service metrics
        4. RDF
        5. SPARQL
        6. Semantic Web

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        IIWAS '13
