Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2887352.2887361guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Defining and executing assessment tests on linked data for statistical analysis

Published: 23 October 2011 Publication History
  • Get Citation Alerts
  • Abstract

    Currently there is a strong trend for governmental agencies to publish statistical data as Linked Data (e.g. Eurostat, data.gov.uk). Unfortunately, these published datasets are still very diverse in their structure, making the analysis very complicated and technical. In this paper, we analyze datasets according to defined assessment tests and exploit both domain knowledge and the inherent semantic annotations. Therefore we scan existing datasets for known patterns that signify e.g., typical numerical data blocks or potentially temporal and geographical dimensions. Thus, Linked Data is made evaluable for a possible usage in standard statistical analysis tools. This allows researchers to use statistical data from diverse Linked Data sources for analysis with only a minimum of technical expertise used for integration of the data.

    References

    [1]
    Gregory, A., Vardigan, M.: The Web of Linked Data. Realizing the Potential for the Social Sciences (2010), http://odaf.org/papers/201010_Gregory_Arofan_186.pdf
    [2]
    Heath, T., Bizer, C.: Linked Data: Evolving the Web into a Global Data Space (1st edition). Synthesis Lectures on the Semantic Web: Theory and Technology, 1:1, 1-136. Morgan & Claypool (2011)
    [3]
    Dodds, L., Davis, I.: Linked Data Patterns. A pattern catalogue for modelling, publishing, and consuming Linked Data, http://patterns.dataincubator.org/book/
    [4]
    W3C RDF Validation Service, http://www.w3.org/RDF/Validator/
    [5]
    University of Manchester: OWL Validator, http://owl.cs.manchester.ac.uk/validator/
    [6]
    Vapour, a Linked Data Validator, http://vapour.sourceforge.net/
    [7]
    Berners-Lee, T. (2006): Design Issues: Linked Data. http://www.w3.org/DesignIssues/LinkedData.html
    [8]
    Pedantic Web Group, http://pedantic-web.org/
    [9]
    Hogan, A., Harth, A., Passant, A., Decker, S., Polleres, A.: Weaving the Pedantic Web. 3rd International Workshop on Linked Data on the Web (LDOW2010). Workshop at the 19th International World Wide Web Conference, CEUR (2010)
    [10]
    Hausenblas, M., Halb, W., Raimond, Y., Feigenbaum, L., Ayers, D.: SCOVO: Using Statistics on the Web of Data. In: Proceedings of the 6th European Semantic Web Conference: Research and Applications (Heraklion, Crete, Greece) pp. 708--722 (2009)
    [11]
    Cyganiak, R., Reynolds, D., Tennison, J.: The RDF Data Cube vocabulary (2011), http://publishing-statistical-data.googlecode.com/svn/trunk/specs/src/main/html/cube.html
    [12]
    Hartig, O.: Provenance Information in the Web of Data. In Proceedings of the Linked Data on the Web (LDOW). Workshop at the World Wide Web Conference (WWW), Madrid, Spain (2009)
    [13]
    Fürber, C., Hepp, M.: Using SPARQL and SPIN for Data Quality Management on the Semantic Web, in: BIS 2010. Proceedings of the 13th International Conference on Business Information Systems, Berlin, Germany, Springer LNBIP Vol 47, pp. 35-46 (2010)
    [14]
    Prud'hommeaux, E., Seaborne, A.: SPARQL Query Language for RDF. W3C Recommendation (2008), http://www.w3.org/TR/rdf-sparql-query/
    [15]
    Knublauch, H.: SPIN - Modeling Vocabulary (2011), http://spinrdf.org/spin.html
    [16]
    Fürber, C., Hepp, M.: Towards a Vocabulary for Data Quality Management in Semantic Web Architectures, in: Proceedings of the 1st International Workshop on Linked Web Data Management (LWDM 2011), in conjunction with the 14th International Conference on Extending Database Technology (EDBT 2011), Uppsala, Sweden (2011)
    [17]
    Venkata Narasimha Pavan Kappara, Ryutaro Ichise, O. P. Vyas: LiDDM: A Data Mining System for Linked Data. In Proceedings of the LDOW2011 (2011)
    [18]
    SPARQL client for R: http://cran.r-project.org/web/packages/SPARQL/
    [19]
    DCMI Metadata Terms, http://dublincore.org/documents/2010/10/11/dcmi-terms/
    [20]
    Isele, R., Jentzsch, A., Bizer, C.: Silk Server - Adding missing Links while consuming Linked Data. 1st International Workshop on Consuming Linked Data (COLD 2010), Shanghai, November (2010)
    [21]
    SERIMI: RDF Interlinking. https://github.com/samuraraujo/SERIMI-RDF-Interlinking

    Cited By

    View all
    • (2014)Semantic Representation and Computation of Cloud-Based Customer Relationship Management SolutionsProceedings of the Confederated International Workshops on On the Move to Meaningful Internet Systems: OTM 2014 Workshops - Volume 884210.1007/978-3-662-45550-0_35(347-357)Online publication date: 27-Oct-2014

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    COLD'11: Proceedings of the Second International Conference on Consuming Linked Data - Volume 782
    October 2011
    142 pages
    • Editors:
    • Olaf Hartig,
    • Andreas Harth,
    • Juan Sequeda

    Publisher

    CEUR-WS.org

    Aachen, Germany

    Publication History

    Published: 23 October 2011

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2014)Semantic Representation and Computation of Cloud-Based Customer Relationship Management SolutionsProceedings of the Confederated International Workshops on On the Move to Meaningful Internet Systems: OTM 2014 Workshops - Volume 884210.1007/978-3-662-45550-0_35(347-357)Online publication date: 27-Oct-2014

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media