DOI: 10.1145/3184558.3186963

Chisel: Sculpting Tabular and Non-Tabular Data on the Web

Published: 23 April 2018

Abstract

Chisel is a tool for flexible manipulation of CSV-like data, motivated by the recent effort of the World Wide Web Consortium (W3C) towards a recommendation for tabular data and metadata on the Web. In brief, Chisel supports an expressive built-in schema language for CSV-like data that can handle both tabular and non-tabular data. It also provides a simple programming language for transforming such data. In the demo, we showcase the system for specifying and validating schemas, building transformations, and setting up a pipeline that automatically converts "wild" CSV-like data into structured tabular data. We present use cases for Chisel that illustrate how easy it is to specify, modify, and understand Sculpt schemas, and to extract and transform data.
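
To make the notion of "wild" CSV-like data concrete, the following is a minimal sketch in plain Python. It does not use Chisel, Sculpt syntax, or Chisel's transformation language, none of which are shown in the abstract. The sample input, the function name `to_tabular`, and the rule used to pick out the header and data rows are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical "wild" CSV-like input: a title line, a filler row of empty
# cells, the actual table, and a trailing footnote.
WILD = """\
Quarterly sales report
,,
region,q1,q2
north,10,12
south,7,9
# source: internal
"""


def to_tabular(text):
    """Extract the embedded table as a list of dicts.

    Assumed rule (not Chisel's): the header is the first row with at least
    two cells, all non-empty; data rows are later rows with the same number
    of cells, all non-empty. Everything else is dropped.
    """
    header = None
    records = []
    for row in csv.reader(io.StringIO(text)):
        cells = [cell.strip() for cell in row]
        is_full = len(cells) >= 2 and all(cells)
        if header is None:
            if is_full:
                header = cells
            continue
        if is_full and len(cells) == len(header):
            records.append(dict(zip(header, cells)))
    return records


if __name__ == "__main__":
    for record in to_tabular(WILD):
        print(record)
```

The hard-coded rule here is exactly the kind of ad hoc logic that a declarative schema and a dedicated transformation language are meant to replace.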

Cited By

  • (2023) DIWIFT: Discovering Instance-wise Influential Features for Tabular Data. Proceedings of the ACM Web Conference 2023, 1673-1682. DOI: 10.1145/3543507.3583382. Online publication date: 30-Apr-2023

Published In

WWW '18: Companion Proceedings of the The Web Conference 2018
April 2018
2023 pages
ISBN:9781450356404
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from ACM.

Sponsors

  • IW3C2: International World Wide Web Conference Committee

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Author Tags

  1. CSV
  2. schema languages
  3. semi-structured data
  4. tabular data

Qualifiers

  • Demonstration

Funding Sources

  • Special Research Fund (BOF) of Hasselt University
  • Deutsche Forschungsgemeinschaft (DFG)

Conference

WWW '18: The Web Conference 2018
April 23 - 27, 2018
Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Article Metrics

  • Downloads (Last 12 months): 474
  • Downloads (Last 6 weeks): 51
Reflects downloads up to 07 Nov 2024
