Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2745754.2745774acmconferencesArticle/Chapter ViewAbstractPublication PagespodsConference Proceedingsconference-collections
research-article

BonXai: Combining the simplicity of DTD with the expressiveness of XML Schema

Published: 20 May 2015 Publication History

Abstract

While the migration from DTD to XML Schema was driven by a need for increased expressivity and flexibility, the latter was also significantly more complex to use and understand. Whereas DTDs are characterized by their simplicity, XML Schema Definitions (XSDs) are notoriously difficult. In this paper, we introduce the XML specification language BonXai which possesses most features of XSDs, including its expressivity, while retaining the simplicity of DTDs. In brief, the latter is achieved by sacrificing the explicit use of types in favor of simple patterns expressing contexts for elements. The goal of BonXai is by no means to replace XML Schema, but rather to provide a simpler DTD-like alternative to schema designers that do not need the explicit use of types. Therefore, BonXai can be seen as a practical front-end for XML Schema. A particular strong point of BonXai is its solid foundation rooted in a decade of theoretical work around pattern-based schemas. We present in detail the formal model for BonXai and discuss translation algorithms to and from XML Schema.

References

[1]
G. J. Bex, F. Neven, and J. Van den Bussche. DTDs versus XML Schema: A practical study. In International Workshop on the Web and Databases (WebDB), pages 79--84, 2004.
[2]
A. Brüggemann-Klein and D. Wood. One-unambiguous regular languages. Information and Computation, 142(2):182--206, 1998.
[3]
P. Caron, Y. Han, and L. Mignot. Generalized one-unambiguity. In International Conference on Developments in Language Theory (DLT), pages 129--140, 2011.
[4]
C. S. Coen, P. Marinelli, and F. Vitali. Schemapath, a minimal extension to XML Schema for conditional constraints. In International World Wide Web Conference (WWW), pages 164--174, 2004.
[5]
W. Czerwi\'nski, W. Martens, and T. Masopust. Efficient separability of regular languages by subsequences and suffixes. In International Colloquium on Automata, Languages and Programming (ICALP), pages 150--161, 2013.
[6]
DSD. Document structure description (DSD). http://www.brics.dk/DSD/, 2002.
[7]
A. Ehrenfeucht and H. P. Zeiger. Complexity measures for regular expressions. Journal of Computer and System Sciences, 12(2):134--146, 1976.
[8]
D. Fiorello, N. Gessa, P. Marinelli, and F. Vitali. DTD
[9]
2.0: Adding support for co-constraints. In Extreme Markup Languages, 2004.
[10]
S. Gao, C. Sperberg-McQueen, H. Thompson, N. Mendelsohn, D. Beech, and M. Maloney. W3C XML Schema definition language (XSD) 1.1 part 1: Structures. www.w3.org/TR/2012/REC-xmlschema11--1--20120405/, April 2012.
[11]
W. Gelade and F. Neven. Succinctness of pattern-based schema languages for XML. Journal of Computer and System Sciences, 77(3):505--519, 2011.
[12]
P. Hofman and W. Martens. Separability by short subsequences and subwords. In International Conference on Database Theory (ICDT), 2015.
[13]
G. Kasneci and T. Schwentick. The complexity of reasoning about pattern-based XML schemas. In International Symposium on Principles of Database Systems (PODS), pages 155--164, 2007.
[14]
K. Losemann, W. Martens, and M. Niewerth. Descriptional complexity of deterministic regular expressions. In International Symposium on Mathematical Foundations of Computer Science (MFCS), pages 643--654, 2012.
[15]
W. Martens, V. Mattick, M. Niewerth, S. Agarwal, N. Douib, O. Garbe, D. Günther, D. Oliana, J. Kroniger, F. Lücke, T. Melikoglu, K. Nordmann, G. Özen, T. Schlitt, L. Schmidt, J. Westhoff, and D. Wolff. Design of the BonXai schema language. Available at www.theoinf.uni-bayreuth.de/ download/bonxai-spec.pdf, Manuscript 2014.
[16]
W. Martens, F. Neven, M. Niewerth, and T. Schwentick. Developing and analyzing XSDs through bonXai. International Conference on Very Large Data Bases (VLDB), 5(12):1994--1997, 2012.
[17]
W. Martens, F. Neven, and T. Schwentick. Simple off the shelf abstractions of XML Schema. Sigmod RECORD, 36(3):15--22, 2007.
[18]
W. Martens, F. Neven, T. Schwentick, and G. J. Bex. Expressiveness and complexity of XML Schema. ACM Transactions on Database Systems, 31(3):770--813, 2006.
[19]
W. Martens and J. Niehren. On the minimization of XML Schemas and tree automata for unranked trees. Journal of Computer and System Sciences, 73(4):550--583, 2007.
[20]
A. Møller and M. Schwartzbach. An introduction to XML and web technologies. Addison-Wesley, 2006.
[21]
M. Murata, D. Lee, M. Mani, and K. Kawaguchi. Taxonomy of XML schema languages using formal language theory. ACM Transactions on Internet Technology, 5(4):660--704, 2005.
[22]
T. Place, L. van Rooijen, and M. Zeitoun. Separating regular languages by piecewise testable and unambiguous languages. In International Symposium on Mathematical Foundations of Computer Science (MFCS), pages 729--740, 2013.
[23]
RelaxNG. Relax NG specification. http://www.relaxng.org/spec-20011203.html, 2001.
[24]
Schematron. Schematron. http://www.schematron.com/, 1999.
[25]
C. Sperberg-McQueen and H. Thompson. XML Schema. http://www.w3.org/XML/Schema, 2005.

Cited By

View all
  • (2022)Validating RDF DataundefinedOnline publication date: 23-Mar-2022
  • (2020)Inferring Restricted Regular Expressions with Interleaving from Positive and Negative SamplesAdvances in Knowledge Discovery and Data Mining10.1007/978-3-030-47436-2_58(769-781)Online publication date: 11-May-2020
  • (2018)Transformation of fuzzy spatiotemporal data from XML to object-oriented databaseEarth Science Informatics10.1007/s12145-018-0342-x11:3(449-461)Online publication date: 24-Mar-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PODS '15: Proceedings of the 34th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
May 2015
358 pages
ISBN:9781450327572
DOI:10.1145/2745754
  • General Chair:
  • Tova Milo,
  • Program Chair:
  • Diego Calvanese
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 May 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bonxai
  2. schema language
  3. xml

Qualifiers

  • Research-article

Funding Sources

Conference

SIGMOD/PODS'15
Sponsor:
SIGMOD/PODS'15: International Conference on Management of Data
May 31 - June 4, 2015
Victoria, Melbourne, Australia

Acceptance Rates

PODS '15 Paper Acceptance Rate 25 of 80 submissions, 31%;
Overall Acceptance Rate 642 of 2,707 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Validating RDF DataundefinedOnline publication date: 23-Mar-2022
  • (2020)Inferring Restricted Regular Expressions with Interleaving from Positive and Negative SamplesAdvances in Knowledge Discovery and Data Mining10.1007/978-3-030-47436-2_58(769-781)Online publication date: 11-May-2020
  • (2018)Transformation of fuzzy spatiotemporal data from XML to object-oriented databaseEarth Science Informatics10.1007/s12145-018-0342-x11:3(449-461)Online publication date: 24-Mar-2018
  • (2017)BonXaiACM Transactions on Database Systems10.1145/310596042:3(1-42)Online publication date: 24-Aug-2017
  • (2017)Deciding Universality of ptNFAs is PSpace-CompleteSOFSEM 2018: Theory and Practice of Computer Science10.1007/978-3-319-73117-9_29(413-427)Online publication date: 22-Dec-2017

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media