DOI: 10.1145/2351476.2351482
research-article

Schematron schema inference

Published: 08 August 2012

Abstract

In this paper we introduce a method to infer a Schematron schema from a set of XML documents. We analyze different aspects of Schematron schema generation. Since automatic schema inference for XML documents is not a new problem, we introduce only the single inference method used in our experimental implementation. The implementation generates a grammar using this inference method and allows the user to modify it. The grammar is then transformed into a Schematron schema by our algorithm. Experimental results are included in the paper.

Cited By

  • (2022) The Application of Directed Hyper-Graphs for Analysis of Models of Information Systems. Mathematics 10:5 (759). DOI: 10.3390/math10050759. Online publication date: 27-Feb-2022
  • (2022) Self-Adapting Design and Maintenance of Multi-Model Databases. Proceedings of the 26th International Database Engineered Applications Symposium (9-15). DOI: 10.1145/3548785.3548810. Online publication date: 22-Aug-2022

    Published In

    IDEAS '12: Proceedings of the 16th International Database Engineering & Applications Symposium
    August 2012
    261 pages
    ISBN:9781450312349
    DOI:10.1145/2351476
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

    Sponsors

    • Charles University
    • BytePress
    • Concordia University

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. XML schema inference
    2. schematron

    Qualifiers

    • Research-article

    Conference

    IDEAS '12
    Sponsor:
    • Charles University
    • Concordia University

    Acceptance Rates

    Overall Acceptance Rate 74 of 210 submissions, 35%
