Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/645505.656442guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Incremental Validation of XML Documents

Published: 08 January 2003 Publication History

Abstract

We investigate the incremental validation of XML documents with respect to DTDs and XML Schemas, under updates consisting of element tag renamings, insertions and deletions. DTDs are modeled as extended context-free grammars and XML Schemas are abstracted as "specialized DTDs", allowing to decouple element types from element tags. For DTDs, we exhibit an O ( m log n ) incremental validation algorithm using an auxiliary structure of size O ( n ), where n is the size of the document and m the number of updates. For specialized DTDs, we provide an O ( m log 2 n ) incremental algorithm, again using an auxiliary structure of size O ( n ). This is a significant improvement over brute-force re-validation from scratch.

References

[1]
A. Bruggemann-Klein, M. Murata, and D. Wood. Regular tree and regular hedge languages over non-ranked alphabets. HKUST-TCSC-2001-05, HKUST 2001. Available at http://www.cs.ust.hk/tcsc/RR/2001-05.ps.gz.
[2]
A. Bruggemann-Klein and D. Wood. One-unambiguous regular languages. Information and Computation , 142(2):182-206, 1998.
[3]
C. Beeri and T. Milo. Schemas for integration and translation of structured and semi-structured data. In Int'l. Conf. on Database Theory , pages 296- 313, 1999.
[4]
S. Cluet, C. Delobel, J. Simeon, and K. Smaga. Your mediators need data conversion! In Proc. ACM SIGMOD , 177-188, 1998.
[5]
T. Cormen and C. Leiserson and R. Rivest. Introduction to Algorithms, Mc Graw-Hill, 1992.
[6]
G. Dong and J. Su. Space-bounded foies. In Proc. ACM PODS , 139-150, 1995.
[7]
C. Ghezzi and D. Mandrioli. Augmenting parsers to support incrementality. JACM , 27(3), 1980.
[8]
H. Garcia-Molina, J. Ullman, and J. Widom. Database Systems: The Complete Book . Prentice Hall, 2001.
[9]
B. Hesse and N. Immerman. Complete problems for dynamic complexity classes. Proc. IEEE LICS , 313-322, 2002.
[10]
F. Jalili and J. Gallier. Building friendly parsers. In Proc. ACM POPL , 1982.
[11]
J. Larcheveque. Optimal incremental parsing. ACM Transactions on Programming Languages and Systems , 17(1), 1995.
[12]
W. Li. A simple and efficient incremental LL(1) parsing. In Theory and Practice of Informatics , 1995.
[13]
G. Linden. Incremental updates in structured documents, 1993. Licentiate Thesis, Report C-1993-19, Department of Computer Science, University of Helsinki.
[14]
M. Lohrey. On the parallel complexity of tree automata. In Proceedings of the 12th RTA, LNCS 2051 , 2001.
[15]
A. Murching, Y. Prasant, and Y. Srikant. Incremental recursive descent parsing. Computer Languages , 15(4), 1990.
[16]
P.B. Miltersen, S. Subramanian, J.S. Vitter, and R. Tamassia. Complexity models for incremental computation. TCS , 130(1):203-236, 1994.
[17]
F. Neven. Automata, logic and XML. In Computer Science Logic , 2-26, 2002.
[18]
L. Petrone. Reusing batch parsers as incremental parsers. In Proc. FSTTCS , 1995.
[19]
S. Patnaik and N. Immerman. Dyn-FO: A parallel, dynamic complexity class. JCSS , 55(2), 1997.
[20]
Y. Papakonstantinou and V. Vianu. DTD inference for views of XML data. In Proc. ACM PODS , 35-46, 2000.
[21]
L. Segoufin. Personal communication, 2002.
[22]
H. Vollmer. Introduction to Circuit Complexity . Springer Verlag, 1999.
[23]
W3C. The extensible markup language (XML), 1998. W3C Recomendation available at http://www.w3c.org/XML.
[24]
W3C. XML schema definition, 2001. W3C Recomendation available at http://www.w3c.org/XML/Schema.
[25]
T. Wagner and S. Graham. Efficient and flexible incremental parsing. ACM Transactions on Programming Languages and Systems , 20(2), 1998.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDT '03: Proceedings of the 9th International Conference on Database Theory
January 2003
454 pages

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 08 January 2003

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Database Principles and Challenges in Text AnalysisACM SIGMOD Record10.1145/3484622.348462450:2(6-17)Online publication date: 31-Aug-2021
  • (2009)Running tree automata on probabilistic XMLProceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems10.1145/1559795.1559831(227-236)Online publication date: 29-Jun-2009
  • (2008)XFlatInformation Sciences: an International Journal10.1016/j.ins.2007.09.017178:3(774-787)Online publication date: 1-Feb-2008
  • (2007)Incremental validation of string-based XML data in databases, file systems, and streamsProceedings of the 11th East European conference on Advances in databases and information systems10.5555/1780119.1780152(314-329)Online publication date: 29-Sep-2007
  • (2007)Efficient Revalidation of XML DocumentsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2007.100419:4(554-567)Online publication date: 1-Apr-2007
  • (2007)Weaving temporal and reliability aspects into a schema tapestryData & Knowledge Engineering10.1016/j.datak.2007.04.00663:3(752-773)Online publication date: 1-Dec-2007
  • (2006)On Simplification of Database Integrity ConstraintsFundamenta Informaticae10.5555/2369336.236933871:4(371-417)Online publication date: 1-Dec-2006
  • (2006)On Simplification of Database Integrity ConstraintsFundamenta Informaticae10.5555/1227517.122751971:4(371-417)Online publication date: 1-Mar-2006
  • (2006)Schema-mediated exchange of temporal XML dataProceedings of the 25th international conference on Conceptual Modeling10.1007/11901181_17(212-227)Online publication date: 6-Nov-2006
  • (2006)Efficient incremental validation of XML documents after composite updatesProceedings of the 4th international conference on Database and XML Technologies10.1007/11841920_8(107-121)Online publication date: 10-Sep-2006
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media