Abstract
Writing correct and precise XPath queries needs much effort from users: the user must be familiar with the complex structure of the queried XML documents and has to compose queries, which must be syntactically and semantically correct and precise. Incorrect queries select no data and thus lead to highly inefficient processing of queries. Unprecise queries might select more data than what the user really wants and thus might lead to unnecessarily high processing and transportation costs. Therefore, we propose a schema-based approach to the satisfiability test and to the refinement of users’ XPath queries. Our schema-based approach checks whether or not an XPath query conforms to the constraints given in the schema, rewrites and refines the XPath query according to the information of the schema. If an XPath query does not conform to the constraints given in the schema, its results will be every time an empty node set, which is a hint for semantic errors in the XPath query. Our rewriting approach for XPath queries replaces wildcards with specific node tests, replaces recursive axes with non-recursive axes, eliminates reverse axes, and redundant location steps. Thus, our rewriting approach generates a query, which contains more information, and can be more easily refined by the user in comparison to the original query. Our performance analysis shows the optimization potential of avoiding the evaluation of unsatisfiable XPath queries and of processing rewritten and refined XPath queries.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amer-Uahis, S., Cho, S., Laksmanan, L.K.S., Srivastava, D.: Minimization of tree pattern queries. In: SIGMOD 2001 (2001)
Benedikt, M., Fan, W., Geerts, F.: XPath Satisfiability in the presence of DTDs. In: PODS 2005 (2005)
Brown, A., Fuchs, M., Robie, J., Wadler, P.: MSL: A model for W3C XML Schema. In: Proceedings International WWW Conference, Hong-Kong (2001)
Chan, C.Y., Fan, W., Zeng, Y.: Taming XPath Queries by Minimizing Wildcard Steps. In: VLDB 2004 (2004)
Fan, W., Chan, C., Garofalakis, M.: Secure XML querying with security views. In: SIGMOD 2004 (2004)
Fan, W., Yu, J.X., Lu, H., Lu, J., Zeng, Y.: Query Translation from XPath to SQL in the Presence of Recursive DTDs. In: VLDB 2005 (2005)
Franceschet, M.: XPathMark – An XPath benchmark for XMark. Research report PP-2005-04, University of Amsterdam, the Netherlands (2005)
Groppe, S.: XML Query Reformulation for XPath, XSLT and XQuery. Sierke-Verlag, Göttingen, Germany (2005) ISBN 3-933893-24-0
Groppe, J., Groppe, S.: A Prototype of a Schema-Based XPath Satisfiability Tester. In: Bressan, S., Küng, J., Wagner, R. (eds.) DEXA 2006. LNCS, vol. 4080, pp. 93–103. Springer, Heidelberg (2006)
Groppe, J., Groppe, S.: Filtering Unsatisfiabile XPath Queries. In: ICEIS 2006 (2006)
Groppe, S., Böttcher, S., Groppe, J.: XPath Query Simplification with regard to the Elimination of Intersect and Except Operators. In: XSDM 2006 (2006)
Hidders, J.: Satisfiability of xPath expressions. In: Lausen, G., Suciu, D. (eds.) DBPL 2003. LNCS, vol. 2921, pp. 21–36. Springer, Heidelberg (2004)
Kwong, A., Gertz, M.: Schema-based optimization of XPath expressions. Techn. Report University of California (2002)
Lakshmanan, L., Ramesh, G., Wang, H., Zhao, Z.: On Testing Satisfiability of Tree Pattern Queries. In: VLDB 2004 (2004)
Martens, W., Neven, F.: Fronties of tractability for typechecking simple XML transformations. In: VLDB 2004 (2004)
Olteanu, D., Meuss, H., Furche, T., Bry, F.: XPath: Looking Forward. XML-Based Data Management (XMLDM), EDBT Workshops (2002)
Ramanan, P.: Efficient algorithms for minimizing tree pattern queries. In: SIGMOD 2002 (2002)
Schmidt, D.A.: The structure of Typed programming languages. MIT Press, Cambridge (1994)
Wadler, P.: Two semantics for XPath. Tech. Report (2000)
Wood, P.T.: Minimising Simple XPath Expressions. In: WebDB 2001 (2001)
W3C: XML Schema Part 1: Structures Second Edition. W3C Recommendation (2004), www.w3.org/TR/xmlschema-1
W3C: XPath Version 1.0, W3C Recommendation (1999), www.w3.org/TR/xpath/
W3C: XPath Version 2.0, W3C Working Draft (2003), www.w3.org/TR/xpath20/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Groppe, J., Groppe, S. (2006). Satisfiability-Test, Rewriting and Refinement of Users’ XPath Queries According to XML Schema Definitions. In: Manolopoulos, Y., Pokorný, J., Sellis, T.K. (eds) Advances in Databases and Information Systems. ADBIS 2006. Lecture Notes in Computer Science, vol 4152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11827252_5
Download citation
DOI: https://doi.org/10.1007/11827252_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37899-0
Online ISBN: 978-3-540-37900-3
eBook Packages: Computer ScienceComputer Science (R0)