Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/584931.584936acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Honey, I shrunk the XQuery!: an XML algebra optimization approach

Published: 08 November 2002 Publication History
  • Get Citation Alerts
  • Abstract

    A lot of work is being done in the database community on mapping of XML data into and out of relational database systems, specifically, the query processing over such data using XQuery. We discuss our solution, the XML Algebra Tree (XAT), as part of our larger XML management system called Rainbow.Rainbow uses XQuery to describe the loading and extracting of XML data into relational systems and also for the execution of queries against pre-defined XML views of that stored data. The XML algebra tree of the query against those views is merged with the queries that define the views to form a larger tree. Because the XML formatting operators are interleaved with the computation operators, this XAT must then be optimized before being translated into one or more SQL statements that can be executed on the database. SQL translation is composed of computation pushdown and SQL generation.The computation pushdown splits the tree into the XML-specific and SQL-doable operators, which is then going to be converted into SQL statements. However, the XAT after computation pushdown may contain unreferenced columns or unused operators. Leaving these operators in the tree will create unnecessarily large SQL statements and will slow down the overall execution.Our main contributions to XML query processing, outlined in this paper, are threefold. One, we describe an algebra based on XATs for modeling XQuery expressions. Two, we propose rewriting rules to optimize XQueries by XAT operator cancel out. Lastly, we show a cutting algorithm to remove unreferenced columns and operators from the trees. We have fully implemented the techniques discussed in this paper in the Rainbow system. A preliminary experimental study compares the performance of execution before and after operator cancel out and cutting.

    References

    [1]
    T. Bray, J. Paoli, and C. M. Sperberg-McQueen. Extensaible Markup Language (XML) 1.0. http://www.w3.org/TR/REC-xml, Feburary 1998.
    [2]
    M. J. Carey, J. Kiernan, J. Shanmugasundaram, E. J. Shekita, and S. N. Subramanian. XPERANTO: Middleware for Publishing Object-Relational Data as XML Documents. In The VLDB Journal, pages 646--648, 2000.
    [3]
    DB2 UDB XML Extender. XML Extender Administration and Programming. http://www 4.ibm.com/software/data/db2/extenders/xmlext/library.html, December 1999.
    [4]
    DSRG. Rainbow: Relational Database Auto-Tuning for Efficient XML Query Processing. http://davis.wpi.edu/dsrg/rainbow, 2002.
    [5]
    M. F. Fernandez, A. Morishima, D. Suciu, and W. C. Tan. Publishing Relational Data in XML: the SilkRoute Approach. IEEE Data Engineering Bulletin, 24(2):12--19, 2001.
    [6]
    H. Garcia-Molina, J. Ullman, and J. Widom. Database System Implementation. Prentice-Hall, 2000.
    [7]
    I. Manolescu, D. Florescu, D. Kossmann, F. Xhumari, and D. Olteanu. Agora: Living with xml and relational. In A. E. Abbadi, M. L. Brodie, S. Chakravarthy, U. Dayal, N. Kamel, G. Schlageter, and K.-Y. Whang, editors, VLDB 2000, Proceedings of 26th International Conference on Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, pages 623--626. Morgan Kaufmann, 2000.
    [8]
    Microsoft Corp. Microsoft sql server. http://www.microsoft.com.
    [9]
    J. Naughton, D. DeWitt, D. Maier, and J. C. etc. The Niagara Internet Query System. IEEE Data Engineering Bulletin, 24(2):27--33, 2001.
    [10]
    Oracle Technologies Network. Using XML in Oracle Database Applications. http://technet.oracle.com/tech/xml/htdocs/about_oracle_xml_products.htm, November 1999.
    [11]
    A. Sahuguet. Kweelt: More than just "yet another framework to query xml!". In Demo Session Proceedings of SIGMOD'01, page 602, 2001.
    [12]
    W3C. XML Query Use Cases. http://www.w3.org/TR/xmlquery-use-cases, 2002.
    [13]
    W3C. XQuery 1.0 Formal Semantics. http://www.w3.org/TR/query-semantics, 2002.
    [14]
    X. Zhang, G. Mitchell, W.-C. Lee, and E. A. Rundensteiner. Clock: Synchronizing Internal Relational Storage with External XML Documents. In RIDE-DM, pages 111--118, April 2001.
    [15]
    X. Zhang, M. Mulchandani, S. Christ, B. Murphy, and E. A. Rundensteiner. Rainbow: Mapping-Driven XQuery Processing System. In Demo Session Proceedings of SIGMOD'02, page 614, 2002.
    [16]
    X. Zhang, B. Pielech, and E. A. Rundensteiner. XAT Optimization. Technical report, Worcester Polytechnic Institute, 2002. to appear.
    [17]
    X. Zhang and E. A. Rundensteiner. XAT: XML Algebra for the Rainbow System. Technical Report WPI-CS-TR-02-24, Worcester Polytechnic Institute, July 2002.

    Cited By

    View all
    • (2019)Querying XML documents using Prolog enginesInformation Processing and Management: an International Journal10.1016/j.ipm.2019.05.01156:5(1753-1770)Online publication date: 1-Sep-2019
    • (2018)Incremental View Maintenance for Property Graph QueriesProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3183724(1843-1845)Online publication date: 27-May-2018
    • (2016)Semistructured Models, Queries and Algebras in the Big Data EraProceedings of the 2016 International Conference on Management of Data10.1145/2882903.2912573(2229-2233)Online publication date: 26-Jun-2016
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WIDM '02: Proceedings of the 4th international workshop on Web information and data management
    November 2002
    116 pages
    ISBN:1581135939
    DOI:10.1145/584931
    • Program Chairs:
    • Roger Chiang,
    • Ee-Peng Lim
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 November 2002

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. XML
    2. XQuery
    3. algebra
    4. operator
    5. optimization
    6. relational

    Qualifiers

    • Article

    Conference

    CIKM02

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2019)Querying XML documents using Prolog enginesInformation Processing and Management: an International Journal10.1016/j.ipm.2019.05.01156:5(1753-1770)Online publication date: 1-Sep-2019
    • (2018)Incremental View Maintenance for Property Graph QueriesProceedings of the 2018 International Conference on Management of Data10.1145/3183713.3183724(1843-1845)Online publication date: 27-May-2018
    • (2016)Semistructured Models, Queries and Algebras in the Big Data EraProceedings of the 2016 International Conference on Management of Data10.1145/2882903.2912573(2229-2233)Online publication date: 26-Jun-2016
    • (2015)PAXQuery: Efficient Parallel Processing of Complex XQueryIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2015.239111027:7(1977-1991)Online publication date: 1-Jul-2015
    • (2014)A Survey on XML FragmentationACM SIGMOD Record10.1145/2694428.269443443:3(24-35)Online publication date: 4-Dec-2014
    • (2012)Optimising XML---RDF data integrationProceedings of the 9th international conference on The Semantic Web: research and applications10.1007/978-3-642-30284-8_67(838-843)Online publication date: 27-May-2012
    • (2011)TraCXProceedings of the 28th British national conference on Advances in databases10.5555/2075914.2075933(182-193)Online publication date: 12-Jul-2011
    • (2011)TraCX: Transformation of Compressed XMLAdvances in Databases10.1007/978-3-642-24577-0_19(182-193)Online publication date: 2011
    • (2008)Towards a physical XML independent XQuery/SQL/XML engineProceedings of the VLDB Endowment10.14778/1454159.14541771:2(1356-1367)Online publication date: 1-Aug-2008
    • (2007)Processing global XQuery queries based on static query decompositionProceedings of the 5th international conference on Parallel and Distributed Processing and Applications10.5555/2395970.2396005(326-336)Online publication date: 29-Aug-2007
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media