skip to main content
10.1145/584931.584936acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Honey, I shrunk the XQuery!: an XML algebra optimization approach

Published:08 November 2002Publication History

ABSTRACT

A lot of work is being done in the database community on mapping of XML data into and out of relational database systems, specifically, the query processing over such data using XQuery. We discuss our solution, the XML Algebra Tree (XAT), as part of our larger XML management system called Rainbow.Rainbow uses XQuery to describe the loading and extracting of XML data into relational systems and also for the execution of queries against pre-defined XML views of that stored data. The XML algebra tree of the query against those views is merged with the queries that define the views to form a larger tree. Because the XML formatting operators are interleaved with the computation operators, this XAT must then be optimized before being translated into one or more SQL statements that can be executed on the database. SQL translation is composed of computation pushdown and SQL generation.The computation pushdown splits the tree into the XML-specific and SQL-doable operators, which is then going to be converted into SQL statements. However, the XAT after computation pushdown may contain unreferenced columns or unused operators. Leaving these operators in the tree will create unnecessarily large SQL statements and will slow down the overall execution.Our main contributions to XML query processing, outlined in this paper, are threefold. One, we describe an algebra based on XATs for modeling XQuery expressions. Two, we propose rewriting rules to optimize XQueries by XAT operator cancel out. Lastly, we show a cutting algorithm to remove unreferenced columns and operators from the trees. We have fully implemented the techniques discussed in this paper in the Rainbow system. A preliminary experimental study compares the performance of execution before and after operator cancel out and cutting.

References

  1. T. Bray, J. Paoli, and C. M. Sperberg-McQueen. Extensaible Markup Language (XML) 1.0. http://www.w3.org/TR/REC-xml, Feburary 1998.Google ScholarGoogle Scholar
  2. M. J. Carey, J. Kiernan, J. Shanmugasundaram, E. J. Shekita, and S. N. Subramanian. XPERANTO: Middleware for Publishing Object-Relational Data as XML Documents. In The VLDB Journal, pages 646--648, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. DB2 UDB XML Extender. XML Extender Administration and Programming. http://www 4.ibm.com/software/data/db2/extenders/xmlext/library.html, December 1999.Google ScholarGoogle Scholar
  4. DSRG. Rainbow: Relational Database Auto-Tuning for Efficient XML Query Processing. http://davis.wpi.edu/dsrg/rainbow, 2002.Google ScholarGoogle Scholar
  5. M. F. Fernandez, A. Morishima, D. Suciu, and W. C. Tan. Publishing Relational Data in XML: the SilkRoute Approach. IEEE Data Engineering Bulletin, 24(2):12--19, 2001.Google ScholarGoogle Scholar
  6. H. Garcia-Molina, J. Ullman, and J. Widom. Database System Implementation. Prentice-Hall, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. Manolescu, D. Florescu, D. Kossmann, F. Xhumari, and D. Olteanu. Agora: Living with xml and relational. In A. E. Abbadi, M. L. Brodie, S. Chakravarthy, U. Dayal, N. Kamel, G. Schlageter, and K.-Y. Whang, editors, VLDB 2000, Proceedings of 26th International Conference on Very Large Data Bases, September 10-14, 2000, Cairo, Egypt, pages 623--626. Morgan Kaufmann, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Microsoft Corp. Microsoft sql server. http://www.microsoft.com.Google ScholarGoogle Scholar
  9. J. Naughton, D. DeWitt, D. Maier, and J. C. etc. The Niagara Internet Query System. IEEE Data Engineering Bulletin, 24(2):27--33, 2001.Google ScholarGoogle Scholar
  10. Oracle Technologies Network. Using XML in Oracle Database Applications. http://technet.oracle.com/tech/xml/htdocs/about_oracle_xml_products.htm, November 1999.Google ScholarGoogle Scholar
  11. A. Sahuguet. Kweelt: More than just "yet another framework to query xml!". In Demo Session Proceedings of SIGMOD'01, page 602, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. W3C. XML Query Use Cases. http://www.w3.org/TR/xmlquery-use-cases, 2002.Google ScholarGoogle Scholar
  13. W3C. XQuery 1.0 Formal Semantics. http://www.w3.org/TR/query-semantics, 2002.Google ScholarGoogle Scholar
  14. X. Zhang, G. Mitchell, W.-C. Lee, and E. A. Rundensteiner. Clock: Synchronizing Internal Relational Storage with External XML Documents. In RIDE-DM, pages 111--118, April 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. X. Zhang, M. Mulchandani, S. Christ, B. Murphy, and E. A. Rundensteiner. Rainbow: Mapping-Driven XQuery Processing System. In Demo Session Proceedings of SIGMOD'02, page 614, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Zhang, B. Pielech, and E. A. Rundensteiner. XAT Optimization. Technical report, Worcester Polytechnic Institute, 2002. to appear.Google ScholarGoogle Scholar
  17. X. Zhang and E. A. Rundensteiner. XAT: XML Algebra for the Rainbow System. Technical Report WPI-CS-TR-02-24, Worcester Polytechnic Institute, July 2002.Google ScholarGoogle Scholar

Index Terms

  1. Honey, I shrunk the XQuery!: an XML algebra optimization approach

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          WIDM '02: Proceedings of the 4th international workshop on Web information and data management
          November 2002
          116 pages
          ISBN:1581135939
          DOI:10.1145/584931
          • Program Chairs:
          • Roger Chiang,
          • Ee-Peng Lim

          Copyright © 2002 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 8 November 2002

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader