skip to main content
10.1145/1007568.1007579acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Tree logical classes for efficient evaluation of XQuery

Published: 13 June 2004 Publication History

Abstract

XML is widely praised for its flexibility in allowing repeated and missing sub-elements. However, this flexibility makes it challenging to develop a bulk algebra, which typically manipulates sets of objects with identical structure. A set of XML elements, say of type book, may have members that vary greatly in structure, e.g. in the number of author sub-elements. This kind of heterogeneity may permeate the entire document in a recursive fashion: e.g., different authors of the same or different book may in turn greatly vary in structure. Even when the document conforms to a schema, the flexible nature of schemas for XML still allows such significant variations in structure among elements in a collection. Bulk processing of such heterogeneous sets is problematic.In this paper, we introduce the notion of logical classes (LC) of pattern tree nodes, and generalize the notion of pattern tree matching to handle node logical classes. This abstraction pays off significantly in allowing us to reason with an inherently heterogeneous collection of elements in a uniform, homogeneous way. Based on this, we define a Tree Logical Class (TLC) algebra that is capable of handling the heterogeneity arising in XML query processing, while avoiding redundant work. We present an algorithm to obtain a TLC algebra expression from an XQuery statement (for a large fragment of XQuery). We show how to implement the TLC algebra efficiently, introducing the nest-join as an important physical operator for XML query processing. We show that evaluation plans generated using the TLC algebra not only are simpler but also perform better than those generated by competing approaches. TLC is the algebra used in the Timber [8] system developed at the University of Michigan.

References

[1]
S. Al-Khalifa, H. V. Jagadish, N. Koudas, J. M. Patel, D. Srivastava, and Y. Wu. Structural joins: A primitive for efficient XML query pattern matching. In Proc. ICDE Conf., Mar. 2002.]]
[2]
S. Boag, D. Chamberlin, M. F. Fernandez, D. Florescu, J. Robie, and J. Simeon. XQuery 1.0: An XML query languge. Working Draft. http://www.w3.org/TR/xquery.]]
[3]
N. Bruno, D. Srivastava, and N. Koudas. Holistic twig joins: Optimal XML pattern matching. In Proc. SIGMOD Conf., 2002.]]
[4]
Z. Chen, H. V. Jagadish, L. V. S. Lakshmanan, and S. Paparizos. From tree patterns to generalized tree patterns: On efficient evaluation of XQuery. In Proc. VLDB Conf., Sep. 2003.]]
[5]
C. Zhang, J. Naughton, D. Dewitt, Q. Luo, and G. Lohman. On supporting containment queries in relational database management systems. In Proc. SIGMOD Conf., 2001.]]
[6]
D. DeHaan, D. Toman, M. P. Consens, and M. T. Ozsu. A comprehensive XQuery to SQL translation using dynamic interval encoding. In Proc. SIGMOD Conf., Jun. 2003.]]
[7]
D. Florescu and D. Kossman. Storing and querying XML data using an RDMBS. IEEE Data Eng. Bull., 22(3), 1999.]]
[8]
H. V. Jagadish, S. Al-Khalifa, A. Chapman, L. V. S. Lakshmanan, A. Nierman, S. Paparizos, J. M. Patel, D. Srivastava, N. Wiwatwattana, Y. Wu, and C. Yu. Timber: A native XML database. VLDB Journal, 11(4), 2002.]]
[9]
H. V. Jagadish, L. V. S. Lakshmanan, D. Srivastava, and K. Thompson. TAX: A tree algebra for XML. In Proc. DBPL Conf., Sep. 2001.]]
[10]
B. Ludascher, Y. Papakonstantinou, and P. Velikhov. Navigation-driven evaluation of virtual mediated views. In Proc. EDBT Conf., Mar. 2000.]]
[11]
U. of Michigan. The Timber project. http://www.eecs.umich.edu/db/timber.]]
[12]
U. of Wisconsin. The Niagara internet query system. http://www.cs.wisc.edu/niagara/.]]
[13]
A. R. Schmidt, F. Waas, M. L. Kersten, M. J. Carey, I. Manolescu, and R. Busse. XMark: A benchmark for XML data management. In Proc. VLDB Conf., 2002.]]
[14]
H. Schoning. Tamino - A DBMS designed for XML. In Proc. ICDE Conf., 2001.]]
[15]
J. Shanmugasundaram, K. Tufte, C. Zhang, G. He, D. J. DeWitt, and J. Naughton. Relational databases for querying XML documents: Limitations and opportunities. In Proc. VLDB Conf., 1999.]]
[16]
J. Simeon and M. F. Fernandez. Galax, an open implementation of XQuery. http://db.bell-labs.com/galax/.]]
[17]
I. Tatarinov, S. Viglas, K. Beyer, J. Shanmugasundaram, E. Shekita, and C. Zhang. Storing and querying ordered XML using a relational database system. In Proc. SIGMOD Conf., 2002.]]
[18]
S. D. Viglas, L. Galanis, D. J. DeWitt, D. Maier, and J. F. Naughtonn. Putting XML query algebras into context. http://www.cs.wisc.edu/niagara/.]]
[19]
Y. Wu, J. M. Patel, and H. V. Jagadish. Structural join order selection for XML query optimization. In Proc. ICDE Conf., Mar. 2003.]]
[20]
X-Hive Corp. X-Hive/DB native XML storage. http://www.x-hive.com/.]]
[21]
XMark, an XML benchmark project. http://www.xml-benchmark.org/.]]
[22]
X. Zhang, B. Pielech, and E. A. Rundensteier. Honey, i shrunk the XQuery!---an XML algebra optimization approach. In Workshop on Web Information and Data Management, 2002.]]

Cited By

View all
  • (2025)Algebraic Operations on Fuzzy Spatiotemporal XML Data and Querying Fuzzy Spatiotemporal Data Using XQueryFuzzy Spatiotemporal XML Data Management10.1007/978-3-031-81033-6_5(167-223)Online publication date: 26-Feb-2025
  • (2021)Demythization of Structural XML Query Processing: Comparison of Holistic and Binary ApproachesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.294615733:4(1439-1452)Online publication date: 1-Apr-2021
  • (2019)Answering Queries Using Views, Second EditionSynthesis Lectures on Data Management10.2200/S00884ED2V01Y201811DTM05414:3(1-275)Online publication date: 15-Apr-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data
June 2004
988 pages
ISBN:1581138598
DOI:10.1145/1007568
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 June 2004

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGMOD/PODS04
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Algebraic Operations on Fuzzy Spatiotemporal XML Data and Querying Fuzzy Spatiotemporal Data Using XQueryFuzzy Spatiotemporal XML Data Management10.1007/978-3-031-81033-6_5(167-223)Online publication date: 26-Feb-2025
  • (2021)Demythization of Structural XML Query Processing: Comparison of Holistic and Binary ApproachesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.294615733:4(1439-1452)Online publication date: 1-Apr-2021
  • (2019)Answering Queries Using Views, Second EditionSynthesis Lectures on Data Management10.2200/S00884ED2V01Y201811DTM05414:3(1-275)Online publication date: 15-Apr-2019
  • (2019)An Algebra for Fuzzy Spatiotemporal Data in XMLIEEE Access10.1109/ACCESS.2019.28982287(22914-22926)Online publication date: 2019
  • (2019)Querying XML documents using Prolog enginesInformation Processing and Management: an International Journal10.1016/j.ipm.2019.05.01156:5(1753-1770)Online publication date: 1-Sep-2019
  • (2018)XQuery ProcessorsEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_800(4846-4852)Online publication date: 7-Dec-2018
  • (2017)XQuery ProcessorsEncyclopedia of Database Systems10.1007/978-1-4899-7993-3_800-2(1-7)Online publication date: 19-Jan-2017
  • (2016)Spatiotemporal Query Algebra Based on Native XMLHandbook of Research on Innovative Database Query Processing Techniques10.4018/978-1-4666-8767-7.ch010(275-293)Online publication date: 2016
  • (2015)PAXQuery: Efficient Parallel Processing of Complex XQueryIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2015.239111027:7(1977-1991)Online publication date: 1-Jul-2015
  • (2015)Cost-based holistic twig joinsInformation Systems10.1016/j.is.2015.03.00452:C(21-33)Online publication date: 1-Aug-2015
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media