skip to main content
10.1145/1031453.1031462acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Ctree: a compact tree for indexing XML data

Published: 12 November 2004 Publication History

Abstract

In this paper, we propose a novel compact tree (Ctree) for XML indexing, which provides not only concise path summaries at the group level but also detailed child-parent links at the element level. Group level mapping allows efficient pruning of a large search space while element level mapping provides fast access to the parent of an element. Due to the tree nature of XML data and queries, such fast child-to-parent access is essential for efficient XML query processing. Using group-based element reference, Ctree enables the clustering of inverted lists according to groups, which provides efficient join between inverted lists and structural index group extents. Our experiments reveal that Ctree is efficient for processing both single-path and branching queries with various value predicates.

References

[1]
S. Abiteboul, P. Buneman, and D. Suciu. Data on the web: from relations to semistructured data and XML. Morgan Kaufmann Publishers, Los Altos, USA, 1999.]]
[2]
A. Arion, A. Bonifati, G. Costa and S. D Aguanno. Ioana Manolescu, Andrea Pugliese: Efficient Query Evaluation over Compressed XML Data. EDBT, 2004.]]
[3]
Q. Chen, A. Lim, and K. Ong. D(k)-Index: An adaptive Structural summary for graph-structured data. ACM SIGMOD, 2003. S.-Y. Chien, Z. Vagena, D. Zhang, V. J. Tsotras, and C. Zaniolo. Efficient Structural Joins on Indexed XML Documents, VLDB, 2002.]]
[4]
C. Chung, J. Min, and K.Shim. APEX: An adaptive path index for XML data. ACM SIGMOD, 2002.]]
[5]
B. Cooper, N. Sample, M. J. Franklin, G. R. Hjaltason, and M. Shadmon. A fast index for semistructured data. VLDB, 2001.]]
[6]
R. Goldman and J. Widom. Dataguides: Enabling query formulation and optimization in semistructured databases. VLDB, 1997.]]
[7]
R. Kaushik, P.Bohannon, J. Naughton, and H. Korth. Covering indexes for branching path queries. ACM SIGMOD, 2002.]]
[8]
R. Kaushik, P. Shenoy, P. Bohannon, and E. Gudes. Exploiting Local Similarity for Indexing Paths in Graph-Structured Data. ICDE, 2002.]]
[9]
R. Kaushik, P. Bohannon, J. F Naughton, and P. Shenoy. Updates for Structure Indexes. VLDB, 2002.]]
[10]
R. Kaushik, R. Krishnamurthy, J. F. Naughton and R. Ramakrishnan. On the Integration of Structure Indexes and Inverted Lists. SIGMOD, 2004.]]
[11]
Michael Ley. DBLP database web site. http://www.informatik.uni-trier.de/ley/db.]]
[12]
Q. Li and B.Moon. Indexing and querying XML data for regular path expressions. VLDB, 2001.]]
[13]
H. Jiang, H. Lu, W. Wang, and B.C. Ooi. XR-Tree: Indexing XML Data for Efficient Structural Joins. ICDE, 2003.]]
[14]
T. Milo, and D. Suciu. Index structures for path expression. ICDT, 1999.]]
[15]
S. Nestorov, J. Ullman, J. Wiener, and S. Chawathe. Representative objects: concise representations of semistructured, hierarchical data. ICDE, 1997.]]
[16]
P. Rao, and B. Moon. PRIX: Indexing and querying XML using Prunfer sequences, ICDE, 2004.]]
[17]
D. Srivastava, S. Al-Khalifa, H. V. Jagadish, N. Koudas, J. M. Patel, and Y. Wu. Structural joins: A primitive for efficient XML query pattern matching. ICDE, 2002.]]
[18]
I. Tatarinov, Z.G. Ives, A.Y. Halevy, D.S. Weld. Updating XML. SIGMOD, 2001.]]
[19]
H. Wang, S. Park, W. Fan, and P. S Yu. ViST: A dynamic index method for querying XML data by tree structures. SIGMOD, 2003.]]
[20]
F. Weigel, H. Meuss, F. Bry and K. U. Schulz. Content-Aware DataGuides: Interleaving IR and DB Indexing Techniques for Efficient Retrieval of Textual XML Data. ECIR, 2004.]]
[21]
M. Yoshikawa, T. Amagasa, T. Shimura, and S. Uemura. XRel: A path-based approach to storage and retrieval of XML documents using relational databases. ACM Transaction on Internet Technology, 1(1):110--141, August 2001.]]
[22]
C. Zhang, J. Naughton, D. DeWitt, Q. Luo, and G. Lohman. On supporting containment queries in relational database management systems. ACM SIGMOD, 2001.]]
[23]
S. Liu, Q. Zou, W. Chu. Configurable Indexing and Ranking for XML Information Retrieval. ACM SIGIR, 2004.]]
[24]
XMARK(The XML-benchmark project) http://monetdb.cwi.nl/xml.]]
[25]
INEX(Initiative for the Evaluation of XML Retrieval) <http://inex.is.informatik.uni-duisburg.de:2003/>.]]

Cited By

View all
  • (2025)InforTest: Informer-Based Testing for Applications in the Internet of Robotic ThingsIEEE Transactions on Industrial Informatics10.1109/TII.2024.348570721:2(1499-1507)Online publication date: Feb-2025
  • (2022)An Efficient Prefix-Based Labeling Scheme for XML Dynamic Updates Using Hexagonal PatternIEEE Access10.1109/ACCESS.2022.317843810(57107-57123)Online publication date: 2022
  • (2021)VerSaChIProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482217(2812-2816)Online publication date: 26-Oct-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WIDM '04: Proceedings of the 6th annual ACM international workshop on Web information and data management
November 2004
168 pages
ISBN:1581139780
DOI:10.1145/1031453
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 November 2004

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Ctree
  2. XML index
  3. XQuery evaluation
  4. path summary
  5. value index

Qualifiers

  • Article

Conference

CIKM04
Sponsor:
CIKM04: Conference on Information and Knowledge Management
November 12 - 13, 2004
Washington DC, USA

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)InforTest: Informer-Based Testing for Applications in the Internet of Robotic ThingsIEEE Transactions on Industrial Informatics10.1109/TII.2024.348570721:2(1499-1507)Online publication date: Feb-2025
  • (2022)An Efficient Prefix-Based Labeling Scheme for XML Dynamic Updates Using Hexagonal PatternIEEE Access10.1109/ACCESS.2022.317843810(57107-57123)Online publication date: 2022
  • (2021)VerSaChIProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482217(2812-2816)Online publication date: 26-Oct-2021
  • (2020)ChiSeLProceedings of the VLDB Endowment10.14778/3401960.340196413:10(1654-1668)Online publication date: 1-Jun-2020
  • (2020)Exploring XML Index Structures and Evaluating C-Tree Index-based Algorithm2020 3rd International Conference on Intelligent Sustainable Systems (ICISS)10.1109/ICISS49785.2020.9316052(212-218)Online publication date: 3-Dec-2020
  • (2018)Automata Approach to XML Data IndexingInformation10.3390/info90100129:1(12)Online publication date: 6-Jan-2018
  • (2017)Neighbor-Aware Search for Approximate Labeled Graph Matching using the Chi-Square StatisticsProceedings of the 26th International Conference on World Wide Web10.1145/3038912.3052561(1281-1290)Online publication date: 3-Apr-2017
  • (2017)Performance evaluation of various data structures in building efficient indexing schemes for XML documents2017 Tenth International Conference on Contemporary Computing (IC3)10.1109/IC3.2017.8284351(1-3)Online publication date: Aug-2017
  • (2014)Semantic-based Structural and Content indexing for the efficient retrieval of queries over large XML data repositoriesFuture Generation Computer Systems10.1016/j.future.2014.02.01037(212-231)Online publication date: Jul-2014
  • (2013)Semantic-based construction of content and structure XML indexProceedings of the Twenty-Fourth Australasian Database Conference - Volume 13710.5555/2525416.2525423(61-70)Online publication date: 29-Jan-2013
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media