Efficiently Coding and Indexing XML Document

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3453)

Abstract

This paper presents a novel and efficient numbering scheme that combines label path information with data path information and can efficiently support all kinds of queries. A compact index structure, named HiD, is also proposed, and query algorithms based on this index structure are introduced. Finally, comprehensive experiments are conducted to evaluate the proposed techniques.

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Han, Z., Xi, C., Le, J. (2005). Efficiently Coding and Indexing XML Document. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_14

  • DOI: https://doi.org/10.1007/11408079_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25334-1

  • Online ISBN: 978-3-540-32005-0

  • eBook Packages: Computer Science, Computer Science (R0)
