Skip to main content

A Filter Index for Complex Queries on Semi-structured Data

  • Conference paper
Advances in Web-Age Information Management (WAIM 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2762))

Included in the following conference series:

  • 425 Accesses

Abstract

Answering a query on XML data usually involves breaking it into a number of small components (e.g., edges, paths, twigs, etc.), evaluating them and joining the results. In this paper we propose an alternative technique that uses these components to filter a large part of the database that does not qualify them, before validating the query on the actual data. Our methodology uses a signature index to search fast and prune effectively the search space. The efficiency of the proposed technique is demonstrated by comparison with an existing index, on real data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Al-Khalifa, S., Jagadish, H.V., Koudas, N., Patel, J.M., Srivastava, D., Wu, Y.: Structural joins: A primitive for efficient xml query pattern matching. In: International Conference on Data Engineering. IEEE Computer Society Press, Los Alamitos (2002)

    Google Scholar 

  2. Bruno, N., Koudas, N., Srivastava, D.: Holistic twig joins: Optimal xml pattern matching. In: SIGMOD Conference. ACM Press, New York (2002)

    Google Scholar 

  3. Chung, C.-W., Min, J.-K., Shim, K.: Apex: An adaptive path index for xml data. In: SIGMOD Conference. ACM Press, New York (2002)

    Google Scholar 

  4. Goldman, R., Widom, J.: Dataguides: Enabling query formulation and optimization in semistructured databases. In: VLDB Conference, pp. 436–445. Morgan Kaufmann, San Francisco (1997)

    Google Scholar 

  5. Guttman, A.: R-trees: A dynamic index structure for spatial searching. In: Yormark, B. (ed.) SIGMOD Conference, pp. 47–57. ACM Press, New York (1984)

    Google Scholar 

  6. Kaushik, R., Bohannon, P., Naughton, J.F., Korth, H.F.: Covering indexes for branching path queries. In: SIGMOD Conference. ACM Press, New York (2002)

    Google Scholar 

  7. Kaushik, R., Shenoy, P., Bohannon, P., Gudes, E.: Exploiting local similarity for efficient indexing of paths in graph structured data. In: International Conference on Data Engineering. IEEE Computer Society Press, Los Alamitos (2002)

    Google Scholar 

  8. Ley, M.: Dblp computer science bibliography database, http://www.informatik.unitrier.de/ley/db/

  9. Li, Q., Moon, B.: Indexing and querying xml data for regular path expressions. In: VLDB Conference, pp. 361–370. Morgan Kaufmann, San Francisco (2001)

    Google Scholar 

  10. Milo, T., Suciu, D.: Index structures for path expressions. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 277–295. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  11. Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational databases for querying xml documents: Limitations and opportunities. In: VLDB Conference, pp. 302–314 (1999)

    Google Scholar 

  12. W3C. Xml path language (xpath), http://www.w3.org/TR/xpath

  13. W3C. Xml query language (xquery), http://www.w3.org/TR/xquery

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lian, W., Mamoulis, N., Cheung, D.W. (2003). A Filter Index for Complex Queries on Semi-structured Data. In: Dong, G., Tang, C., Wang, W. (eds) Advances in Web-Age Information Management. WAIM 2003. Lecture Notes in Computer Science, vol 2762. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45160-0_40

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45160-0_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40715-7

  • Online ISBN: 978-3-540-45160-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics