Abstract
Answering a query on XML data usually involves breaking it into a number of small components (e.g., edges, paths, twigs, etc.), evaluating them and joining the results. In this paper we propose an alternative technique that uses these components to filter a large part of the database that does not qualify them, before validating the query on the actual data. Our methodology uses a signature index to search fast and prune effectively the search space. The efficiency of the proposed technique is demonstrated by comparison with an existing index, on real data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Al-Khalifa, S., Jagadish, H.V., Koudas, N., Patel, J.M., Srivastava, D., Wu, Y.: Structural joins: A primitive for efficient xml query pattern matching. In: International Conference on Data Engineering. IEEE Computer Society Press, Los Alamitos (2002)
Bruno, N., Koudas, N., Srivastava, D.: Holistic twig joins: Optimal xml pattern matching. In: SIGMOD Conference. ACM Press, New York (2002)
Chung, C.-W., Min, J.-K., Shim, K.: Apex: An adaptive path index for xml data. In: SIGMOD Conference. ACM Press, New York (2002)
Goldman, R., Widom, J.: Dataguides: Enabling query formulation and optimization in semistructured databases. In: VLDB Conference, pp. 436–445. Morgan Kaufmann, San Francisco (1997)
Guttman, A.: R-trees: A dynamic index structure for spatial searching. In: Yormark, B. (ed.) SIGMOD Conference, pp. 47–57. ACM Press, New York (1984)
Kaushik, R., Bohannon, P., Naughton, J.F., Korth, H.F.: Covering indexes for branching path queries. In: SIGMOD Conference. ACM Press, New York (2002)
Kaushik, R., Shenoy, P., Bohannon, P., Gudes, E.: Exploiting local similarity for efficient indexing of paths in graph structured data. In: International Conference on Data Engineering. IEEE Computer Society Press, Los Alamitos (2002)
Ley, M.: Dblp computer science bibliography database, http://www.informatik.unitrier.de/ley/db/
Li, Q., Moon, B.: Indexing and querying xml data for regular path expressions. In: VLDB Conference, pp. 361–370. Morgan Kaufmann, San Francisco (2001)
Milo, T., Suciu, D.: Index structures for path expressions. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 277–295. Springer, Heidelberg (1998)
Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational databases for querying xml documents: Limitations and opportunities. In: VLDB Conference, pp. 302–314 (1999)
W3C. Xml path language (xpath), http://www.w3.org/TR/xpath
W3C. Xml query language (xquery), http://www.w3.org/TR/xquery
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lian, W., Mamoulis, N., Cheung, D.W. (2003). A Filter Index for Complex Queries on Semi-structured Data. In: Dong, G., Tang, C., Wang, W. (eds) Advances in Web-Age Information Management. WAIM 2003. Lecture Notes in Computer Science, vol 2762. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45160-0_40
Download citation
DOI: https://doi.org/10.1007/978-3-540-45160-0_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40715-7
Online ISBN: 978-3-540-45160-0
eBook Packages: Springer Book Archive