Caching Frequent XML Query Patterns

Zhan, Xin; Li, Jianzhong; Wang, Hongzhi; He, Zhenying

doi:10.1007/11610496_8

Xin Zhan²¹,
Jianzhong Li²¹,
Hongzhi Wang²¹ &
…
Zhenying He²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3842))

Included in the following conference series:

Asia-Pacific Web Conference

574 Accesses
1 Citations

Abstract

As XML becomes prevailing on the Internet, efficient management of XML queries becomes more important. Caching frequent queries can expedite XML query processing. In this paper, we propose a framework to address an NP-hard optimization problem, caching frequent query patterns. We develop several algorithms to respectively generate query subpatterns, check query containment, and choose query subpatterns for caching. Experimental results show that our algorithms are efficient and scalable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

W3C. XQuery 1.0: An XML Query Language (April 2005)
Google Scholar
Chen, L., Rundensteiner, E.A.: A Fine-Grained Replacement Strategy for XML Query Cache. In: WIDW, MeLean, Virginia (2002)
Google Scholar
Chen, L., Rundensteiner, E.A.: ACE-XQ: A CachE-aware XQuery Answering System. In: Proc. of WebDB, Madison, WI, pp. 31–36 (2002)
Google Scholar
Dar, S., Franklin, M.J., Jonsson, B.: Semantic Data Caching and Replacement. In: VLDB, Bombay, India, pp. 330–341 (1996)
Google Scholar
Haas, L.M., Kossmann, D., Ursu, I.: Loading a Cache with Query Results. In: Proceedings of the 25^th VLDB Conference, Edinburgh, Scotland (1999)
Google Scholar
Chidlovskii, B., Broghoff, U.M.: Semantic Caching of Web Queries. VLDB Journal 9(1), 2–12 (2000)
Article Google Scholar
Wu, Y., Patel, J.M., Jagadish, H.V.: Estimating Answer Sizes for XML Queries. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, p. 590. Springer, Heidelberg (2002)
Chapter Google Scholar
Miklau, G., Suciu, D.: Containment and Equivalence for an XPath Fragment. In: Proc. of the 21^st ACM SIGACT-SIGMOD-SIGART Symp. on Principles of Database Systems (PODS), Madison, Wisconsin, USA, June 3-5 (2002)
Google Scholar
Flesca, S., Furfaro, F., Masciari, E.: On the minimization of XPath queries. In: VLDB, Berlin, Germany (2003)
Google Scholar
W3C. XPath 1.0: XML Path Language (November 1999), http://www.w3.org/TR/xpath
Dehaspe, L., Toivonen, H., King, R.D.: Finding Frequent Substructures in Chemical Compounds. In: Proc. of ACM SIGKDD, pp. 30–36 (1998)
Google Scholar
Kuramochi, M., Karypis, G.: Frequent Subgrapf Discovery. In: IEEE Int. Conference on Data Mining, pp. 313–320 (2001)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: VLDB (September 1994)
Google Scholar
Zaki, M.: Efficiently Mining Frequent Trees in a Forest. In: ACM SIGMOD (2002)
Google Scholar
Yang, L.N., Lee, M.L., Hsu, W.: Efficient Mining of XML Query Patterns for Caching. In: VLDB, Berlin, Germany (2003)
Google Scholar
Jagadish, H.V., Lakshmanan, L.V.S., Milo, T., Srivastava, D., Vista, D.: Querying network directories. In: Proceedings of the ACM SIGMOD Conference on Management of Data, Philadelphia, PA (June 1999)
Google Scholar
Beoder, A.: On the Resemblance and Containment of Documents. In: IEEE SEQUENCES 1997, pp. 21–29 (1998)
Google Scholar
Chen, Z., Korn, F., Koudas, N., Muthukrishnan, S., Ng, R.T., Srivastava, D.: Counting Twig Matches in a Tree. In: ICDE (2001)
Google Scholar
Yannakakis, M.: Algorithm for acyclic database scheme. In: Zaniolo, Delovel (eds.) VLDB, Los Altos CA. Morgan Kaufman pubs., San Francisco (1981)
Google Scholar
Amer-Yahia, S., Choo, S., Lakshmanan, L.V.S., Srivastava, D.: Minimization of tree pattern queries. In: SIGMOD (2001)
Google Scholar
Milo, T., Suciu, D.: Index structures for path expressions. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 277–295. Springer, Heidelberg (1998)
Chapter Google Scholar
Appromixation Algorithms. Springer, Heidelberg (2001)
Google Scholar
DBLP data set, Available at: http://www.informatik.uni-trier.de/ley/db/index.html

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Harbin Institute of Technology, China
Xin Zhan, Jianzhong Li, Hongzhi Wang & Zhenying He

Authors

Xin Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Jianzhong Li
View author publications
You can also search for this author in PubMed Google Scholar
Hongzhi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenying He
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD, Australia
Heng Tao Shen
School of Computer Science and Technology, Heilongjiang University, P.O. Box, 150080, Harbin, China
Jinbao Li
Department of Computer Science and Engineering, Shanghai Jiatong University, 80 Dongcuan Road, 200240, Shanghai, China
Minglu Li
Department of Computer Science, College of Liberal Arts and Science, University of Iowa, 52242, Iowa City, IA, USA
Jun Ni
UNC Chapel Hill,
Wei Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhan, X., Li, J., Wang, H., He, Z. (2006). Caching Frequent XML Query Patterns. In: Shen, H.T., Li, J., Li, M., Ni, J., Wang, W. (eds) Advanced Web and Network Technologies, and Applications. APWeb 2006. Lecture Notes in Computer Science, vol 3842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11610496_8

Download citation

DOI: https://doi.org/10.1007/11610496_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31158-4
Online ISBN: 978-3-540-32435-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics