Abstract
We propose a practical approach to the progressive processing of (FWR) XQuery queries on multiple XML streams, called Twig’n Join (or TnJ). The query is decomposed into a query plan combining several twig queries on the individual streams, followed by a multi-way join and a final twig query. The processing is itself accordingly decomposed into three pipelined stages progressively producing streams of XML fragments. Twig’n Join combines the advantages of the recently proposed TwigM algorithm and our previous work on relational result-rate based progressive joins. In addition, we introduce a novel dynamic probing technique, called Result-Oriented Probing (ROP), which determines an optimal probing sequence for the multi-way join. This significantly reduces the amount of redundant probing for results. We comparatively evaluate the performance of Twig’n Join using both synthetic and real-life data from standard XML query processing benchmarks. We show that Twig’n Join is indeed effective and efficient for processing multiple XML streams.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hong, M., Demers, A., Gehrke, J., Koch, C., Riedewald, M., White, W.: Massively multi-query join processing in publish/subscribe systems. In: SIGMOD, pp. 761–772 (2007)
Tao, Y., Yiu, M.L., Papadias, D., Hadjieleftheriou, M., Mamoulis, N.: RPJ: Producing fast join results on streams through rate-based optimization. In: SIGMOD, pp. 371–382 (2005)
Tok, W.H., Bressan, S., Lee, M.-L.: RRPJ: Result-rate based progressive relational join. In: Kotagiri, R., Radha Krishna, P., Mohania, M., Nantajeewarawat, E. (eds.) DASFAA 2007. LNCS, vol. 4443, pp. 43–54. Springer, Heidelberg (2007)
Tok, W.H., Bressan, S., Lee, M.L.: Twig’n join: Progressive query processing of multiple Xml streams. Technical Report TRA9/07, National University of Singapore (2007)
Florescu, D., Hillery, C., Kossmann, D., Lucas, P., Riccardi, F., Westmann, T., Carey, M.J., Sundararajan, A., Agrawal, G.: The bea/xqrl streaming xquery processor. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 997–1008. Springer, Heidelberg (2004)
Peng, F., Chawathe, S.S.: Xpath queries on streaming data. In: SIGMOD, pp. 431–442 (2003)
Li, X., Agrawal, G.: Efficient evaluation of xquery over streaming data. In: VLDB, pp. 265–276 (2005)
Chen, Y., Davidson, S.B., Zheng, Y.: An efficient xpath query processor for Xml streams. In: ICDE, p. 79 (2006)
Viglas, S., Naughton, J.F., Burger, J.: Maximizing the output rate of multi-way join queries over streaming information sources. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 285–296. Springer, Heidelberg (2004)
Clark, J.: The expat Xml parser (2003), http://expat.sourceforge.net
Schmidt, A., Waas, F., Kersten, M.L., Carey, M.J., Manolescu, I., Busse, R.: Xmark: A benchmark for Xml data management. In: Bressan, S., Chaudhri, A.B., Li Lee, M., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, pp. 974–985. Springer, Heidelberg (2003)
XML Data Repository (2002), http://www.cs.washington.edu/research/xmldatasets/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tok, W.H., Bressan, S., Lee, ML. (2008). Twig’n Join: Progressive Query Processing of Multiple XML Streams. In: Haritsa, J.R., Kotagiri, R., Pudi, V. (eds) Database Systems for Advanced Applications. DASFAA 2008. Lecture Notes in Computer Science, vol 4947. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78568-2_45
Download citation
DOI: https://doi.org/10.1007/978-3-540-78568-2_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78567-5
Online ISBN: 978-3-540-78568-2
eBook Packages: Computer ScienceComputer Science (R0)