Abstract
Twig pattern matching plays an important role in XML query processing. Holistic twig pattern matching algorithms have been proposed and are considered effective because they avoid producing a large number of intermediate results. Meanwhile, automaton-based approaches are a natural fit for filtering XML streams, because finite state machines (FSMs) are driven by events, which matches the event-based XML parser interface SAX. In this paper, we propose a hybrid approach that combines an FSM with a holistic twig matching algorithm to find occurrences of a twig pattern in XML streams. Specifically, we locate the lowest common ancestor (LCA) of the return node(s) in the twig pattern, decompose the twig pattern at the LCA, use an automaton-based approach to process the sub-twig pattern above the LCA, and use a regular holistic twig pattern matching algorithm for the sub-twig pattern below the LCA. The approach only needs to buffer the elements between the start and end tags of the LCA. Experiments show the effectiveness of this approach.
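The buffering idea can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the sub-pattern above the LCA is a plain child-axis path (so the FSM state reduces to the number of path steps matched so far) and that the holistic twig matcher is replaced by a callback. The names `LcaBufferHandler`, `buffered_subtrees`, and `on_subtree` are hypothetical; only Python's standard `xml.sax` module is used.

```python
import xml.sax


class LcaBufferHandler(xml.sax.ContentHandler):
    """Buffer only the SAX events between the start and end tag of the LCA.

    Assumption: the part of the twig pattern above the LCA is a simple
    child-axis path such as ["lib", "book"], rooted at the document root.
    """

    def __init__(self, lca_path, on_subtree):
        super().__init__()
        self.lca_path = lca_path      # tag names from the root down to the LCA
        self.on_subtree = on_subtree  # stand-in for the holistic twig matcher
        self.depth = 0                # number of currently open elements
        self.state = 0                # FSM state: path steps matched so far
        self.buffer = None            # event list while inside an LCA element
        self.lca_depth = None         # depth at which the current LCA opened

    def startElement(self, name, attrs):
        self.depth += 1
        if self.buffer is not None:
            # Inside an LCA element: record the event, do not move the FSM.
            self.buffer.append(("start", name))
            return
        next_step = self.state + 1
        if (self.depth == next_step
                and next_step <= len(self.lca_path)
                and name == self.lca_path[next_step - 1]):
            self.state = next_step
            if self.state == len(self.lca_path):
                # Accepting state reached: the LCA element starts here.
                self.buffer = [("start", name)]
                self.lca_depth = self.depth

    def endElement(self, name):
        if self.buffer is not None:
            self.buffer.append(("end", name))
            if self.depth == self.lca_depth:
                # End tag of the LCA: hand the buffer over and discard it.
                self.on_subtree(self.buffer)
                self.buffer = None
                self.lca_depth = None
                self.state = self.depth - 1
        elif self.depth == self.state:
            # A matched ancestor closes: step the FSM back by one.
            self.state = self.depth - 1
        self.depth -= 1


def buffered_subtrees(xml_bytes, lca_path):
    """Return one event list per LCA occurrence found in the stream."""
    found = []
    xml.sax.parseString(xml_bytes, LcaBufferHandler(lca_path, found.append))
    return found
```

In a fuller version, `on_subtree` would run a holistic twig matching algorithm over the buffered events for the sub-twig pattern below the LCA; here it only collects them, so memory use is bounded by the largest LCA subtree rather than by the whole stream.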
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Deng, Z., Liao, H., Gao, H. (2012). Twig Pattern Matching Running on XML Streams. In: Wang, H., et al. Web Technologies and Applications. APWeb 2012. Lecture Notes in Computer Science, vol 7234. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29426-6_6
DOI: https://doi.org/10.1007/978-3-642-29426-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29425-9
Online ISBN: 978-3-642-29426-6
eBook Packages: Computer Science, Computer Science (R0)