Abstract
Joining data streams over windows of various types is an established stream-processing method. When memory constraints limit the window size, however, the accuracy of the query result suffers heavily. In this paper, we propose a windowing technique based on new cost functions for join query processing under memory constraints. The logical window is built on a dedicated data structure and maintained by load shedding with minimal overhead. We applied the technique to XML streams and demonstrated its effectiveness by measuring the accuracy of the result of joining two XML streams with standard XQuery. Assuming that an approximate answer within an acceptable error bound is sufficient when the XML stream is unbounded and complex, we developed a low-overhead load-shedding architecture and tested its usefulness with a set of cost functions.
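The abstract does not give the cost functions or the data structure, so the following is only a minimal sketch of the general idea, under several assumptions: stream elements are flattened to (side, key, payload) tuples rather than XML fragments, each window is a plain list with a fixed capacity, and the two cost functions (`age_cost`, `match_cost`) are hypothetical examples, not the ones defined in the paper. It shows how a pluggable cost function decides which element is shed when a window overflows, and how that choice changes the number of join results produced.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass(eq=False)  # identity comparison, so list.remove drops the exact victim
class Item:
    key: str
    payload: str
    arrival: int
    matches: int = 0  # join results this item has contributed so far


# A cost function scores an item at time `now`; the lowest score is shed first.
CostFn = Callable[[Item, int], float]


def age_cost(item: Item, now: int) -> float:
    """Hypothetical cost: the oldest item scores lowest (plain sliding window)."""
    return -(now - item.arrival)


def match_cost(item: Item, now: int) -> float:
    """Hypothetical cost: matches per time unit spent in the window."""
    return item.matches / (1 + now - item.arrival)


class BoundedWindow:
    """Window holding at most `capacity` items; overflow triggers shedding."""

    def __init__(self, capacity: int, cost: CostFn) -> None:
        self.capacity = capacity
        self.cost = cost
        self.items: List[Item] = []

    def insert(self, item: Item, now: int) -> None:
        self.items.append(item)
        if len(self.items) > self.capacity:
            victim = min(self.items, key=lambda it: self.cost(it, now))
            self.items.remove(victim)

    def probe(self, key: str) -> List[Item]:
        return [it for it in self.items if it.key == key]


def join_streams(
    events: Iterable[Tuple[int, str, str]], capacity: int, cost: CostFn
) -> List[Tuple[str, str]]:
    """Symmetric window join; `events` are (side, key, payload), side is 0 or 1."""
    windows = (BoundedWindow(capacity, cost), BoundedWindow(capacity, cost))
    results: List[Tuple[str, str]] = []
    for now, (side, key, payload) in enumerate(events):
        new = Item(key, payload, now)
        # Probe the opposite window first, then store the new item.
        for other in windows[1 - side].probe(key):
            other.matches += 1
            new.matches += 1
            pair = (payload, other.payload) if side == 0 else (other.payload, payload)
            results.append(pair)
        windows[side].insert(new, now)
    return results


events = [
    (0, "a", "L1"), (1, "a", "R1"),
    (0, "b", "L2"), (1, "b", "R2"),
    (0, "c", "L3"), (1, "a", "R3"),
]
exact = join_streams(events, capacity=10, cost=age_cost)     # no shedding needed
by_age = join_streams(events, capacity=2, cost=age_cost)     # sheds oldest item
by_match = join_streams(events, capacity=2, cost=match_cost) # sheds least productive
print(len(exact), len(by_age), len(by_match))
```

Result accuracy can then be estimated as the ratio of approximate to exact result counts, for example `len(by_age) / len(exact)`. On this toy input the age-based policy loses one result because it evicts `L1` before `R3` arrives, while the match-based policy keeps `L1` and sheds the unmatched `L3` instead.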
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dash, R., Fegaras, L. (2010). A Load Shedding Framework for XML Stream Joins. In: Bringas, P.G., Hameurlain, A., Quirchmayr, G. (eds) Database and Expert Systems Applications. DEXA 2010. Lecture Notes in Computer Science, vol 6261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15364-8_21
DOI: https://doi.org/10.1007/978-3-642-15364-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15363-1
Online ISBN: 978-3-642-15364-8
eBook Packages: Computer Science, Computer Science (R0)