Abstract
Join is a fundamental operator in a Data Stream Management System (DSMS). It is more efficient to share execution of multiple windowed joins than separate execution of everyone because the former saves a part of cost in common windows. Therefore, shared window join is adopted widely in multi-queries DSMS. When all tasks of queries exceed maximum system capacity, the overloaded DSMS fails to process all of its input data and keep up with the rates of data arrival. Especially in a time-critical environment, queries should be completed not just timely but within certain deadlines. In this paper, we address load shedding approach for shared window join over real-time data streams. A load shedding algorithm LS-SJRT-CW is proposed to handle queries shared window join in overloaded real-time system effectively. It would reduce load shedding overhead by adjusting sliding window size. Experiment results show that our algorithm would decrease average deadline miss ratio over some ranges of workloads.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golab, L., Ozsu, M.T.: Issues in data stream management. SIGMOD Record 32(2), 5–14 (2003)
Babcock, B., Datar, M., Motwani, R.: Load shedding for aggregation queries over data streams. In: Proceedings of ICDE, Boston, USA, pp. 350–361 (2004)
Tatbul, N., Cetintemel, U., Zdonik, S., Cherniack, M., Stonebraker, M.: Load shedding in a data stream manager. In: Proceedings of VLDB, Berlin, Germany, pp. 309–320 (2003)
Srivastava, U., Widom, J.: Memory-limited execution of windowed stream joins. In: Proceedings of VLDB, Toronto, Canada, pp. 324–335 (2004)
Gedik, B., Wu, K.L., Yu, P., Liu, L.: Adaptive load shedding for windowed stream joins. In: Proceedings of the 14th ACM international conference on Information and knowledge management (CIKM), Bremen, Germany, pp. 171–178 (2005)
Ayad, A., Naughton, J., Wright, S., Srivastava, U.: Approximating streaming window joins under CPU limitations. In: Proceedings of ICDE, Atlanta, Georgia, p. 142 (2006)
Yan, Y., Jin, C.Q., Cao, F., Wang, H.J., Zhou, A.Y.: Load shedding for shared window joins over data streams. Journal of Computer Research and Development 41(10), 1836–1841 (2004) (in Chinese)
Hammad, M.A., Franklin, M.J., Aref, W.G., Elmagarmid, A.K.: Scheduling for shared window joins over data streams. In: Proceedings of VLDB, Berlin, Germany, pp. 297–308 (2003)
Tu, Y.C., Liu, S., Prabhakar, S., Yao, B.: Load shedding in stream databases: a control-based approach. In: Proceedings of VLDB, Seoul, Korea, pp. 787–798 (2006)
Wei, Y., Prasad, V., Son, S.H., Stankovic, J.A.: Prediction-based QoS management for real-time data streams. In: Proceedings of RTSS, Rio de Janeiro, Brazil, pp. 344-358 (2006)
Li, X., Ma, L., Li, K., Wang, K., Wang, H.A.: Adaptive load management over real-time data streams. In: Proceedings of the 4th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), Hainan, China, pp. 719–725 (2007)
Madden, S., Shah, M.A., Hellerstein, J.M., Raman, V.: Continuously adaptive continuous queries over streams. In: Proceedings of SIGMOD, Madison, USA, pp. 49–60 (2002)
Motwani, R., Widom, J., Arasu, A., et al.: Query processing, resource management, and approximation in a data stream management system. In: Proceedings of CIDR, Asilomar, USA, pp. 245–256 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ma, L., Liang, D., Zhang, Q., Li, X., Wang, H. (2009). Load Shedding for Shared Window Join over Real-Time Data Streams. In: Li, Q., Feng, L., Pei, J., Wang, S.X., Zhou, X., Zhu, QM. (eds) Advances in Data and Web Management. APWeb WAIM 2009 2009. Lecture Notes in Computer Science, vol 5446. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00672-2_56
Download citation
DOI: https://doi.org/10.1007/978-3-642-00672-2_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00671-5
Online ISBN: 978-3-642-00672-2
eBook Packages: Computer ScienceComputer Science (R0)