Abstract
The sliding-window model of computation is motivated by the assumption that, in certain data-stream processing applications, recent data is more useful and pertinent than older data. In such cases, we would like to answer questions about the data only over the last N most recent data elements (N is a parameter). We formalize this model of computation and answer questions about how much space and computation time is required to solve certain problems under the sliding-window model.
Material in this chapter also appears in Data Stream Management: Processing High-Speed Data Streams, edited by Minos Garofalakis, Johannes Gehrke and Rajeev Rastogi, published by Springer-Verlag.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
N. Alon, Y. Matias, and M. Szegedy. The space complexity of approximating the frequency moments. In Proc. of the 1996 Annual ACM Symp. on Theory of Computing, pages 20–29, 1996.
A. Arasu and G. Manku. Approximate counts and quantiles over sliding windows,. In Proc. of the 2004 ACM Symp. Principles of Database Systems, pages 286–296, June 2004.
B. Babcock, M. Datar, and R. Motwani. Sampling from a moving window over streaming data. In Proc. of the 2002 Annual ACM-SIAM Symp. on Discrete Algorithms, pages 633–634, 2002.
B. Babcock, M. Datar, R. Motwani, and L. O’Callaghan. Maintaining variance and k-medians over data stream windows. In Proc. of the 2003 ACM Symp. on Principles of Database Systems, pages 234–243, June 2003.
E. Cohen and M. Strauss. Maintaining time-decaying stream aggregates. In Proc. of the 2003 ACM Symp. on Principles of Database Systems, pages 223–233, June 2003.
A. Das, J. Gehrke, and M. Riedwald. Approximate join processing over data streams. In Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data, pages 40–51, 2003.
M. Datar. Algorithms for Data Stream Systems. PhD thesis, Stanford University, Stanford, CA, USA, December 2003.
M. Datar, A. Gionis, P. Indyk, and R. Motwani. Maintaining stream statistics over sliding windows. SIAM Journal on Computing, 31(6): 1794–1813, 2002.
M. Datar and S. Muthukrishnan. Estimating rarity and similarity over data stream windows. In Proc. of the 2002 Annual European Symp. on Algorithms, pages 323–334, September 2002.
J. Feigenbaum, S. Kannan, M. Strauss, and M. Viswanathan. An approximate 11-difference algorithm for massive data streams. In Proc. of the 1999 Annual IEEE Symp. on Foundations of Computer Science, pages 501–511, 1999.
A. Gilbert, S. Guha, P. Indyk, Y. Kotidis, S. Muthukrishnan, and M. Strauss. Fast, small-space algorithms for approximate histogram maintenance. In Proc. of the 2002 Annual ACM Symp. on Theory of Computing, 2002.
A. Gilbert, Y. Kotidis, S. Muthukrishnan, and M. Strauss. Surfing wavelets on streams: One-pass summaries for approximate aggregate queries. In Proc. of the 2001 Intl. Conf. on Very Large Data Bases, pages 79–88, 2001.
M. Greenwald and S. Khanna. Space-efficient online computation of quantile summaries. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data, pages 58–66, 2001.
S. Guha, N. Mishra, R. Motwani, and L. O’Callaghan. Clustering data streams. In Proc. of the 2000 Annual IEEE Symp. on Foundations of Computer Science, pages 359–366, November 2000.
P. Indyk. Stable distributions, pseudorandom generators, embeddings and data stream computation. In Proc. of the 2000 Annual IEEE Symp. on Foundations of Computer Science, pages 189–197, 2000.
J. Kang, J. F. Naughton, and S. Viglas. Evaluating window joins over unbounded streams. In Proc. of the 2003 Intl. Conf. on Data Engineering, March 2003.
X. Lin, H. Lu, J. Xu, and J. X. Yu. Continuously maintaining quantile summaries of the most recent n elements over a data stream. In Proc. of the 2004 Intl. Conf. on Data Engineering, March 2004.
R. Motwani and P. Raghavan. Randomized Algorithms. Cambridge University Press, 1995.
J.S. Vitter. Random sampling with a reservoir. ACM Trans. on Mathematical Software, 11(1):37–57, 1985.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Datar, M., Motwani, R. (2007). The Sliding-Window Computation Model and Results. In: Aggarwal, C.C. (eds) Data Streams. Advances in Database Systems, vol 31. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-47534-9_8
Download citation
DOI: https://doi.org/10.1007/978-0-387-47534-9_8
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-28759-1
Online ISBN: 978-0-387-47534-9
eBook Packages: Computer ScienceComputer Science (R0)