Abstract
We study a synopsis abstract data structure similar to an array abstract data type commonly seen in textbooks. Specifically, we model an array \(\mathbf{A}\) of \(N\) real values, \(\{i: 0\le i < N\}=[0,N)\). Like to an array, a user can pose point and range queries; in addition, the data is dynamic (i.e., continuously updated) and can be multidimensional. We study approximate versions of this abstract data structure that provide only an approximate representation of \(\mathbf{A}\), using much less space than the full data set. Our focus is on two related types of sparse summaries, histograms and Haar wavelets.
S. Muthukrishnan supported by NSF ITR 0220280. M. Strauss supported in part by NSF DMS-0354600 and NSF DMS-0510203.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
F. d’Amore, P.G. Franciosa, On the optimal binary plane partition for sets of isothetic rectangles. Inf. Process. Lett. 44(5), 255–259 (1992)
M. Blum, R. Floyd, V. Pratt, R. Rivest, R. Tarjan, Time bounds for selection. J. Comput. Syst. Sci. 7, 448–461 (1972)
G. Cormode, S. Muthukrishnan, An improved data stream summary: the count-min sketch and its applications, in LATIN (2004), pp. 29–38
H. Jagadish, N. Koudas, S. Muthukrishnan, V. Poosala, K. Sevcik, T. Suel, Optimal histograms with quality guarantees, in Proc. of the 1998 Intl. Conf. on Very Large Data Bases (VLDB) (1998), pp. 275–286
A. Gilbert, Y. Kotidis, S. Muthukrishnan, M. Strauss, Surfing wavelets on streams: one-pass summaries for approximate aggregate queries, in Proc. of the 2001 Intl. Conf. on Very Large Data Bases (VLDB) (2001), pp. 79–88
A. Gilbert, S. Guha, P. Indyk, Y. Kotidis, S. Muthukrishnan, M. Strauss, Fast, small-space algorithms for approximate histogram maintenance, in Proc. STOC (2002), pp. 389–398
S. Guha, P. Indyk, S. Muthukrishnan, M. Strauss, Histogramming data streams with fast per-item processing, in Proc. ICALP (2002), pp. 681–692
S. Guha, N. Koudas, K. Shim, Data-streams and histograms, in Proc. of the 2001 Annual ACM Symp. on Theory of Computing (STOC) (2001), pp. 471–475
S. Guha, N. Koudas, Approximating a data stream for querying and estimation: algorithms and performance evaluation, in Proc. of the 2002 Intl. Conf. on Data Engineering (ICDE) (2002), pp. 567–576
S. Muthukrishnan, V. Poosala, T. Suel, On rectangular partitionings in two dimensions: algorithms, complexity, and applications, in Proc. ICDT (1999), pp. 236–256
S. Muthukrishnan, M. Strauss, in Rangesum Histograms. Proc. ACM-SIAM SODA (2003), pp. 233–242
S. Muthukrishnan, M. Strauss, Maintenance of multidimensional histograms, in Proc. FSTTCS (2003), pp. 352–362
G. Strang, V. Strela, Orthogonal multiwavelets with vanishing moments, in Proc. SPIE, ed. by H.H. Szu. Wavelet Applications, vol. 2242 (1994), pp. 2–9
N. Thaper, S. Guha, P. Indyk, N. Koudas, Dynamic multidimensional histograms, in Proc. ACM SIGMOD Conference (2002), pp. 428–439
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Muthukrishnan, S., Strauss, M. (2016). Approximate Histogram and Wavelet Summaries of Streaming Data. In: Garofalakis, M., Gehrke, J., Rastogi, R. (eds) Data Stream Management. Data-Centric Systems and Applications. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28608-0_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-28608-0_13
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28607-3
Online ISBN: 978-3-540-28608-0
eBook Packages: Computer ScienceComputer Science (R0)