Skip to main content

Approximate Histogram and Wavelet Summaries of Streaming Data

  • Chapter
  • First Online:
Data Stream Management

Abstract

We study a synopsis abstract data structure similar to an array abstract data type commonly seen in textbooks. Specifically, we model an array \(\mathbf{A}\) of \(N\) real values, \(\{i: 0\le i < N\}=[0,N)\). Like to an array, a user can pose point and range queries; in addition, the data is dynamic (i.e., continuously updated) and can be multidimensional. We study approximate versions of this abstract data structure that provide only an approximate representation of \(\mathbf{A}\), using much less space than the full data set. Our focus is on two related types of sparse summaries, histograms and Haar wavelets.

S. Muthukrishnan supported by NSF ITR 0220280. M. Strauss supported in part by NSF DMS-0354600 and NSF DMS-0510203.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

eBook
USD 12.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. F. d’Amore, P.G. Franciosa, On the optimal binary plane partition for sets of isothetic rectangles. Inf. Process. Lett. 44(5), 255–259 (1992)

    Article  MathSciNet  MATH  Google Scholar 

  2. M. Blum, R. Floyd, V. Pratt, R. Rivest, R. Tarjan, Time bounds for selection. J. Comput. Syst. Sci. 7, 448–461 (1972)

    Article  MathSciNet  MATH  Google Scholar 

  3. G. Cormode, S. Muthukrishnan, An improved data stream summary: the count-min sketch and its applications, in LATIN (2004), pp. 29–38

    Google Scholar 

  4. H. Jagadish, N. Koudas, S. Muthukrishnan, V. Poosala, K. Sevcik, T. Suel, Optimal histograms with quality guarantees, in Proc. of the 1998 Intl. Conf. on Very Large Data Bases (VLDB) (1998), pp. 275–286

    Google Scholar 

  5. A. Gilbert, Y. Kotidis, S. Muthukrishnan, M. Strauss, Surfing wavelets on streams: one-pass summaries for approximate aggregate queries, in Proc. of the 2001 Intl. Conf. on Very Large Data Bases (VLDB) (2001), pp. 79–88

    Google Scholar 

  6. A. Gilbert, S. Guha, P. Indyk, Y. Kotidis, S. Muthukrishnan, M. Strauss, Fast, small-space algorithms for approximate histogram maintenance, in Proc. STOC (2002), pp. 389–398

    Google Scholar 

  7. S. Guha, P. Indyk, S. Muthukrishnan, M. Strauss, Histogramming data streams with fast per-item processing, in Proc. ICALP (2002), pp. 681–692

    Google Scholar 

  8. S. Guha, N. Koudas, K. Shim, Data-streams and histograms, in Proc. of the 2001 Annual ACM Symp. on Theory of Computing (STOC) (2001), pp. 471–475

    Google Scholar 

  9. S. Guha, N. Koudas, Approximating a data stream for querying and estimation: algorithms and performance evaluation, in Proc. of the 2002 Intl. Conf. on Data Engineering (ICDE) (2002), pp. 567–576

    Google Scholar 

  10. S. Muthukrishnan, V. Poosala, T. Suel, On rectangular partitionings in two dimensions: algorithms, complexity, and applications, in Proc. ICDT (1999), pp. 236–256

    Google Scholar 

  11. S. Muthukrishnan, M. Strauss, in Rangesum Histograms. Proc. ACM-SIAM SODA (2003), pp. 233–242

    Google Scholar 

  12. S. Muthukrishnan, M. Strauss, Maintenance of multidimensional histograms, in Proc. FSTTCS (2003), pp. 352–362

    Google Scholar 

  13. G. Strang, V. Strela, Orthogonal multiwavelets with vanishing moments, in Proc. SPIE, ed. by H.H. Szu. Wavelet Applications, vol. 2242 (1994), pp. 2–9

    Google Scholar 

  14. N. Thaper, S. Guha, P. Indyk, N. Koudas, Dynamic multidimensional histograms, in Proc. ACM SIGMOD Conference (2002), pp. 428–439

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Muthukrishnan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Muthukrishnan, S., Strauss, M. (2016). Approximate Histogram and Wavelet Summaries of Streaming Data. In: Garofalakis, M., Gehrke, J., Rastogi, R. (eds) Data Stream Management. Data-Centric Systems and Applications. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28608-0_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-28608-0_13

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-28607-3

  • Online ISBN: 978-3-540-28608-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics