
Wavelet synopsis for hierarchical range queries with workloads

  • Regular Paper
The VLDB Journal

Abstract

Synopsis structures and approximate query answering have become increasingly important in DSS/OLAP applications with stringent response time requirements. Range queries are an important class of problems in this domain; they have a wide variety of applications and have been studied in the context of histograms. However, wavelets have been shown to be quite useful in several scenarios, and their multi-resolution structure makes them especially appealing for hierarchical domains. Furthermore, because the coefficients of the Haar wavelet basis can be computed in linear time, the Haar basis has become one of the important and widely used synopsis structures. Very recently, optimal algorithms were proposed for the wavelet synopsis construction problem for equality/point queries. In this paper, we investigate the problem of optimum Haar wavelet synopsis construction for range queries with workloads. We provide optimum algorithms as well as approximation heuristics, and demonstrate the effectiveness of these algorithms through an extensive experimental evaluation using synthetic and real-life data sets.
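
For readers unfamiliar with the linear-time property mentioned in the abstract, the following is a minimal sketch of a textbook Haar decomposition by repeated pairwise averaging and differencing. It is not the synopsis construction algorithm of the paper. The function name `haar_coefficients`, the unnormalized coefficient convention, and the power-of-two length requirement are assumptions made for illustration only.

```python
from typing import List


def haar_coefficients(data: List[float]) -> List[float]:
    """Unnormalized Haar coefficients of `data` (illustrative sketch).

    Output layout (an assumed convention): index 0 holds the overall
    average, followed by detail coefficients from coarsest to finest.
    """
    n = len(data)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a positive power of two")

    coeffs = [0.0] * n
    current = [float(x) for x in data]
    # Each pass halves the working array, so the total work is
    # n/2 + n/4 + ... + 1 < n pair operations, i.e. linear in n.
    while len(current) > 1:
        half = len(current) // 2
        averages = [(current[2 * i] + current[2 * i + 1]) / 2 for i in range(half)]
        details = [(current[2 * i] - current[2 * i + 1]) / 2 for i in range(half)]
        coeffs[half:2 * half] = details  # details of this resolution level
        current = averages               # recurse on the coarser signal
    coeffs[0] = current[0]
    return coeffs
```

A synopsis in this setting keeps only a small subset of such coefficients; which subset is best for range queries under a workload is the question the paper addresses, and this sketch does not attempt to answer it.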



Author information

Corresponding author

Correspondence to Sudipto Guha.

Additional information

Research was supported in part by the Alfred P. Sloan Research Fellowship and NSF awards CCF-0430376, CCF-0644119.

Research was supported by the Ministry of Information and Communication, Korea, under the College Information Technology Research Center Support Program, grant number IITA-2006-C1090-0603-0031.

About this article

Cite this article

Guha, S., Park, H. & Shim, K. Wavelet synopsis for hierarchical range queries with workloads. The VLDB Journal 17, 1079–1099 (2008). https://doi.org/10.1007/s00778-007-0052-3
