Skip to main content

A Clustered Dwarf Structure to Speed Up Queries on Data Cubes

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4654))

Abstract

Dwarf is a highly compressed structure, which compresses the cube by eliminating the semantic redundancies while computing a data cube. Although it has high compression ratio, Dwarf is slower in querying and more difficult in updating due to its structure characteristics. So we propose two novel clustering methods for query optimization: the recursion clustering method for point queries and the hierarchical clustering method for range queries. To facilitate the implementation, we design a partition strategy and a logical clustering mechanism. Experimental results show our methods can effectively improve the query performance on data cubes, and the recursion clustering method is suitable for both point queries and range queries.

Supported by the National Natural Science Foundation of China under Grant No.60473073, 60573090, 60673139.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sismanis, Y., Roussopoulos, N., Deligianannakis, A., Kotidis, Y.: Dwarf: Shrinking the Petacube. In: SIGMOD, pp. 564–475 (2002)

    Google Scholar 

  2. Wang, W., Feng, J., Lu, H., Yu, J.X.: Condensed Cube: An Effective Approach to Reducing Data Cube Size. In: ICDE, pp. 155–165 (2002)

    Google Scholar 

  3. Vitter, J.S., Wang, M., Iyer, B.: Data Cube Approximation and Histograms via Wavelets. In: CIKM, pp. 96–104 (1998)

    Google Scholar 

  4. Barbara, D., Sullivan, M.: A Space-Efficient Way to Support Approximate Multidimensional Databases. Technical report, ISSE-TR-98-03 (1998)

    Google Scholar 

  5. Gibbons, P.B., Matias, Y.: New Sampling-Based Summary Statistics for Improving Approximate Query Answers. In: SIGMOD, pp. 331–342 (1998)

    Google Scholar 

  6. Acharya, S., Gibbons, P.B., Poosala, V.: Congressional Samples for Approximate Answering of Group-By Queries. In: SIGMOD, pp. 487–498 (2000)

    Google Scholar 

  7. Shanmugasundaram, J., Fayyad, U., Bradley, P.S.: Compressed Data Cubes for OLAP Aggregate Query Approximation on Continuous Dimensions. In: KDD, pp. 223–232 (1999)

    Google Scholar 

  8. Beyer, K., Ramakrishnan, R.: Bottom-Up Computation of Sparse and Iceberg Cubes. In: SIGMOD, pp. 359–370 (1999)

    Google Scholar 

  9. Xin, D., Han, J., Li, X., Wah, B.W.: Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration. In: VLDB, pp. 476–487 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Il Yeal Song Johann Eder Tho Manh Nguyen

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leng, F., Bao, Y., Wang, D., Yu, G. (2007). A Clustered Dwarf Structure to Speed Up Queries on Data Cubes. In: Song, I.Y., Eder, J., Nguyen, T.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2007. Lecture Notes in Computer Science, vol 4654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74553-2_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74553-2_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74552-5

  • Online ISBN: 978-3-540-74553-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics