A Clustered Dwarf Structure to Speed Up Queries on Data Cubes

Leng, Fangling; Bao, Yubin; Wang, Daling; Yu, Ge

doi:10.1007/978-3-540-74553-2_16

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4654)

Abstract

Dwarf is a highly compressed structure that shrinks a data cube by eliminating semantic redundancies while the cube is being computed. Despite its high compression ratio, Dwarf is slower to query and harder to update because of its structural characteristics. We therefore propose two novel clustering methods for query optimization: the recursion clustering method for point queries and the hierarchical clustering method for range queries. To facilitate implementation, we design a partition strategy and a logical clustering mechanism. Experimental results show that our methods effectively improve query performance on data cubes, and that the recursion clustering method is suitable for both point queries and range queries.
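
The distinction between point queries and range queries on a data cube can be illustrated with a minimal Python sketch. It is not the Dwarf structure and not the clustering methods proposed in the paper: it simply materializes every group-by of a toy fact table in a dictionary. The names `build_cube`, `point_query`, `range_query`, the `ALL` wildcard, and the sample rows are all assumptions made for illustration.

```python
from itertools import product

ALL = "*"  # hypothetical wildcard: aggregate over every value of a dimension


def build_cube(facts):
    """Materialize every group-by of (dims..., measure) rows, using SUM."""
    cube = {}
    for *dims, measure in facts:
        # Each dimension either keeps its value or is rolled up to ALL.
        for cell in product(*[(d, ALL) for d in dims]):
            cube[cell] = cube.get(cell, 0) + measure
    return cube


def point_query(cube, cell):
    """Look up one cell, e.g. ("S1", ALL, "P2"); empty cells yield 0."""
    return cube.get(cell, 0)


def range_query(cube, ranges):
    """Sum the cells in the cross product of per-dimension value lists."""
    return sum(point_query(cube, cell) for cell in product(*ranges))


# Illustrative rows: (store, customer, product, sales)
facts = [
    ("S1", "C2", "P2", 70),
    ("S1", "C3", "P1", 40),
    ("S2", "C1", "P1", 90),
    ("S2", "C1", "P2", 50),
]
cube = build_cube(facts)

print(point_query(cube, ("S1", ALL, ALL)))                         # 110
print(range_query(cube, [["S1", "S2"], ["C1", "C2"], [ALL]]))      # 210
```

In this sketch a range query decomposes into several point lookups, one per combination of values. That suggests why the physical placement of cells matters differently for the two query types, although how Dwarf nodes are actually clustered is described only in the full paper.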

Supported by the National Natural Science Foundation of China under Grant Nos. 60473073, 60573090, and 60673139.

Author information

Authors and Affiliations

School of Information Science & Engineering, Northeastern University, Shenyang 110004, P.R.China
Fangling Leng, Yubin Bao, Daling Wang & Ge Yu

Editor information

Il Yeal Song, Johann Eder, Tho Manh Nguyen

About this paper

Cite this paper

Leng, F., Bao, Y., Wang, D., Yu, G. (2007). A Clustered Dwarf Structure to Speed Up Queries on Data Cubes. In: Song, I.Y., Eder, J., Nguyen, T.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2007. Lecture Notes in Computer Science, vol 4654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74553-2_16

DOI: https://doi.org/10.1007/978-3-540-74553-2_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74552-5
Online ISBN: 978-3-540-74553-2
eBook Packages: Computer Science, Computer Science (R0)
