Skip to main content

Efficient Computation of Multi-feature Data Cubes

  • Conference paper
  • 1098 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4092))

Abstract

A Multi-Feature Cube (MF-Cube) query is a complex-data-mining query based on data cubes, which computes the dependent complex aggregates at multiple granularities. Existing computations designed for simple data cube queries can be used to compute distributive and algebraic MF-Cubes queries. In this paper we propose an efficient computation of holistic MF-Cubes queries. This method computes holistic MF-Cubes with PDAP (Part Distributive Aggregate Property). The efficiency is gained by using dynamic subset data selection strategy (Iceberg query technique) to reduce the size of materialized data cube. Also for efficiency, this approach adopts the chunk-based caching technique to reuse the output of previous queries. We experimentally evaluate our algorithm using synthetic and real-world datasets, and demonstrate that our approach delivers up to about twice the performance of traditional computations.

This work is partially supported by Australian large ARC grants (DP0449535, DP0559536 and DP0667060), a China NSFC major research Program (60496327), a China NSFC grant (60463003) and a grant from Overseas Outstanding Talent Research Program of Chinese Academy of Sciences (06S3011S01).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ross, K.A., Srivastava, D., Chatziantoniou, D.: Complex Aggregation at Multiple Granularities. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 263–277. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  2. Ross, K., Srivastava, D.: Fast computation of sparse datacubes. In: Proc. Int. Conf. Very Large Data Bases, August 1997, Athens, Greece, pp. 116–125 (1997)

    Google Scholar 

  3. Zhao, Y., Deshpande, P.M., Naughton, J.F.: An Array-Based Algorithm for Simultaneous Multidimensional Aggregates. In: Proc. of ACM SIGMOD, pp. 159–170 (1997)

    Google Scholar 

  4. Beyer, K.S., Ramakrishnan, R.: Bottom-Up Computation of Sparse and Iceberg Cubes. In: Proc. Of ACM SIGMOD, pp. 359–370 (1999)

    Google Scholar 

  5. Agarwal, S., Agrawal, R., Deshpande, P.M., Gupta, A., Naughton, J.F., Ramakrishnan, R., Sarawagi, S.: On the Computation of Multidimensional Aggregates. In: Proc. Int’ Conf. Very Large Data Bases, pp. 506–521 (1996)

    Google Scholar 

  6. Gray, A., Bosworth, A., Layman, A., Pirahesh, H.: Datacube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: Proc. of the IEEE ICDE, pp. 152–159 (1996)

    Google Scholar 

  7. Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., Ullman, J.D.: Computing iceberg queries efficiently. In: Proc. of 24th VLDB Conf., New York, pp. 299–310 (1998)

    Google Scholar 

  8. Han, J.P., Dong, G., Wang, K.: Efficient Computation of Iceberg Cubes with Complex Measures. In: Proc. ACM-SIGMOD Int’l Conf. Management of Data, pp. 1–12 (2001)

    Google Scholar 

  9. Deshpande, P.M., Ramasamy, K., Shukla, A.: Caching Multidimensional Queries Using Chunks. In: Proc. ACM SIGMOD, pp. 259–270 (1998)

    Google Scholar 

  10. Dong, G., Han, J., Joyce, M.W.L.: Mining Constrained Gradients in Large Databases. In: IEEE TKDE (2003)

    Google Scholar 

  11. Hahn, C., Warren, S., London, J.: Edited synoptic cloud reports from ships and land stations over the globe (1996), Available on: http://cdiac.esd.ornl.gov/cdiac/ndps/ndp026b.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, S., Wang, R., Guo, Y. (2006). Efficient Computation of Multi-feature Data Cubes. In: Lang, J., Lin, F., Wang, J. (eds) Knowledge Science, Engineering and Management. KSEM 2006. Lecture Notes in Computer Science(), vol 4092. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811220_52

Download citation

  • DOI: https://doi.org/10.1007/11811220_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-37033-8

  • Online ISBN: 978-3-540-37035-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics