Abstract
This paper describes a scheme for partial pre-aggregation to speed up the response time of queries that are posed for the array-like interface, subject to the constraint that all pre-computed aggregates must fit into storage of a pre-determined size. The target query workload consists of all base and aggregate cells that are stored in a multidimensional array (i.e. cube). These queries are actually range queries pre-defined by users. Due to the huge size of all possible aggregate cells, the emphasis of our scheme is to reduce the overhead for query compilation. An efficient and effective query decomposition method is devised, which works well with a pre-aggregation scheme whereby pre-computed aggregates form a sub-cube of the full cube. A greedy algorithm is devised is to derive such a sub-cube. A HOLAP engine which implements this partial pre-aggregation scheme is described. Experimental results using both synthetic and real-life datasets are presented to demonstrate that the partial pre-aggregation scheme is viable, and for some complex queries, accelerates query execution by close to 300 times.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baralis, E., Parabolschi, S., Teniente, E.: Materialized Views Selection in a Multidimensional Database. In: Proc. of VLDB (1997)
Beyer, K., Ramakrishnan, R.: Bottom-Up Computation of Sparse and Iceberg CUBEs. In: Proc. of SIGMOD (1999)
Gray, J., Bosworth, A., Layman, A., Prahesh, H.: Data Cube: A Relational Aggregation Operator Generalizing group-BY, Cross-Tabs, and Sub-Totals. In: Proc. of ICDE 1996, New Orleans (February 1996)
Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J.: Index Selection for OLAP. In: Proc. 13th ICDE, Manchester, UK (1997)
Gupta, H., Mumick, I.S.: Selection of Views to Materialize Under a Maintenance-Time Constraint. In: Proc. International Conf. on Database Theory (1999)
Hellerstein, J.: Data Warehousing, Decision Support & OLAP, http://redbook.cs.berkeley.edu/lec28.html
Hung, E., Cheung, D., Kao, B., Liang, Y.: An optimization problem in data cube system design. In: Terano, T., Chen, A.L.P. (eds.) PAKDD 2000. LNCS, vol. 1805, pp. 74–85. Springer, Heidelberg (2000)
Harinarayan, V., Rajaraman, A., Ullmna, J.: Implementing Data Cubes Efficiently. In: Proc. of ACM SIGMOD (1996)
Li, C.: A Partial Pre-Computation of Aggregates for OLAP Databases. M.Sc. thesis, Simon Fraser University (July 2003)
Luk, W.: ADODA: A Desktop Online Data Analyzer. In: Proc. of DASFAA 2001, Hong Kong, (2001)
Pendse, N.: Datadase Explosion. OLAP Reports, http://www.olapreport.com/DatabaseExplosion.htm
Vassiliadis, P., Sellis, T.: A Survey of Logical Models for OLAP Databases. SIGMOD Record 28(4) (1999)
Shukla, A., Deshpande, P., Naughton, J.: Materialized View Selection for Multidimensional Datasets. In: Proc. of the 24th VLDB Conf., New York (1998)
Witkowski, A., et al.: Spreadsheets in RDBMS for OLAP. In: Proc. of ACM SIGMOD, San Diego (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Luk, WS., Li, C. (2004). A Partial Pre-aggregation Scheme for HOLAP Engines. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2004. Lecture Notes in Computer Science, vol 3181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30076-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-30076-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22937-7
Online ISBN: 978-3-540-30076-2
eBook Packages: Springer Book Archive