Skip to main content

Discovery-driven exploration of OLAP data cubes

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1377))

Abstract

Analysts predominantly use OLAP data cubes to identify regions of anomalies that may represent problem areas or new opportunities. The current OLAP systems support hypothesis-driven exploration of data cubes through operations such as drill-down, roll-up, and selection. Using these operations, an analyst navigates unaided through a huge search space looking at large number of values to spot exceptions. We propose a new discovery-driven exploration paradigm that mines the data for such exceptions and summarizes the exceptions at appropriate levels in advance. It then uses these exceptions to lead the analyst to interesting regions of the cube during navigation. We present the statistical foundation underlying our approach. We then discuss the computational issue of finding exceptions in data and making the process efficient on large multidimensional data bases.

This is an abridged version of the full paper that appears in [SAM98].

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Agarwal, R. Agrawal, P.M. Deshpande, A. Gupta, J.F. Naughton, R. Ramakrishnan, and S. Sarawagi. On the computation of multidimensional aggregates. In Proc. of the 22nd Int'l Conference on Very Large Databases, pages 506–521, Mumbai (Bombay), India, September 1996.

    Google Scholar 

  2. Rakesh Agrawal, Ashish Gupta, and Sunita Sarawagi. Modeling multidimensional databases. In Proc. of the 13th Int'l Conference on Data Engineering, Birmingham, U.K., April 1997.

    Google Scholar 

  3. Arbor Software Corporation. Application Manager User's Guide, Essbase version 4.0. http://www.arborsoft.com.

    Google Scholar 

  4. Y. Bishop, S. Fienberg, and P. Holland. Discrete Multivariate Analysis theory and practice. The MIT Press, 1975.

    Google Scholar 

  5. William W. Cooley and Paul R Lohnes. Multivariate data analysis. Robert E. Krieger publishers, 1986.

    Google Scholar 

  6. George Colliat. OLAP, relational, and multidimensional database systems. Technical report, Arbor Software Corporation, Sunnyvale, CA, 1995.

    Google Scholar 

  7. J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data cube: A relational aggregation operator generalizing group-by, cross-tabs and sub-totals. In Proc. of the 12th Int'l Conference on Data Engineering, pages 152–159, 1996.

    Google Scholar 

  8. D. Hoaglin, F. Mosteller, and Tukey. J. Exploring data tables, trends and shapes. Wiley series in probability, 1988.

    Google Scholar 

  9. D.C. Hoaglin, F. Mosteller, and J.W. Tukey. Understanding Robust and Exploratory Data Analysis. John Wiley, New York, 1983.

    MATH  Google Scholar 

  10. J. Mandel. A new analysis of variance model for non-additive data. Technometrics, 13:1–18, 1971.

    Article  Google Scholar 

  11. D.G. Montgomery. Design and Analysis of Experiments, chapter 13. John Wiley & sons, third edition, 1991.

    Google Scholar 

  12. The OLAP Council. MD-API the OLAP Application Program Interface Version 0.5 Specification, September 1996.

    Google Scholar 

  13. Sunita Sarawagi, Rakesh Agrawal, and Nimrod Megiddo. Discovery-driven exploration of OLAP data cubes. Research Report RJ 10102 (91918), IBM Almaden Research Center, San Jose, CA 95120, January 1998. Available from http://www.almaden.ibm.com/cs/quest.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Hans-Jörg Schek Gustavo Alonso Felix Saltor Isidro Ramos

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sarawagi, S., Agrawal, R., Megiddo, N. (1998). Discovery-driven exploration of OLAP data cubes. In: Schek, HJ., Alonso, G., Saltor, F., Ramos, I. (eds) Advances in Database Technology — EDBT'98. EDBT 1998. Lecture Notes in Computer Science, vol 1377. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0100984

Download citation

  • DOI: https://doi.org/10.1007/BFb0100984

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64264-0

  • Online ISBN: 978-3-540-69709-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics