Abstract
In large companies, On-Line Analytical Processing (OLAP) technologies are widely used by business analysts as a decision support tool. Nevertheless, while exploring a cube, analysts are quickly confronted with a huge number of visible cells that they must analyze to identify the most interesting ones. Coupling OLAP technologies with mining methods may help them by automating this tedious task. In the scope of discovery-driven exploration, this paper presents two methods to detect and highlight interesting cells within a cube slice. A cell's degree of interest is based on the calculation of either a test-value or a Chi-Square contribution. Indicators are computed instantaneously according to the user-defined dimension drill-down. They are displayed through a color-coding system. A proof-of-concept implementation on the ORACLE 10g system is described at the end of the paper.
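To make the Chi-Square contribution indicator concrete, the following is a minimal sketch, assuming the cube slice is a two-way table of counts and that a cell's contribution is its standard share (O - E)^2 / E of the Pearson chi-square statistic, where E is the count expected under independence of rows and columns. The abstract does not give the paper's exact formulas, so the function names, the threshold value, and the "over" / "under" / "neutral" labels standing in for colors are all illustrative assumptions, not the authors' implementation.

```python
from typing import List


def chi_square_contributions(table: List[List[float]]) -> List[List[float]]:
    """Return each cell's share (O - E)^2 / E of the Pearson chi-square statistic.

    E is the expected count under independence: row_total * col_total / grand_total.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    contributions = []
    for i, row in enumerate(table):
        out_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            # A zero expected count carries no information; report 0 for that cell.
            value = (observed - expected) ** 2 / expected if expected > 0 else 0.0
            out_row.append(value)
        contributions.append(out_row)
    return contributions


def color_code(table: List[List[float]], threshold: float = 2.0) -> List[List[str]]:
    """Label cells whose contribution exceeds a threshold.

    The threshold (2.0) and the labels are hypothetical choices for illustration:
    "over" means the observed count is above the expected one, "under" below it.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    contributions = chi_square_contributions(table)
    labels = []
    for i, row in enumerate(table):
        label_row = []
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            if contributions[i][j] <= threshold:
                label_row.append("neutral")
            elif observed > expected:
                label_row.append("over")
            else:
                label_row.append("under")
        labels.append(label_row)
    return labels


if __name__ == "__main__":
    slice_counts = [[30, 10], [10, 30]]  # made-up counts for a 2 x 2 slice
    print(chi_square_contributions(slice_counts))
    print(color_code(slice_counts))
```

In this sketch the contributions sum to the overall chi-square statistic of the slice, so a highlighted cell is one that accounts for a large part of the departure from independence. The test-value indicator mentioned in the abstract would need its own definition from the paper and is not attempted here.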
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cariou, V., Cubillé, J., Derquenne, C., Goutier, S., Guisnel, F., Klajnmic, H. (2007). Built-In Indicators to Automatically Detect Interesting Cells in a Cube. In: Song, I.Y., Eder, J., Nguyen, T.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2007. Lecture Notes in Computer Science, vol 4654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74553-2_12
DOI: https://doi.org/10.1007/978-3-540-74553-2_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74552-5
Online ISBN: 978-3-540-74553-2
eBook Packages: Computer Science, Computer Science (R0)