Abstract
On-line Analytical Processing (OLAP) represents a good applications package to explore and navigate into data cubes. Though, it is limited to exploratory tasks. It does not assist the decision maker in performing information investigation. Thus, various studies have been trying to extend OLAP to new capabilities by coupling it with data mining algorithms.
Our current proposal stands within this trend. It has two major contributions. First, a Multi-perspectives Cube Exploration Framework (MCEF) is introduced. It is a generalized framework designed to assist the application of classical data mining algorithm on OLAP cubes. Second, a Neural Approach for Prediction over High-dimensional Cubes (NAP-HC) is also introduced, which extends Modular Neural Networks (MNN)s architecture to multidimensional context of OLAP cubes, to predict non-existent measures. A preprocessing stage is embedded in NAP-HC to assist it in facing up the challenges arising from the particularity of OLAP cubes. It consists of an OLAP oriented cube exploration strategy coupled with a dimensions reduction step that reposes on the Principal Component Analysis (PCA). Carried out experiments highlight the efficiency of MCEF in assisting the application of MNNs on OLAP cubes and the high predictive capabilities of NAP-HC.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
American Community Surveys is accessible from the official site IPUMS-USA (Integrated Public Use Microdata Series); http://sda.berkeley.edu.
References
Inmon, W.H.: Building the Data Warehouse. QED Information Sciences Inc, Wellesley, MA, USA (1992)
Goil, S., Choudhary, A.: High performance multidimensional analysis and data mining. In: Proceedings of the High Performance Networking and Computing Conference (SC’1998), Orlando, Florida, US, November 1998
Sarawagi, S., Agrawal, R., Megiddo, N.: Discovery-driven exploration of OLAP data cubes. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, p. 168. Springer, Heidelberg (1998)
Ben Messaoud, R., Loudcher-Rabaseda, S.: Olemar: An on-line environment for mining association rules in multidimensional data. In: Advances in Data Warehousing and Mining, vol. 2. Idea Group Publishing (2007)
Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP (on-line Analytical Processing) to User-analysts: An IT Mandate. Codd and Date, Inc., Manchester (1993)
Bodin-Niemczuk, A., Ben Messaoud, R., Rabaséda, S.L., Boussaid, O.: Vers l’intégration de la prédiction dans les cubes OLAP. In: EGC. (2008) 203–204
Agarwal, D., Chen, B.C.: Latent OLAP: Data cubes over latent variables. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data. SIGMOD 2011, pp. 877–888. ACM, New York (2011)
Palpanas, T., Koudas, N., Mendelzon, A.: Using datacube aggregates for approximate querying and deviation detection. IEEE Trans. Knowl. Data Eng. 17, 1465–1477 (2005)
Chen, B.C., Chen, L., Lin, Y., Ramakrishnan, R.: Prediction Cubes. In: Proceedings of the 31st International Conference on Very large Data Bases. VLDB 2005, pp. 982–993 (2005)
Cuzzocrea, A.: Providing probabilistically-bounded approximate answers to non-holistic aggregate range queries in OLAP. In: Proceedings of the 8th ACM International Workshop on Data Warehousing and OLAP. DOLAP 2005, pp. 97–106. ACM, New York (2005)
Chen, Y., Dong, G., Han, J., Pei, J., Wah, B.W., Wang, J.: Regression cubes with lossless compression and aggregation. IEEE Trans. Knowl. Data Eng. 18, 1585–1599 (2006)
Cuzzocrea, A., Saccà, D.: Balancing accuracy and privacy of OLAP aggregations on data cubes. In: Proceedings of the ACM 13th International Workshop on Data Warehousing and OLAP. DOLAP 2010, pp. 93–98. ACM, New York (2010)
Messaoud, R.B., Rabaséda, S.L., Boussaid, O., Missaoui, R.: Enhanced Mining of Association Rules from Data Cubes. In: Proceedings of the \(9^{th}\) ACM International Workshop on Data Warehousing and OLAP (DOLAP’2006), pp. 11–18. ACM Press, Arlington (November 2006)
Cuzzocrea, A.: Overcoming limitations of approximate query answering in OLAP. In: 9th International Database Engineering and Application Symposium. IDEAS 2005, pp. 200–209 (July 2005)
Jain, A.K.: Fundamentals of Digital Image Processing. Prentice-Hall Inc, Upper Saddle River, NJ, USA (1989)
Abdelbaki, W., Ben Messaoud, R., Ben Yahia, S.: A neural-based approach for extending OLAP to prediction. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 117–129. Springer, Heidelberg (2012)
Abdelbaki, W., Ben Yahia, S., Ben Messaoud, R.: NAP-SC: a neural approach for prediction over sparse cubes. In: Zhou, S., Zhang, S., Karypis, G. (eds.) ADMA 2012. LNCS, vol. 7713, pp. 340–352. Springer, Heidelberg (2012)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Elsevier, Morgan Kaufmann, Burlington (2005)
Cuzzocrea, A., Mansmann, S.: OLAP visualization: models, issues, and techniques. In: Wang, J. (ed.) Encyclopedia of Data Warehousing and Mining, 2nd edn, pp. 1439–1446. IGI Global, Hershey, PA (2009)
Ramakrishnan, R., Chen, B.C.: Exploratory mining in cube space. Data Min. Knowl. Disc. 15(1), 29–54 (2007)
Azam, F.: Biologically inspired modular neural networks. Ph.D. thesis, Virginia Polytechnic Institute and State University, Virginia, USA (2000)
Bishop, C.: Neural Networks For Pattern Recognition. Oxford University Press, Oxford (1995)
Sharkey, A.J. (ed.): Combining Artificial Neural Nets: Ensemble and Modular Multi-Net Systems, 1st edn. Springer-Verlag New York Inc, Secaucus, NJ, USA (1999)
Gallinari, P.: The Handbook of Brain Theory and Neural Networks. MIT Press, Cambridge, MA, USA (1998)
Melin, P., Castillo, O.: Modular neural networks. In: Hybrid Intelligent Systems for Pattern Recognition Using Soft Computing. Studies in Fuzziness and Soft Computing, vol. 172, pp. 109–129. Springer, Heidelberg (2005)
Happel, B.L., Murre, J.M.J.: The design and evolution of modular neural network architectures. Neural Netw. 7, 985–1004 (1994)
Tshilidzi, M.: Computational Intelligence for Missing Data Imputation, Estimation, and Management: Knowledge Optimization Techniques. IGI Publishing, Hershey, PA (2009)
Wang, Z., Xu, J., Lu, F., Zhang, Y.: Using the method combining PCA with BP neural network to predict water demand for urban development. In: Proceedings of the 2009 Fifth International Conference on Natural Computation. ICNC 2009, pp. 621–625. IEEE Computer Society, Washington (2009)
Hotelling, H.: Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 24(7), 498–520 (1933)
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Netw. 2(5), 359–366 (1989)
Rumelhart, D., McClelland, J.: Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Foundations. Computational Models of Cognition and Perception. MIT Press, Cambridge (1986)
Haykin, S.: Neural Networks: a Comprehensive Foundation. Prentice Hall, Prentice Hall International Editions Series (1999)
Ben Othman, L., Ben Yahia, S.: Yet another approach for completing missing values. In: Yahia, S.B., Nguifo, E.M., Belohlavek, R. (eds.) CLA 2006. LNCS (LNAI), vol. 4923, pp. 155–169. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Abdelbaki, W., Yahia, S.B., Messaoud, R.B. (2015). Modular Neural Networks for Extending OLAP to Prediction. In: Hameurlain, A., Küng, J., Wagner, R., Cuzzocrea, A., Dayal, U. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems XXI. Lecture Notes in Computer Science(), vol 9260. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47804-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-662-47804-2_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47803-5
Online ISBN: 978-3-662-47804-2
eBook Packages: Computer ScienceComputer Science (R0)