Skip to main content

Functional Dependencies in Controlling Sparsity of OLAP Cubes

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1874))

Abstract

We will study how relational dependency information can be applied to OLAP cube design. We use dependency information to control sparsity, since functional dependencies between dimensions clearly increase sparsity. Our method helps the user in finding dimensions and hierarchies, identifying sparsity risks, and finally changing the design in order to get a more suitable result. Sparse raw data, a large amount of pre-calculated aggregations, and many dimensions may expand the need of the storage space so rapidly that the problem cannot be solved by increasing the capacity of the system. We give two methods to construct suitable OLAP cubes. In the synthesis method, attributes are divided into equivalence classes according to dependencies in which they participate. Each equivalence class may form a dimension. The decomposition method is applied when candidates for dimensions exist. We decompose dimensions based on conflicts, and construct new cubes for removed dimensions until no conflicts between dimensions exist.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cabibbo, L. and Torlone, R.: A logical approach to multidimensional databases, 6th International Conference on Extending Database Technology (EDBT’98, Valencia, Spain 23–27 March), G. Alonso (eds.), LNCS, Vol. 1377, pages 183–197, Springer, 1998.

    Google Scholar 

  2. Chen, P. P.: The Entity-Relationship model: Toward a Unified View of Data, ACM Transactions on Database Systems, 1(1), 1976.

    Google Scholar 

  3. Codd, E. F.: A relational model for large shared data banks, Communications of the ACM, 13(6), 1970.

    Google Scholar 

  4. Codd, E. F.: Further Normalization of the Data Base Relational Model, Data Base Systems, Courant Computer Science Symposia Series 6, R. Rustin (ed.), Prentice-Hall, Englewood Cliffs, New Jersey, 1972.

    Google Scholar 

  5. Golfarelli, M., Maio, D., and Rizzi, S.: Conceptual design of data warehouses from E/R schemes, Proceedings of the 31st Hawaii International Conference on System Sciences, 1998.

    Google Scholar 

  6. Lenz, H.-J. and Shoshani, A.: Summarizability in OLAP and Statistical Data Bases, Ninth International Conference On Scientific And Statistical Database Management (SSDBM), pages 132–143, 1997.

    Google Scholar 

  7. Lehner, W., Albrecht, J., and Wedekind, H.: Multidimensional normal forms, Proceedings of the 10th International Conference on Scientific and Statistical Data Management (SSDBM’98), Capri, Italy, 1998.

    Google Scholar 

  8. Li, C. and Wang, X. S.: A data model for supporting on-line analytical processing, Proceedings of the Conference on Information and Knowledge Management (CIKM’96), pages 81–88, 1996.

    Google Scholar 

  9. Niemi, T., Nummenmaa, J., and Thanisch, P.: Applying dependency theory to conceptual modelling, Topics in Conceptual Analysis and Modeling, Czech Academy of Sciences’ Publishing House Filosofia, Prague, 2000. (to be published)

    Google Scholar 

  10. Nummenmaa, J.: Designing Desirable Database Schemas, Department of Computer Science, University of Tampere, A-1995-1, Tampere, 1995. (Ph.D. Thesis)

    Google Scholar 

  11. Pedersen, T. and Jensen, C.: Multidimensional data modeling for complex data, Proceedings of the International Conference on Data Engineering (ICDE’99), 1999.

    Google Scholar 

  12. Pendse, N.: Database Explosion, The OLAP Report, 2000. Available at http://www.olapreport.com/DatabaseExplosion.htm.

  13. Sapia, C., Blaschka, M., Höfling, G., and Dinter, B.: Extending the E/R model for the multidimensional paradigm, Advances in Database Technologies, Y. Kambayashi et al. (eds.), pages 105–116, LNCS 1552, Springer, 1998.

    Google Scholar 

  14. Wang, X. S. and Li, C.: Deriving orthogonality to optimize the search for summary data, Information Systems, 24(1), pages 47–65, 1999.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Niemi, T., Nummenmaa, J., Thanisch, P. (2000). Functional Dependencies in Controlling Sparsity of OLAP Cubes. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2000. Lecture Notes in Computer Science, vol 1874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44466-1_20

Download citation

  • DOI: https://doi.org/10.1007/3-540-44466-1_20

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67980-6

  • Online ISBN: 978-3-540-44466-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics