Honey, I Shrunk the Cube

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8133)

Abstract

Information flooding may occur during an OLAP session when the user drills down her cube to a very fine-grained level: the huge number of facts returned makes them very hard to analyze using a pivot table. To overcome this problem we propose a novel OLAP operation, called shrink, aimed at balancing data precision with data size in cube visualization via pivot tables. The shrink operation fuses slices of similar data and replaces them with a single representative slice, respecting the constraints posed by dimension hierarchies, until the result is smaller than a given threshold. We present a greedy agglomerative clustering algorithm that at each step fuses the two slices yielding the minimum increase in the total approximation error, and discuss some experimental results that show its efficiency and effectiveness.
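
The greedy fusion step described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, only a simplified reading of it under several assumptions that the abstract does not state: the hierarchy has a single level, two slices may be fused only if they share the same parent, the representative slice is the cell-wise mean, the approximation is measured as a sum of squared errors, and fusion stops when at most max_slices slices remain. The names shrink, sse, parent and max_slices are hypothetical.

```python
from itertools import combinations


def sse(cluster):
    """Sum of squared errors of a cluster's slices around their mean slice."""
    n = len(cluster)
    width = len(cluster[0])
    mean = [sum(s[j] for s in cluster) / n for j in range(width)]
    return sum((s[j] - mean[j]) ** 2 for s in cluster for j in range(width))


def shrink(slices, parent, max_slices):
    """Greedily fuse sibling slices until at most max_slices remain.

    slices: dict member -> list of measure values (one per cell of the slice)
    parent: dict member -> parent member (assumed single-level hierarchy)
    Returns a list of (members, representative_slice) pairs.
    """
    # Every cluster starts as a single slice.
    clusters = [([m], [v]) for m, v in slices.items()]
    while len(clusters) > max_slices:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            (mi, vi), (mj, vj) = clusters[i], clusters[j]
            # Assumed hierarchy constraint: only siblings may be fused.
            if parent[mi[0]] != parent[mj[0]]:
                continue
            # Increase in total error caused by fusing the two clusters.
            increase = sse(vi + vj) - sse(vi) - sse(vj)
            if best is None or increase < best[0]:
                best = (increase, i, j)
        if best is None:
            break  # no admissible fusion is left
        _, i, j = best
        merged = (clusters[i][0] + clusters[j][0],
                  clusters[i][1] + clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    result = []
    for members, values in clusters:
        n = len(values)
        rep = [sum(v[j] for v in values) / n for j in range(len(values[0]))]
        result.append((sorted(members), rep))
    return result


# Made-up example data: four city slices with two cells each.
cities = {"Miami": [10, 12], "Orlando": [11, 13],
          "Tampa": [40, 42], "Richmond": [10, 12]}
state_of = {"Miami": "FL", "Orlando": "FL", "Tampa": "FL", "Richmond": "VA"}
for members, rep in shrink(cities, state_of, max_slices=3):
    print(members, rep)
```

In this toy input Miami and Orlando are fused first because they are siblings with nearly identical values, while Richmond, although identical to Miami, is never fused with it because it belongs to a different parent. The pairwise scan is quadratic per step, which is adequate for an illustration but not a statement about the efficiency of the published algorithm.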

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Golfarelli, M., Rizzi, S. (2013). Honey, I Shrunk the Cube. In: Catania, B., Guerrini, G., Pokorný, J. (eds) Advances in Databases and Information Systems. ADBIS 2013. Lecture Notes in Computer Science, vol 8133. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40683-6_14

  • DOI: https://doi.org/10.1007/978-3-642-40683-6_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40682-9

  • Online ISBN: 978-3-642-40683-6

  • eBook Packages: Computer Science, Computer Science (R0)
