Skip to main content

Sidera: A Cluster-Based Server for Online Analytical Processing

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4804))

Abstract

Online Analytical Processing (OLAP) has become a primary component of today’s pervasive Decision Support systems. The rich multi-dimensional analysis that OLAP provides allows corporate decision makers to more fully assess and evaluate organizational progress than ever before. However, as the data repositories upon which OLAP is based become larger and larger, single CPU OLAP servers are often stretched to, or even beyond, their limits. In this paper, we present a comprehensive architectural model for a fully parallelized OLAP server. Our multi-node platform actually consists of a series of largely independent sibling servers that are “glued” together with a lightweight MPI-based Parallel Service Interface (PSI). Physically, we target the commodity-oriented, “shared nothing” Linux cluster, a model that provides an extremely cost effective alterative to the “shared everything” commercial platforms often used in high-end database environments. Experimental results demonstrate both the viability and robustness of the design.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Beyer, K., Ramakrishnan, R.: Bottom-up computation of sparse and iceberg cubes. In: ACM SIGMOD, pp. 359–370 (1999)

    Google Scholar 

  2. Boral, H., Alexander, W., Clay, L., Copeland, G., Danforth, S., Franklin, M., Hart, B., Smith, M., Valduriez, P.: Prototyping bubba, a highly parallel database system. Transactions on Knowledge and Data Engineering 2(1), 4–24 (1990)

    Article  Google Scholar 

  3. Chang, C., Acharya, A., Sussman, A., Saltz, J.: T2: a customizable parallel database for multi-dimensional data. SIGMOD Record 27(1), 58–66 (1998)

    Article  Google Scholar 

  4. Chen, Y., Dehne, F., Eavis, T., Rau-Chaplin, A.: Parallel rolap datacube construction on shared nothing multi-processors. Journal of Distributed and Parallel Databases 15(3), 219–236 (2004)

    Article  Google Scholar 

  5. Cruanes, T., Dageville, B., Ghosh, B.: Parallel sql execution in oracle 10g. In: ACM SIGMOD, pp. 850–854 (2004)

    Google Scholar 

  6. Dehne, F., Eavis, T., Hambrusch, S., Rau-Chaplin, A.: Parallelizing the datacube. Journal Distributed and Parallel Databases 11(2), 181–201 (2001)

    Google Scholar 

  7. Dehne, F., Eavis, T., Rau-Chaplin, A.: Rcube: Parallel multi-dimensional rolap indexing. Journal of Data Warehousing and Mining (to appear)

    Google Scholar 

  8. Dehne, F., Eavis, T., Rau-Chaplin, A.: The cgmCUBE project: Optimizing parallel data cube generation for rolap. Journal of Distributed and Parallel Databases 19(1), 29–62 (2006)

    Article  Google Scholar 

  9. DeWitt, D., Gray, J.: Parallel database systems: the future of high performance database systems. Communications of the ACM 35(6), 85–98 (1992)

    Article  Google Scholar 

  10. DeWitt, D.J., Ghandeharizadeh, S., Schneider, D.A., Bricker, A., Hsaio, H.-I, Rasmussen, R.: The Gamma database machine project. Transactions on Knowledge and Data Engineering 2(1), 44–62 (1990)

    Article  Google Scholar 

  11. Eavis, T., Cueva, D.: A hilbert space compression architecture for data warehouse environments. In: DaWaK (2007) (accepted for publication)

    Google Scholar 

  12. Eavis, T., Lopez, A.: rk-hist: An r-tree based histogram for multi-dimensional selectivity estimation (2007) (currently under review)

    Google Scholar 

  13. Eavis, T., Taleb, A.: mapgraph: efficient methods for complex olap hierarchies (2007) (currently under review)

    Google Scholar 

  14. Furtado, C., Lima, A., Pacitti, E., Valduriez, P., Mattoso, M.: Physical and virtual partitioning in olap database clusters. In: SBAC. Int. Symp. on Computer Architecture and High Performance Computing, pp. 143–150 (2005)

    Google Scholar 

  15. Furtado, P.: Experimental evidence on partitioning in parallel data warehouses. In: DOLAP, pp. 23–30 (2004)

    Google Scholar 

  16. Goil, S., Choudhary, A.: High performance multidimensional analysis of large datasets. In: DOLAP, pp. 34–39 (1998)

    Google Scholar 

  17. Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: ICDE, pp. 152–159 (1996)

    Google Scholar 

  18. Guttman, A.: R-trees: A dynamic index structure for spatial searching. In: Proceedings of the 1984 ACM SIGMOD Conference, pp. 47–57 (1984)

    Google Scholar 

  19. Harinarayan, V., Rajaraman, A., Ullman, J.: Implementing data cubes. In: ACM SIGMOD, pp. 205–216 (1996)

    Google Scholar 

  20. Jin, R., Vaidyanathan, K., Yang, G., Agrawal, G.: Communication and memory optimal parallel data cube construction. Transactions on Parallel and Distributed Systems 16(12), 1105–1119 (2005)

    Article  Google Scholar 

  21. Mittal, A., Dandamudi, S.: Dynamic versus static locking in real-time parallel database systems. Parallel and Distributed Processing Symposium, 32–42 (2004)

    Google Scholar 

  22. Morfonios, K., Ioannidis, Y.: CURE for cubes: cubing using a ROLAP engine. In: VLDB, pp. 379–390 (2006)

    Google Scholar 

  23. Rao, J., Zhang, C., Megiddo, N., Lohman, G.: Automating physical database design in a parallel database. In: ACM SIGMOD, pp. 558–569 (2002)

    Google Scholar 

  24. Rauch, F., Stricker, T.: Os support for a commodity database on pc clusters: distributed devices vs. distributed file systems. In: Australasian database conference, pp. 145–154 (2005)

    Google Scholar 

  25. Scheuermann, P., Weikum, G., Zabback, P.: Data partitioning and load balancing in parallel disk systems. The VLDB Journal 7(1), 48–66 (1998)

    Article  Google Scholar 

  26. Shi, H., Schaeffer, J.: Parallel sorting by regular sampling. Journal of Parallel and Distributed Computing 14, 361–372 (1990)

    Article  Google Scholar 

  27. Sismanis, Y., Deligiannakis, A., Kotidis, Y., Roussopoulos, N.: Hierarchical dwarfs for the rollup cube. In: DOLAP, pp. 17–24 (2003)

    Google Scholar 

  28. Smith, J., Watson, P., Sampaio, S., Paton, N.: Polar: An architecture for a parallel ODMG compliant object database. In: CIKM, pp. 352–359 (2000)

    Google Scholar 

  29. Märtens, H., Stöhr, T., Rahm, E.: Multi-dimensional database allocation for parallel data warehouses. In: VLDB, pp. 273–284 (2000)

    Google Scholar 

  30. Böhm, K., Röhm, U., Schek, H.-J.: Routing and physical design in a database cluster. In: Zaniolo, C., Grust, T., Scholl, M.H., Lockemann, P.C. (eds.) EDBT 2000. LNCS, vol. 1777, pp. 254–268. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  31. Zhao, Y., Deshpande, P., Naughton, J.: An array-based algorithm for simultaneous multi-dimensional aggregates. In: ACM SIGMOD, pp. 159–170 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Robert Meersman Zahir Tari

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Eavis, T., Dimitrov, G., Dimitrov, I., Cueva, D., Lopez, A., Taleb, A. (2007). Sidera: A Cluster-Based Server for Online Analytical Processing. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2007: CoopIS, DOA, ODBASE, GADA, and IS. OTM 2007. Lecture Notes in Computer Science, vol 4804. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76843-2_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-76843-2_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76835-7

  • Online ISBN: 978-3-540-76843-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics