Skip to main content

High-Performance Query Processing of a Real-World OLAP Database with ParGRES

  • Conference paper
High Performance Computing for Computational Science - VECPAR 2008 (VECPAR 2008)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5336))

Abstract

Typical OLAP queries take a long time to be processed so speeding up the execution of each single query is imperative to decision making. ParGRES is an open-source database cluster middleware for high performance OLAP query processing. By exploiting intra-query parallelism on PC clusters, ParGRES has shown excellent performance using the TPC-H benchmark. In this paper, we evaluate ParGRES on a real-world OLAP database. Through adaptive virtual partitioning of the database, ParGRES yields linear and very often super-linear speedup for frequent queries. This shows that ParGRES is a very cost-effective solution for OLAP query processing in real settings.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Akal, F., Böhm, K., Schek, H.J.: OLAP Query Evaluation in a Database Cluster: a Performance Study on Intra-Query Parallelism. In: Manolopoulos, Y., Návrat, P. (eds.) ADBIS 2002. LNCS, vol. 2435, pp. 218–231. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  2. Brazilian Institute of Geography and Statistics – IBGE, http://www.ibge.gov.br

  3. Cecchet, E., Marguerite, J., Zwaenepoel, W.: C-JDBC: Flexible Database Clustering Middleware. In: Proceedings of USENIX Annual Technical Conference, Freenix Track, Boston, EUA, pp. 9–18 (June 2004)

    Google Scholar 

  4. Grid5000 Projects Web Site - Grid5000, http://www.grid5000.fr

  5. HSQL Database Engine, http://hsqldb.org/

  6. Kotowski, N., Lima, A.A., Pacitti, E., Valduriez, P., Mattoso, M.L.Q.: Parallel Query Processing for OLAP in Grids. Concurrency and Computation. Practice & Experience (2008), http://dx.doi.org/10.1002/cpe.1303

  7. Lima, A.A.B.: Intra-query parallelism in Database Clusters (in Portuguese). Ph.D. Thesis, COPPE/UFRJ, Rio de Janeiro (2004)

    Google Scholar 

  8. Lima, A.A.B., Mattoso, M., Valduriez, P.: Adaptive Virtual Partitioning for OLAP Query Processing in a Database Cluster. In: Proceedings of the 19th Brazilian Symposium on Database Systems (SBBD 2004), Brasília, Brazil, October 18-20, pp. 92–105 (2004)

    Google Scholar 

  9. Mattoso, M., Zimbrão, G., Lima, A.A.B.: Baião. F., Braganholo, V., Aveleda, A., Miranda, B., Almentero, B., Costa, M.N.: ParGRES Middleware for Executing OLAP Queries in Parallel. Technical report (2005), http://pargres.nacad.ufrj.br/Documentos/ES-690.pdf

  10. Miranda, B., Lima, A.A.B., Valduriez, P., Mattoso, M.: Apuama: Combining Intra-query and Inter-query Parallelism in a Database Cluster. In: Grust, T., Höpfner, H., Illarramendi, A., Jablonski, S., Mesiti, M., Müller, S., Patranjan, P.-L., Sattler, K.-U., Spiliopoulou, M., Wijsen, J. (eds.) EDBT 2006. LNCS, vol. 4254, pp. 649–661. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  11. Multidimensional Statistics Database – BME, http://www.bme.ibge.gov.br

  12. ParGRES, http://pargres.nacad.ufrj.br/

  13. PostgreSQL, http://www.postgresql.org

  14. POWERDB, http://www.dbs.ethz.ch/archive/index.html

  15. Röhm, U., Bohm, K., Schek, H.-J.: OLAP Query Routing and Physical Design in a Database Cluster. In: Zaniolo, C., Grust, T., Scholl, M.H., Lockemann, P.C. (eds.) EDBT 2000. LNCS, vol. 1777, pp. 254–268. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  16. Röhm, U., Bohm, K., Schek, H.-J.: Cache-Aware Query Routing in a Cluster of Databases. In: Proceedings of the 17th International Conference on Data Engineering (ICDE 2001), pp. 641–650. IEEE Computer Society, Los Alamitos (2001)

    Chapter  Google Scholar 

  17. Röhm, U., Böhm, K., Scheck, H.-J., Schuldt, H.: FAS - A Freshness-Sensitive Coordination Middleware for a Cluster of OLAP Components. In: Proceedings of the 28th International Conference on Very Large Databases Conference (VLDB 2002), Hong Kong, China, August 20-23 (2002)

    Google Scholar 

  18. Sequoia Project, http://sequoia.continuent.org/HomePage

  19. TPC Benchmark C, http://www.tpc.org/tpcc/

  20. TPC Benchmark H, http://www.tpc.org/tpch/

  21. TPC Benchmark W, http://www.tpc.org/tpcw/default.asp

  22. Valduriez, P.: Parallel Database Systems: open problems and new issues. International Journal on Distributed and Parallel Databases 1(2), 137–165 (1993)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Paes, M., Lima, A.A.B., Valduriez, P., Mattoso, M. (2008). High-Performance Query Processing of a Real-World OLAP Database with ParGRES. In: Palma, J.M.L.M., Amestoy, P.R., Daydé, M., Mattoso, M., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2008. VECPAR 2008. Lecture Notes in Computer Science, vol 5336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92859-1_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-92859-1_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-92858-4

  • Online ISBN: 978-3-540-92859-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics