Abstract
Typical OLAP queries take a long time to be processed so speeding up the execution of each single query is imperative to decision making. ParGRES is an open-source database cluster middleware for high performance OLAP query processing. By exploiting intra-query parallelism on PC clusters, ParGRES has shown excellent performance using the TPC-H benchmark. In this paper, we evaluate ParGRES on a real-world OLAP database. Through adaptive virtual partitioning of the database, ParGRES yields linear and very often super-linear speedup for frequent queries. This shows that ParGRES is a very cost-effective solution for OLAP query processing in real settings.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Akal, F., Böhm, K., Schek, H.J.: OLAP Query Evaluation in a Database Cluster: a Performance Study on Intra-Query Parallelism. In: Manolopoulos, Y., Návrat, P. (eds.) ADBIS 2002. LNCS, vol. 2435, pp. 218–231. Springer, Heidelberg (2002)
Brazilian Institute of Geography and Statistics – IBGE, http://www.ibge.gov.br
Cecchet, E., Marguerite, J., Zwaenepoel, W.: C-JDBC: Flexible Database Clustering Middleware. In: Proceedings of USENIX Annual Technical Conference, Freenix Track, Boston, EUA, pp. 9–18 (June 2004)
Grid5000 Projects Web Site - Grid5000, http://www.grid5000.fr
HSQL Database Engine, http://hsqldb.org/
Kotowski, N., Lima, A.A., Pacitti, E., Valduriez, P., Mattoso, M.L.Q.: Parallel Query Processing for OLAP in Grids. Concurrency and Computation. Practice & Experience (2008), http://dx.doi.org/10.1002/cpe.1303
Lima, A.A.B.: Intra-query parallelism in Database Clusters (in Portuguese). Ph.D. Thesis, COPPE/UFRJ, Rio de Janeiro (2004)
Lima, A.A.B., Mattoso, M., Valduriez, P.: Adaptive Virtual Partitioning for OLAP Query Processing in a Database Cluster. In: Proceedings of the 19th Brazilian Symposium on Database Systems (SBBD 2004), BrasÃlia, Brazil, October 18-20, pp. 92–105 (2004)
Mattoso, M., Zimbrão, G., Lima, A.A.B.: Baião. F., Braganholo, V., Aveleda, A., Miranda, B., Almentero, B., Costa, M.N.: ParGRES Middleware for Executing OLAP Queries in Parallel. Technical report (2005), http://pargres.nacad.ufrj.br/Documentos/ES-690.pdf
Miranda, B., Lima, A.A.B., Valduriez, P., Mattoso, M.: Apuama: Combining Intra-query and Inter-query Parallelism in a Database Cluster. In: Grust, T., Höpfner, H., Illarramendi, A., Jablonski, S., Mesiti, M., Müller, S., Patranjan, P.-L., Sattler, K.-U., Spiliopoulou, M., Wijsen, J. (eds.) EDBT 2006. LNCS, vol. 4254, pp. 649–661. Springer, Heidelberg (2006)
Multidimensional Statistics Database – BME, http://www.bme.ibge.gov.br
ParGRES, http://pargres.nacad.ufrj.br/
PostgreSQL, http://www.postgresql.org
Röhm, U., Bohm, K., Schek, H.-J.: OLAP Query Routing and Physical Design in a Database Cluster. In: Zaniolo, C., Grust, T., Scholl, M.H., Lockemann, P.C. (eds.) EDBT 2000. LNCS, vol. 1777, pp. 254–268. Springer, Heidelberg (2000)
Röhm, U., Bohm, K., Schek, H.-J.: Cache-Aware Query Routing in a Cluster of Databases. In: Proceedings of the 17th International Conference on Data Engineering (ICDE 2001), pp. 641–650. IEEE Computer Society, Los Alamitos (2001)
Röhm, U., Böhm, K., Scheck, H.-J., Schuldt, H.: FAS - A Freshness-Sensitive Coordination Middleware for a Cluster of OLAP Components. In: Proceedings of the 28th International Conference on Very Large Databases Conference (VLDB 2002), Hong Kong, China, August 20-23 (2002)
Sequoia Project, http://sequoia.continuent.org/HomePage
TPC Benchmark C, http://www.tpc.org/tpcc/
TPC Benchmark H, http://www.tpc.org/tpch/
TPC Benchmark W, http://www.tpc.org/tpcw/default.asp
Valduriez, P.: Parallel Database Systems: open problems and new issues. International Journal on Distributed and Parallel Databases 1(2), 137–165 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Paes, M., Lima, A.A.B., Valduriez, P., Mattoso, M. (2008). High-Performance Query Processing of a Real-World OLAP Database with ParGRES. In: Palma, J.M.L.M., Amestoy, P.R., Daydé, M., Mattoso, M., Lopes, J.C. (eds) High Performance Computing for Computational Science - VECPAR 2008. VECPAR 2008. Lecture Notes in Computer Science, vol 5336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92859-1_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-92859-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92858-4
Online ISBN: 978-3-540-92859-1
eBook Packages: Computer ScienceComputer Science (R0)