Abstract
The paper is dedicated to a problem of effective query processing in cluster database systems. An original approach to data allocation and replication at nodes of a cluster system is presented. On the basis of this approach the load balancing method is developed. Also, we propose a new method for parallel query processing on the cluster systems. All described methods have been implemented in “Omega” parallel database management system prototype. Our experiments show that “Omega” system demonstrates nearly linear scalability even in presence of data skew.
This work was supported by the Russian foundation for basic research (project 09-07-00241-a) and Grant of the President of the Russian Federation for young scientists supporting (project MK-3535.2009.9).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Communications of ACM 51(1), 107–113 (2008)
Chaudhuri, S., Narasayya, V.: Self-tuning database systems: a decade of progress. In: Proceedings of the 33rd International Conference on Very Large Data Bases, Vienna, Austria, September 23-27, pp. 3–14 (2007)
Xu, Y., Kostamaa, P., Zhou, X., Chen, L.: Handling data skew in parallel joins in shared-nothing systems. In: Proceedings of ACM SIGMOD International Conference on Management of Data Vancouver, Canada, June 9-12, pp. 1043–1052. ACM, New York (2008)
Han, W., Ng, J., Markl, V., Kache, H., Kandil, M.: Progressive optimization in a shared-nothing parallel database. In: Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, Beijing, China, June 11-14, pp. 809–820 (2007)
Zhou, J., Cieslewicz, J., Ross, K.A., Shah, M.: Improving database performance on simultaneous multithreading processors. In: Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30-September 2, pp. 49–60 (2005)
Lakshmi, M.S., Yu, P.S.: Effect of Skew on Join Performance in Parallel Architectures. In: Proceedings of the First International Symposium on Databases in Parallel and Distributed Systems, Austin, Texas, United States, pp. 107–120. IEEE Computer Society Press, Los Alamitos (1988)
Ferhatosmanoglu, H., Tosun, A.S., Canahuate, G., Ramachandran, A.: Efficient parallel processing of range queries through replicated declustering. Distrib. Parallel Databases 20(2), 117–147 (2006)
Kostenetskii, P.S., Lepikhov, A.V., Sokolinskii, L.B.: Technologies of parallel database systems for hierarchical multiprocessor environments. Automation and Remote Control 5, 112–125 (2007)
Sokolinsky, L.B.: Organization of Parallel Query Processing in Multiprocessor Database Machines with Hierarchical Architecture. Programming and Computer Software 27(6), 297–308 (2001)
Lepikhov, A.V., Sokolinsky, L.B.: Data Placement Strategy in Hierarchical Symmetrical Multiprocessor Systems. In: Proceedings of Spring Young Researchers Colloquium in Databases and Information Systems (SYRCoDIS 2006), June 1-2, pp. 31–36. Moscow State University, Moscow (2006)
Parallel DBMS “Omega” official page, http://omega.susu.ru
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lepikhov, A.V., Sokolinsky, L.B. (2010). Query Evaluation Techniques for Cluster Database Systems. In: Catania, B., Ivanović, M., Thalheim, B. (eds) Advances in Databases and Information Systems. ADBIS 2010. Lecture Notes in Computer Science, vol 6295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15576-5_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-15576-5_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15575-8
Online ISBN: 978-3-642-15576-5
eBook Packages: Computer ScienceComputer Science (R0)