Design of distributed database systems: an iterative genetic algorithm

Song, Sukkyu

doi:10.1007/s10844-013-0269-0

Design of distributed database systems: an iterative genetic algorithm

Published: 09 August 2013

Volume 45, pages 29–59, (2015)
Cite this article

Journal of Intelligent Information Systems Aims and scope Submit manuscript

Sukkyu Song¹

572 Accesses
5 Citations
3 Altmetric
Explore all metrics

Abstract

The two important aspects for design of distributed database systems are operation allocation and data allocation. Operation allocation refers to query execution plan indicating which operations (subqueries) should be allocated to which sites in a computer network, so that query processing costs are minimized. Data allocation is to allocate relations to sites so that the performance of distributed database are improved. In this research, we developed a solution technique for operation allocation and data allocation problem, using three objective functions: total time minimization or response time minimization, and the combination of total time and response time minimization. We formulated these allocation problems and provided analytical cost models for each objective function. Since the problem is NP-hard, we proposed a heuristic solution based on genetic algorithm (GA). Comparison of results with the exhaustive enumeration indicated that GA produced optimal solutions in all cases in much less time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Vector Evaluated Genetic Algorithm-Based Distributed Query Plan Generation in Distributed Database

A new approach based on greedy minimizing algorithm for solving data allocation problem

Article 23 May 2023

Data allocation in distributed database systems: a novel hybrid method based on differential evolution and variable neighborhood search

Article 29 November 2019

References

Arcangeli, J., Hameurlain, A., Migeon, E., Morvan, F. (2004). Mobile agent based self-adaptive join for wide-area distributed query processing. Journal of Database Management, 15(4), 25–44.
Article MATH Google Scholar
Baiao, F., Mattoso, M., Zaverucha, G. (2004). A distribution design methodology for object DBMS. Journal of Distributed and Parallel Databases, 16(1), 45–90.
Article Google Scholar
Bellatreche, L., Cuzzocrea, A., Benkrid, S. (2010). F&A: a methodology for effectively and efficiently designing parallel relational data warehouses on hetrogenous database clusters. In Data warehouse and knowledge discovery (DaWak) (pp. 89–104)
Bellatreche, L., Cuzzocrea, A., Benkrid, S. (2012). Effectively and efficiently designing and querying parallel relational data warehouses on hetrogenous database clusters: the F&A approach. Journal of Database Management, 23(4), 17–51.
Article Google Scholar
Bergsten, B., Couprie, M., Valduriez, P. (1993). Overview of parallel architectures for database. The Computer Journal, 36, 734–740.
Article Google Scholar
Cheng, C., Lee, W., Wong, K. (2002). A genetic algorithm-based clustering approach for database partitioning. IEEE Transactions on Systems, Man, and Cybernetics, 32(3), 215–230.
Article Google Scholar
Cuadrado, J. (1995). Optimize database queries. In Byte (pp. 57–63).
Davis, L. (1991). Handbook of genetic algorithms. New York, NY: Van Nostrand Reinhold.
Google Scholar
Du, J., Alhajj, R., Barker, K. (2006). Genetic algorithms based approach to database vertical partitioning. Journal of Intelligent Information Systems, 26(2), 167–183.
Article MATH Google Scholar
Goldberg, D.E. (1989). Genetic algorithms in search, optimization, and machine learning. Reading, MA: Addison-Wesley.
MATH Google Scholar
Gorla, N. (2001). An object-oriented database design for improved performance. Data and Knowledge Engineering, 37, 117–138.
Article MATH Google Scholar
Gu, X., Lin, W., Bharadwaj, V. (2006). Practically realizable efficient data allocation and replication strategies for distributed databases with buffer constraints. IEEE Transactions on Parallel & Distributed Systems, 17(9), 1001–1013.
Article Google Scholar
Hababeh, I., Ramachandran, M., Bowring, N. (2007). A high-performance computing method for data allocation in distributed databse systems. Journal of Supercomputing, 39(1), 3–18.
Article Google Scholar
Johansson, J., March, S., Naumann, J. (2003). Modeling network latency and parallel processing in distributed database design. Decision Sciences, 34(4), 677–706.
Article Google Scholar
Keshavamurthy, B., Bettahally, K., Asad, D. (2013). Privacy preserving assocation rule mining over distributed databases using genetic algorithm. Neural Computing & Applications, 22, 351–364.
Article Google Scholar
Kossmann, D. (2000). The state of the art in distributed query processing. ACM Computing Surveys, 32(4), 422–469.
Article Google Scholar
Li, B., & Jiang, W. (2000). A novel stochastic optimization algorithm. IEEE Transactions on Systems, Man, and Cybernetics: Part B, 30(1), 191–198.
Google Scholar
March, S.T., & Rho, S. (1995). Allocating data and operations to nodes in distributed database design. IEEE Transactions on Knowledge and Data Engineering, 7(2), 305–317.
Article Google Scholar
Martin, T., Lam K., Russel, J. (1990). An evaluation of site selection algorithms for distributed query processing. The Computer Journal, 33(1), 61–70.
Article Google Scholar
Menon, S. (2005). Allocating fragments in distributed databases. IEEE Transactions on Parallel & Distributed Systems, 16(7), 577–585.
Article Google Scholar
Ozsu, M., & Valduriez, P. (1991). Principles of distributed database systems, englewood cliffs. Englewood Cliffs, NJ: Prentice-Hall.
Google Scholar
Seshadri, S., & Cooper, B. (2007). Routing queries through a peer-to-peer infobeacons network using information retrieval techniques. IEEE Transactions on Parallel & Distributed Systems, 18(12), 1754–1765.
Article Google Scholar
Sevince, E., & Cosar, A. (2011). An evolutionary genetic algorithm for optimization of distributed database queries. The Computer Journal, 54(5), 717–725.
Article Google Scholar
Song, S., & Gorla, N. (2000). A genetic algorithm for vertical fragmentation and access path selection. The Computer Journal, 43(1), 81–93.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Youngsan University, Busan, Republic of Korea
Sukkyu Song

Authors

Sukkyu Song
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sukkyu Song.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Song, S. Design of distributed database systems: an iterative genetic algorithm. J Intell Inf Syst 45, 29–59 (2015). https://doi.org/10.1007/s10844-013-0269-0

Download citation

Received: 28 September 2012
Revised: 02 July 2013
Accepted: 15 July 2013
Published: 09 August 2013
Issue Date: August 2015
DOI: https://doi.org/10.1007/s10844-013-0269-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Design of distributed database systems: an iterative genetic algorithm

Abstract

Access this article

Similar content being viewed by others

Vector Evaluated Genetic Algorithm-Based Distributed Query Plan Generation in Distributed Database

A new approach based on greedy minimizing algorithm for solving data allocation problem

Data allocation in distributed database systems: a novel hybrid method based on differential evolution and variable neighborhood search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Design of distributed database systems: an iterative genetic algorithm

Abstract

Access this article

Similar content being viewed by others

Vector Evaluated Genetic Algorithm-Based Distributed Query Plan Generation in Distributed Database

A new approach based on greedy minimizing algorithm for solving data allocation problem

Data allocation in distributed database systems: a novel hybrid method based on differential evolution and variable neighborhood search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation