Abstract
In this paper, a hybrid Genetic Algorithm for solving multiple sequence alignment problems is proposed. Two new mechanisms have been introduced, i.e., one to generate the initial population and the second one is used during mutation operation. Here, the initial populations have been generated by Needleman Wunsch pair-wise alignment method. In the second step, the UPGMA method is used to generate the Guide tree with the help of two different matrix such as dynamic distance and edit distance matrix. The performance of the proposed method has been tested on publicly available benchmark datasets (i.e. Bali base) with some of the existing methods such as PRRP, CLUSTALX, SB−PIMA, MULTALIGN, SAGA, RBT-GA. We find that proposed method is better in most of cases and where it is not better at least close to best solution.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsREFERENCES
Das, S., Abraham, A., Konar, A.: Swarm intelligence algorithms in bioinformatics. Stud. Comput. Intell. 94, 113–147 (2008)
Wang, L., Jiang, T.: On the complexity of multiple sequence alignment. J. Comput. Biol. 1, 337–348 (1994)
Feng, D.F., Doolittle, R.F.: Progressive sequence alignment as a prerequisite to correct phylogenetic trees. J. Mol. Evolution 25, 351–360 (1987)
Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTALW: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl. Acids Res. 22, 4673–4680 (1994)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. In: Proceedings of the IEEE. vol. 77, pp. 257–285 (1989)
Baldi, P., Chauvin, Y., Hunkapiller, T., McClure, M.A.: Hidden Markov Models of biological primary sequence information. Proc. Natl. Acad. Sci. U.S.A. 91, 1059–1063 (1994)
Krogh, A., Brown, M., Mian, I.S., Sjolander, K., Haussler, D.: Hidden Markov models in computational biology: applications to protein modeling. J. Mol. Biol. 235, 1501–1531 (1994)
Eddy, S.R.: Profile hidden Markov models. Bioinformatics 14, 755–763 (1998)
Kim, J., Pramanik, S., Chung, M.J.: Multiple sequence alignment using simulated annealing. Bioinformatics 10, 419–426 (1994)
Lukashin, A.V., Engelbrecht, J., Brunak, S.: Multiple alignment using simulated annealing: branch point definition in human mRNA splicing. Nucl. Acids Res. 20, 2511–2516 (1992)
Chellapilla, K., Fogel, G.B.: Multiple sequence alignment using evolutionary programming. In: Proceedings of the First Congress on Evolution Composition, pp. 445–452 (1999)
Notredame, C., Higgins, D.G.: SAGA: sequence alignment by genetic algorithm. Nucl. Acids Res. 24, 1515–1524 (1996)
Thomsen, R.: Evolving the topology of hidden markov models using evolutionary algorithms. In: Guervós, J.J.M., Adamidis, P.A., Beyer, H.G., Fernández-Villacañas, J.L., Schwefel, H.P. (eds.) PPSN 2002. LNCS, vol. 2439, pp. 861–870. Springer, Heidelberg (2002)
Taheri, J., Zomaya, A.Y.: RBT-GA: a novel metaheuristic for solving the multiple sequence alignment problem. BMC Genom. 10, 1–11 (2009)
Dayhoff, M.O., Schwartz, R.M., Orcutt, B.C.: A model of evolutionary change in proteins. Atlas Protein Seq. Struct. 5, 345–351 (1978)
Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins, D.G.: The CLUSTAL − X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl. Acids Res. 25, 4876–4882 (1997)
Naznin, F., Sarker, R., Essam, D.: Progressive alignment method using genetic algorithm for multiple sequence alignment. IEEE Trans. Evol. Comput. 16, 615–631 (2012)
Yadav, R.K., Banka, H.: Genetic algorithm with improved mutation operator for multiple sequence alignment. In: Mandal, J.K., Satapathy, S.C., Sanyal, M.K., Sarkar, P.P., Mukhopadhyay, A. (eds.) Information Systems Design and Intelligent Applications, pp. 515–524. Springer, New Delhi (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Yadav, R.K., Banka, H. (2016). A Hybrid Genetic Algorithm Using Dynamic Distance in Mutation Operator for Solving MSA Problem. In: Panigrahi, B., Suganthan, P., Das, S., Satapathy, S. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2015. Lecture Notes in Computer Science(), vol 9873. Springer, Cham. https://doi.org/10.1007/978-3-319-48959-9_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-48959-9_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48958-2
Online ISBN: 978-3-319-48959-9
eBook Packages: Computer ScienceComputer Science (R0)