Abstract
The alignment and comparison of DNA, RNA and Protein sequences is one of the most common and important tasks in Bioinformatics. However, due to the size and complexity of the search space involved, the search for the best possible alignment for a set of sequences is not trivial. Genetic Algorithms have a predisposition for optimizing general combinatorial problems and therefore are serious candidates for solving multiple sequence alignment tasks. Local search optimization can be used to refine the solutions explored by Genetic Algorithms. We have designed a Genetic Algorithm which incorporates local search for this purpose: AlineaGA. We have tested AlineaGA with representative sequence sets of the globin family. We also compare the achieved results with the results provided by T-COFFEE.
Similar content being viewed by others
Abbreviations
- GA:
-
Genetic Algorithm
References
Pal SK, Bandyopadhyay S, Ray SS (2006) Evolutionary computation in bioinformatics: a review. IEEE Trans Syst Man Cybern C Appl Rev 36:601–615
Horng J, Wu L, Lin C, Yang B (2005) A genetic algorithm for multiple sequence alignment. Soft Comput 9:407–420
Notredame C, Higgins DG, Heringa J (2000) T-Coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol 302:205–217
Lassmann T, Sonnhammer ELL (2002) Quality assessment of multiple alignment programs. FEBS Lett 529:126–130
Lecompte O, Thompsony JD, Plewniak F, Thierry J-C, Poch O (2001) Multiple alignment of complete sequences (MACS) in the post-genomic era. Gene 270(1–2):17–30
Anbarasu LA, Narayanasamy P, Sundararajan V (2000) Multiple molecular sequence alignment by island parallel genetic algorithm. Curr Sci 78:858–863
Holland JH (1975) Adaptation in natural and artificial systems. Univ Mich Press, Ann Arbor
Michalewicz Z (1996) Genetic algorithms + data structures = evolution programs, 3rd, Revised and Extended Edition. Springer, Berlin
De Jong K (1988) Learning with genetic algorithms: an overview. Mach Learn 3:121–138
Hart WE, Krasnogor N, Smith JE (2004) Memetic evolutionary algorithms. Stud Fuzz Soft Comput 166:3–30
Surry PD, Radcliffe NJ (1996) Inoculation to initialise evolutionary search. In: Lecture notes in computer science. Springer, Berlin, pp 269–285
Smith JE, Bartley M, Fogarty TC (1997) Microprocessor design verification by two-phase evolution of variable length tests. In: Proceedings of the 1997 IEEE international conference on evolutionary computation. IEEE Press, Indianapolis, pp 453–458
Notredame C, Higgins DG (1996) SAGA: sequence alignment by genetic algorithm. Nucl Acids Res 24:1515–1524
Silva FJM, Sánchez Pérez JM, Gómez Pulido JA, Vega Rodríguez MA (2007) Alineamiento múltiple de secuencias utilizando algoritmos genéticos: revisión. In: Segundo congreso Español de informática, CEDI, Zaragoza, Spain, pp 41–48
Isokawa M, Wayama M, Toshio S (1996) Multiple sequence alignment using a genetic algorithm. Genome Inf 7:176–177
Wayama M, Takahashi K, Shimizu T (1995) An approach to amino acid sequence alignment using a genetic algorithm. Genome Inf 6:122–123
Zhang C, Wong AKC (1997) A genetic algorithm for multiple molecular sequence alignment. Comput Appl Biosci 13:565–581
Wang C, Lefkowitz EJ (2005) Genomic multiple sequence alignments: refinement using a genetic algorithm. BMC Bioinf 6:200
Rognes T (2001) ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches. Nucl Acids Res 29:1647–1652
Silva FJM, Sánchez Pérez JM, Gómez Pulido JA, Vega Rodríguez MA (2008) AlineaGA: a genetic algorithm for multiple sequence alignment. In: Nguyen NT, Katarzyniak R (eds) New challenges in applied intelligence technologies, vol 134. Springer, Berlin, pp 309–318
Goldberg DE (1989) Genetic algorithms in search, optimization, and machine learning. Addison-Wesley, Reading
Notredame C, O’Brien EA, Higgins DG (1997) RAGA: RNA sequence alignment by genetic algorithm. Nucl Acids Res 25:4570–4580
Dayhoff MO, Schwartz RM, Orcutt BC (1978) A model of evolutionary change in proteins. Atlas of protein sequence and structure, vol 5. National Biomedical Research Foundation, pp 345–352
Nicholas HB Jr, Ropelewski AJ, Deerfield DW II (2002) Strategies for multiple sequence alignment. BioTech 32:572–591
Huayang X, Mengjie Z, Peter A (2007) Another investigation on tournament selection: modelling and visualisation. In: Proceedings of the 9th annual conference on genetic and evolutionary computation. ACM, London, pp 1468–1475
Chellapilla K, Fogel GB (1999) Multiple sequence alignment using evolutionary programming. In: Angeline PJ, Michalewicz Z, Schoenauer M, Yao X, Zalzala A (eds) Evolutionary computation, 1999. CEC 99. Proceedings of the 1999 congress on. IEEE Press, Washington, pp 445–452
Horng J-T, Lin C-M, Liu B-J, Kao C-Y (2000) Using genetic algorithms to solve multiple sequence alignments. In: Whitley LD, Goldberg DE, Cantu-Paz E, Spector L, Parmee IC, Beyer H-G (eds) Proceedings of the genetic and evolutionary computation conference (GECCO-2000). Morgan Kaufmann, Las Vegas, pp 883–890
Consortium U (2008) The universal protein resource (UniProt). Nucl Acids Res 36:190–195
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
da Silva, F.J.M., Sánchez Pérez, J.M., Gómez Pulido, J.A. et al. AlineaGA—a genetic algorithm with local search optimization for multiple sequence alignment. Appl Intell 32, 164–172 (2010). https://doi.org/10.1007/s10489-009-0189-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-009-0189-4