Abstract.
Multiple sequence alignment is an important tool in molecular sequence analysis. This paper presents genetic algorithms for solving multiple sequence alignment problems. Several data sets are tested, and the experimental results are compared with those of other methods. We find that our approach performs well on data sets with high similarity and long sequences. The software is available at http://rsdb.csie.ncu.edu.tw/tools/msa.htm.
Horng, JT., Wu, LC., Lin, CM. et al. A genetic algorithm for multiple sequence alignment. Soft Comput 9, 407–420 (2005). https://doi.org/10.1007/s00500-004-0356-9