Skip to main content

Advertisement

Log in

A genetic algorithm for multiple sequence alignment

  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract.

Multiple sequence alignment is an important tool in molecular sequence analysis. This paper presents genetic algorithms to solve multiple sequence alignments. Several data sets are tested and the experimental results are compared with other methods. We find our approach could obtain good performance in the data sets with high similarity and long sequences.The software can be found in http://rsdb.csie.ncu.edu.tw/tools/msa.htm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Aart E, van Laarhoven P (1987) Simulated Annealing: a Review of Theory and Applications, Kluwer Amsterdam

  • Chan Sc, Wong AKC, Chiu DKY (1992). A survey of multiple sequence comparison methods. Bull Math Biol 54: 563–598

  • Chellapilla K, Fogel GB (1999) Multiple sequence alignment using evolutionary programming. Congress on Evolutionary Computation, pp. 445–452

  • Goldberg DE (1989). Genetic Algorithms in Search, Optimization, and Machine Learning . Addison-Wesley, New York

  • Gonzalez RR, Izquierdo CM, Seijas J (1998). Multiple protein sequence comparison by genetic algorithms. In: Proceedings of SPIE: Applications and Science of Computational Intelligence (SPIE-98), Rogers SK, Fogel DB, Bezdek JC, Bosacchi B (eds). SPIE-The International Society for Optical Engineering, Bellingham, Washington, Addison-Wesley, New York, pp. 99–102

  • Isokawa M, Wayama M, Shimizu T (1996) Multiple sequence alignment using a genetic algorithm. Genome Informatics, 7 176–177

    Google Scholar 

  • Jong KD (1988). Learning with genetic algorithms: An overview. In: Machine Learning 3, Kluwer, Hingham, MA, pp. 121–138

  • Jue RA, Woodbury NW, Doolittle RF (1980). Sequence homologies among E. coli ribosomal proteins: evidence for evolutionary related groupings and internal duplications. J Mol Evol, 15: 129–148

    Google Scholar 

  • Lassmann T, Sonnhammer ELL, (2002) Quality assessment of multiple alignment programs. FEBS Letters, 529: 126–130

    Google Scholar 

  • Lawrence C, Altschul S, Boguski M, Liu J, Neuwald A, Wootton J (1993) Detecting subtle sequence signals: a gibbs sampling strategy for multiple alignment. Science, pp. 208–214

  • Murata M, Richardson JS, Sussman JL (1985) Simultaneous comparison of three protein sequences. Proc Natl Acad Sci USA, 82: 3073–3077

    Google Scholar 

  • Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol, 42: 245–161

    Google Scholar 

  • Notredame C (2002) Recent progress in multiple sequence alignment: a survey. Pharmacogenomics, 3: 131–144

    Google Scholar 

  • Notredame C, Higgins DG (1996) SAGA: sequence alignment by genetic algorithm. Nuc. Acids Res 24(8): 1515–1524

  • Notredame C, O’Brien EA, Higgins DG (1997) RAGA: RNA sequence alignment by genetic algorithm. Nuc Acids Res 25(22): 4570–4580

    Google Scholar 

  • Pal SK, Wang PP (1996) Genetic algorithms for pattern recognition. CRC Press, Boca Raton.

  • Setubal J, Meidanis J (1997) Sequence comparison and database search. Introduction To Computational Molecular Biology, PWS, pp. 47–103.

  • Thompson J, Higgins D, Gibson T (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matrix choice. Nuc Acids Res 22(22): 4673–4680

    Google Scholar 

  • Thompson JD, Plewniak F, Poch O (1999) BAliBASE: a benchmark alignment database for the evaluation of multiple sequence alignment programs. Bioinformatics 15: 87–88

    Google Scholar 

  • Wayama M, Takahashi K, Shimizu T (1995) An approach to amino acid sequence alignment using a genetic algorithm. Genome Informatics, 6: 122–123

    Google Scholar 

  • Whitley D (1994) A genetic algorithm tutorial. Statistics and Computing, 4: 65–85

  • Zhang C, Wong AKC (1997) A genetic algorithm for multiple molecular sequence alignment. Comput Appl Biosci 13(6): 565–581

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jorng-Tzong Horng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Horng, JT., Wu, LC., Lin, CM. et al. A genetic algorithm for multiple sequence alignment. Soft Comput 9, 407–420 (2005). https://doi.org/10.1007/s00500-004-0356-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-004-0356-9

Keywords