Skip to main content

Statistical Multiple Alignment

  • Reference work entry
  • First Online:
Encyclopedia of Algorithms
  • 29 Accesses

Years and Authors of Summarized Original Work

  • 2003; Hein, Jensen, Pedersen

Problem Definition

The three main types of mutations modifying biological sequences are insertions, deletions, and substitutions. The simplest model involving these three types of mutations is the so-called Thorne-Kishino-Felsenstein model [16]. In this model, the characters of a sequence evolve independently. Each character in the sequence can be substituted with another character according to a prescribed reversible time-continuous Markov model on the possible characters. Insertion-deletions are modeled as a birth-death process. Insertions can happen at the beginning of the sequence, at the end of the sequence, and between any two characters. It is possible to insert a character into the empty sequence. The time span between two insertions is exponentially distributed with parameter λ, and this parameter does not depend on the context of the position. The newborn character is drawn from the equilibrium...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,599.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 1,999.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Arunapuram P, Edvardsson I, Golden M, Anderson JW, Novák Á, Sükösd Z, Hein J (2013) StatAlign 2.0: combining statistical alignment with RNA secondary structure prediction. Bioinformatics 29(5):654–655

    Article  Google Scholar 

  2. Csűrös M, Miklós I (2006) A probabilistic model for gene content evolution with duplication, loss, and horizontal transfer. In: Proceedings of RECOMB2006. Lecture notes in bioinformatics, Springer Verlag, vol 3909, pp 206–220

    Google Scholar 

  3. Durbin R, Eddy S, Krogh A, Mitchison G (1998). Biological sequence analysis. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  4. Felsenstein J (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol 17:368–376

    Article  Google Scholar 

  5. Hein JJ, Jensen JL, Pedersen CNS (2003) Recursions for statistical multiple alignment. PNAS 100:14960–14965

    Article  Google Scholar 

  6. Holmes I (2003) Using guide trees to construct multiple-sequence evolutionary hmms. Bioinformatics 19:i147–i157

    Article  Google Scholar 

  7. Holmes I, Bruno WJ (2001) Evolutionary HMMs: a Bayesian approach to multiple alignment. Bioinformatics 17(9):803–820

    Article  Google Scholar 

  8. Jukes TH, Cantor CR (1969) Evolution of protein molecules. In: Munro HN (ed) Mammalian protein metabolism. Academic, New York, pp 21–132

    Chapter  Google Scholar 

  9. Lunter GA, Miklós I, Song YS, Hein J (2003) An efficient algorithm for statistical multiple alignment on arbitrary phylogenetic trees. J Comput Biol 10(6):869–889

    Article  Google Scholar 

  10. Lunter GA, Miklós I, Drummond AJ, Jensen JL, Hein JJ (2003) Bayesian phylogenetic inference under a statistical indel model. In: Proceedings of WABI2003. Lecture notes in bioinformatics, Springer Verlag, vol 2812, pp 228–244

    Google Scholar 

  11. Lunter GA, Miklós I, Drummond AJ, Jensen JL, Hein JJ (2005) Bayesian coestimation of phylogeny and sequence alignment. BMC Bioinformatics 6:83

    Article  Google Scholar 

  12. Metzler D, Fleißner R, Wakolbringer A, von Haeseler A (2001) Assessing variability by joint sampling of alignments and mutation rates. J Mol Evol 53:660–669

    Article  Google Scholar 

  13. Novák A, Miklós I, Lyngsø R, Hein J (2008) StatAlign: an extendable software package for joint bayesian estimation of alignments and evolutionary trees. Bioinformatics 24(20):2403–2404

    Article  Google Scholar 

  14. Pereira F, Riley M (1997) Speech recognition by composition of weighted finite automata. In: Finite-state language processing. MIT, Cambridge, pp 149–173

    Google Scholar 

  15. Satija R, Novák A, Miklós I, Lyngsø R, Hein J (2009) BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC. BMC Evol Biol 9:217

    Article  Google Scholar 

  16. Thorne JL, Kishino H, Felsenstein J (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol 33:114–124

    Article  Google Scholar 

  17. Thorne JL, Kishino H, Felsenstein J (1992) Inching toward reality: an improved likelihood model of sequence evolution. J Mol Evol 34:3–16

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media New York

About this entry

Cite this entry

Miklós, I. (2016). Statistical Multiple Alignment. In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2864-4_400

Download citation

Publish with us

Policies and ethics