Ultra-Large Alignments Using Ensembles of Hidden Markov Models

Nguyen, Nam-phuong; Mirarab, Siavash; Kumar, Keerthana; Warnow, Tandy

doi:10.1007/978-3-319-16706-0_26

Nam-phuong Nguyen⁵,
Siavash Mirarab⁶,
Keerthana Kumar⁶ &
…
Tandy Warnow^5,7

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 9029))

Included in the following conference series:

International Conference on Research in Computational Molecular Biology

2839 Accesses

Abstract

Many biological questions rely upon multiple sequence alignments (MSAs) and phylogenetic trees of large datasets. However, accurate MSA estimation is difficult for large datasets, especially when the dataset evolved under high rates of evolution or contains fragmentary sequences.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Edgar, R.C.: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32(5), 1792–1797 (2004)
Article Google Scholar
Finn, R.D., Clements, J., Eddy, S.R.: HMMER web server: interactive sequence similarity searching. Nucleic Acids Research 39, W29–W37 (2011)
Article Google Scholar
Katoh, K., Toh, H.: PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences. Bioinformatics 23, 372–374 (2007)
Article Google Scholar
Mirarab, S., Nguyen, N., Wang, L.-S., Guo, S., Kim, J., Warnow, T.: PASTA: ultra-large multiple sequence alignment of nucleotide and amino acid sequences. J. Computational Biology (2015)
Google Scholar
Mirarab, S., Nguyen, N., Warnow, T.: SEPP: SATé-Enabled Phylogenetic Placement. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 247–58, January 2012
Google Scholar
Mirarab, S., Nguyen, N., Warnow, T.: PASTA: ultra-large multiple sequence alignment. In: Sharan, R. (ed.) RECOMB 2014. LNCS, vol. 8394, pp. 177–191. Springer, Heidelberg (2014)
Chapter Google Scholar
Price, M.N., Dehal, P.S., Arkin, A.P.: FastTree 2 – approximately maximum-likelihood trees for large alignments. PloS One 5(3), e9490 (2010)
Article Google Scholar
Sievers, F., Wilm, A., Dineen, D., Gibson, T.J., Karplus, K., Li, W., Lopez, R., McWilliam, H., Remmert, M., Sjöding, J., Thompson, J.D., Higgins, D.G.: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology, 7(539), October 2011
Google Scholar
Stamatakis, A.: RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics (Oxford, England), pp. 1–2, February 2014
Google Scholar

Download references

Author information

Authors and Affiliations

Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Champaign, USA
Nam-phuong Nguyen & Tandy Warnow
Department of Computer Science, University of Texas at Austin, Austin, USA
Siavash Mirarab & Keerthana Kumar
Departments of Bioengineering and Computer Science, University of Illinois at Urbana-Champaign, Champaign, USA
Tandy Warnow

Authors

Nam-phuong Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Siavash Mirarab
View author publications
You can also search for this author in PubMed Google Scholar
Keerthana Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Tandy Warnow
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tandy Warnow .

Editor information

Editors and Affiliations

National Center of Biotechnology Information, Bethesda, Maryland, USA
Teresa M. Przytycka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nguyen, Np., Mirarab, S., Kumar, K., Warnow, T. (2015). Ultra-Large Alignments Using Ensembles of Hidden Markov Models. In: Przytycka, T. (eds) Research in Computational Molecular Biology. RECOMB 2015. Lecture Notes in Computer Science(), vol 9029. Springer, Cham. https://doi.org/10.1007/978-3-319-16706-0_26

Download citation

DOI: https://doi.org/10.1007/978-3-319-16706-0_26
Published: 26 March 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16705-3
Online ISBN: 978-3-319-16706-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics