Abstract
The phylogeny reconstruction problem is a fundamental problem in computational molecular biology and biochemical physics. Since the number of data sets has grown substantially in recent years, the accuracy and speed of constructing phylogenies become increasingly critical. Numerous studies have demonstrated that the maximum likelihood (ML) method is the most effective method for reconstructing a phylogenetic tree from sequence data. Conversely, tree bisection and reconnection (TBR) is a tree topology rearrangement method that can generate an extensive tree space. In this paper, we propose an enhanced method for reconstructing phylogenetic trees in which the TBR operation is modified and combined with the minimum evolution principle to filter out some unnecessary reconnected positions to reduce the search time. The experiment results demonstrate that the proposed method can assist other algorithms in constructing more accurate trees within a reasonable time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
In a graph G, the subdivision of an edge (x, y) by a node z involves replacing (x, y) with a path 〈x, z, y〉 through a new node z.
- 2.
NNI is a local tree rearrangement method that generates two alternative trees by swapping a subtree on one side of the branch with a subtree on the other side.
References
Baba, M.L., Darga, L.L., Goodman, M., Czelusniak, J.: Evolution of cytochrome C investigated by the maximum parsimony method. J. Mol. Evol. 17, 197–213 (1981)
Bordewich, M., Gascuel, O., Huber, K.T., Moulton, V.: Consistency of topological moves based on the balanced minimum evolution principle of phylogenetic inference. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 110–117 (2009)
Bordewich, M., Semple, C.: On the computational complexity of the rooted subtree prune and regraft distance. Ann. Comb. 8, 409–423 (2004)
Brent, R.: Algorithms for Minimization Without Derivatives. Prentice Hall Inc., Englewood Cliffs (1973)
Bryant, D.: The splits in the neighborhood of a tree. Ann. Comb. 8, 1–11 (2004)
Culik II, K., Wood, D.: A note on some tree similarity measures. Inf. Process. Lett. 15, 39–42 (1982)
Day, W.H.E.: Properties of the nearest neighbor interchange metric for trees of small size. J. Theor. Biol. 101, 275–288 (1983)
Edwards, A.W.F., Cavalli-Sforza, L.L.: The reconstruction of evolution. Ann. Hum. Genet. 27, 105–106 (1963)
Desper, R., Gascuel, O.: Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle. J. Comput. Biol. 9, 687–705 (2002)
Felsenstein, J.: Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368–376 (1981)
Fauvel, J.: Algorithms in the pre-calculus classroom: who was Newton- Raphson? Math. Sch. 27, 45–47 (1998)
Felsenstein, J.: Inferring Phylogenies. Sinauer, Sunderland (2004)
Gaut, B.S., Lewis, P.O.: Success of maximum likelihood phylogeny inference in the four-taxon case. Mol. Biol. Evol. 12, 152–162 (1995)
Guindon, S., Gascuel, O.: A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696–704 (2003)
Harper, J.T., Waanders, E., Keeling, P.J.: On the monophyly of chromalveolates using a six-protein phylogeny of eukaryotes. Int. J. Syst. Evol. Microbiol. 55, 487–496 (2005)
Hasegawa, M., Kishino, H., Yano, T.: Dating of the human-ape splitting by a molecular clock of mitochondrial DNA. J. Mol. Evol. 21, 160–174 (1985)
Hordijk, W., Gascuel, O.: Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood. Bioinformatics 21, 4338–4347 (2005)
Huelsenbeck, J.P., Crandall, K.A.: Phylogeny estimation and hypothesis testing using maximum likelihood. Ann. Rev. Ecol. Syst. 28, 437–466 (1997)
Huelsenbeck, J.F., Hillis, D.M.: Success of phylogenetic methods in the four-taxon Case. Syst. Biol. 42, 247–264 (1993)
Jukes, T.H., Cantor, C.R.: Evolution of protein molecules. In: Munro, H.N. (ed.) Mammalian Protein Metabolism. Academy Press, New York (1969)
Jones, N.C., Pevzner, P.A.: An Introduction to Bioinformatics Algorithms. The MIT Press, Cambridge (2004)
Ho, C.K., Shuman, S.: Trypanosoma brucei RNA triphosphatase: antiprotozoal drug target and guide to eukaryotic phylogeny. J. Biol. Chem. 276, 46182–46186 (2001)
Jiang, H., Blouin, C.: Insertions and the emergence of novel protein structure: a structure-based phylogenetic study of insertions. BMC Bioinf. 8, 444–458 (2007)
Kimura, M.: A simple method for estimating evolutionary rate of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16, 111–120 (1980)
Kuhner, M.K., Felsenstein, J.: A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. Mol. Biol. Evol. 11, 459–468 (1994)
Keane, T.M., Naughton, T.J., Travers, S.A.A., Mclnerney, J.O., McCormack, G.P.: DPRml: distributed phylogeny reconstruction by maximum likelihood. Bioinformatics 21, 969–974 (2005)
Lamboy, W.F.: The accuracy of the maximum parsimony method for phylogeny reconstruction with morphological characters. Syst. Bot. 19, 189–505 (1994)
Larget, B., Simon, D.L.: Markov chain Monte Carlo algorithms for the Bayesian analysis of phylogenetic trees. Mol. Biol. Evol. 16, 750–759 (1999)
Ledford, R.M.: VP1 sequencing of all human rhinovirus serotypes: insights into genus phylogeny and susceptibility to antiviral capsid-binding compounds. J. Virol. 78, 3663–3674 (2004)
Lemey, P., Pybus, O.G., Wang, B., Saksena, N.K., Salemi, M., Vandamme, A.M.: Tracing the origin and history of the HIV-2 epidemic. Nat. Acad. Sci. 100, 6588–6592 (2003)
Lemmon, A., Milinkovitch, M.: The metapopulation genetic algorithm: an efficient solution for the problem of large phylogeny estimation. Nat. Acad. Sci. US Am. 99, 10516–10521 (2002)
Ludwig, W.: ARB: a software environment for sequence data. Nucleic Acids Res. 32, 1363–1371 (2004)
Mau, B., Newton, M.A., Larget, B.: Bayesian phylogenetic inference via Markov Chain Monte Carlo methods. Biometrics 55, 1–12 (1999)
Michener, C., Sokal, R.: A quantitative approach to a problem in classification. Evolution 11, 130–162 (1957)
Neyman, J.: Statistical Decision Theory and Related Topics. Academy Press, New York (1971)
Ohkuma, M., Saita, S., Inoue, T., Kudo, T.: Comparison of four protein phylogeny of parabasalian symbionts in termite guts. Mol. Phylogenet. Evol. 42, 847–853 (2007)
Ranwez, V., Gascuel, O.: Improvement of distance-based phylogenetic methods by a local maximum likelihood approach using triplets. Mol. Biol. Evol. 19, 1952–1963 (2002)
Rosenberg, M., Kumar, S.: Traditional phylogenetic reconstruction methods reconstruct shallow and deep evolutionary relationships equally well. Mol. Biol. Evol. 18, 1823–1827 (2001)
Saitou, N., Nei, M.: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987)
Song, Y.S.: Properties of subtree-prune-and-regraft operations on totally-ordered phylogenetic trees. Ann. Comb. 10, 147–163 (2006)
Stamatakis, A., Ludwig, T.: RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 21, 456–463 (2005)
Stamatakis, A., Hoover, P., Rougemont, J.: A rapid bootstrap algorithm for the RAxML web servers. Syst. Biol. 75, 758–771 (2008)
Takezaki, N., Nei, M.: Inconsistency of the maximum parsimony method when the rate of nucleotide substitution is constant. J. Mol. Evol. 39, 210–218 (1994)
Winkworeth, R.C., Bryant, D., Lockhart, P.J., Havell, D., Moulton, V.: Biogeographic interpretation of splits graph: least squares optimization of branch length. Syst. Biol. 54, 56–65 (2005)
Yang, Z., Rannala, B.: Bayesian phylogenetic Inference using DNA sequences: a Markov Chain Monte Carlo method. Mol. Biol. Evol. 14, 717–724 (1997)
Yang, Z.: Computational Molecular Evolution. Oxford University Press, Oxford (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Hsieh, SY., Tsai, IP., Hung, HC., Chen, YC., Chou, HH., Lee, CW. (2015). An Enhanced Algorithm for Reconstructing a Phylogenetic Tree Based on the Tree Rearrangement and Maximum Likelihood Method. In: Huang, DS., Jo, KH., Hussain, A. (eds) Intelligent Computing Theories and Methodologies. ICIC 2015. Lecture Notes in Computer Science(), vol 9226. Springer, Cham. https://doi.org/10.1007/978-3-319-22186-1_53
Download citation
DOI: https://doi.org/10.1007/978-3-319-22186-1_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22185-4
Online ISBN: 978-3-319-22186-1
eBook Packages: Computer ScienceComputer Science (R0)