Skip to main content

A Fitness Distance Correlation Measure for Evolutionary Trees

  • Conference paper
Bioinformatics and Computational Biology (BICoB 2009)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 5462))

Included in the following conference series:

  • 1130 Accesses

Abstract

Phylogenetics is concerned with inferring the genealogical relationships between a group of organisms (or taxa), and this relationship is usually expressed as an evolutionary tree. However, inferring the phylogenetic tree is not a trivial task since it is impossible to know the true evolutionary history for a set of organisms. As a result, most phylogenetic analyses rely on effective heuristics for obtaining accurate trees. These heuristics use tree score as a basis for establishing an accurate depiction of evolutionary tree relationships. Relatively little work has been done to analyze the relationship between improving tree scores (fitness) and topological accuracy (distance). In this paper, we present a new fitness-distance correlation coefficient called r fd to quantify the relationship between evolutionary trees. By applying this measure to three biological datasets consisting of 44, 60, and 174 taxa, our results show that improvements in fitness are strongly correlated (r fd > 0.8) with topological accuracy to the best-tree-overall. Moreover, we investigated the use of the r fd coefficient if the best overall tree is not available and found similar results. Thus, our results show that r fd is a robust measure with several potential applications such as the development of stopping criteria for phylogenetic search.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Butt, D., Roger, A., Blouin, C.: libcov: A C++ bioinformatic library to manipulate protein structures, sequence alignments and phylogeny. BMC Bioinformatics 6(138) (2005)

    Google Scholar 

  2. Deans, A.R., Gillespie, J.J., Yoder, M.J.: An evaluation of ensign wasp classification (Hymenoptera: Evanildae) based on molecular data and insights from ribosomal rna secondary structure. Syst. Ento. 31, 517–528 (2006)

    Article  Google Scholar 

  3. Felsenstein, J.: Phylogenetic inference package (PHYLIP), version 3.2. Cladistics 5, 164–166 (1989)

    Google Scholar 

  4. Felsenstein, J.: Inferring Phylogenies. Sinauer Associates (2003)

    Google Scholar 

  5. Fitch, W.M.: Toward defining the course of evolution: minimal change for a specific tree topology. Syst. Zool. 20, 406–416 (1971)

    Article  Google Scholar 

  6. Foulds, L.R., Graham, R.L.: The Steiner problem in phylogeny is NP-complete. Advances in Applied Mathematics 3, 43–49 (1982)

    Article  Google Scholar 

  7. Gillespie, J., McKenna, C., Yoder, M., Gutell, R., Johnston, J., Kathirithamby, J., Cognato, A.: Assessing the odd secondary structural properties of nuclear small subunit ribosomal rna sequences (18s) of the twisted-wing parasites (Insecta: Strepsiptera). Insect Mol. Biol. 15, 625–643 (2005)

    Article  Google Scholar 

  8. Goloboff, P.: Analyzing large data sets in reasonable times: solutions for composite optima. Cladistics 15, 415–428 (1999)

    Article  Google Scholar 

  9. Goloboff, P.A., Farris, J.S., Nixon, K.C.: TNT, a free program for phylogenetic analysis. Cladistics 24(5), 774–786 (2008)

    Article  Google Scholar 

  10. Jones, T., Forrest, S.: Fitness distance correlation as a measure of problem difficulty for genetic algorithms. In: Eshelman, L. (ed.) Proceedings of the Sixth International Conference on Genetic Algorithms, pp. 184–192. Morgan Kaufmann, San Francisco (1995)

    Google Scholar 

  11. Murphy, W.J., Eizirik, E., O’Brien, S.J., Madsen, O., Scally, M., Douady, C.J., Teeling, E., Ryder, O.A., Stanhope, M.J., de Jong, W.W., Springer, M.S.: Resolution of the early placental mammal radiation using bayesian phylogenetics. Science 294, 2348–2351 (2001)

    Article  CAS  PubMed  Google Scholar 

  12. Nixon, K.C.: The parsimony ratchet, a new method for rapid parsimony analysis. Cladistics 15, 407–414 (1999)

    Article  Google Scholar 

  13. Ronquist, F., Huelsenbeck, J.P.: Mrbayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19(12), 1572–1574 (2003)

    Article  CAS  PubMed  Google Scholar 

  14. Saitou, N., Nei, M.: The neighbor-joining method: A new method for reconstructiong phylogenetic trees. Mol. Biol. Evol. 4, 406–425 (1987)

    CAS  PubMed  Google Scholar 

  15. Stamatakis, A., Ludwig, T., Meier, H.: RAxML: A fast program for maximum likelihood-based inference of large phylogenetic trees. Bioinformatics 1(1), 1–8 (2004)

    Google Scholar 

  16. Sul, S.-J., Williams, T.L.: An experimental analysis of robinson-foulds distance matrix algorithms. In: Halperin, D., Mehlhorn, K. (eds.) ESA 2008. LNCS, vol. 5193, pp. 793–804. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  17. Swofford, D.L.: PAUP*: Phylogenetic analysis using parsimony (and other methods), Sinauer Associates, Underland, Massachusetts, Version 4.0 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Park, H.J., Williams, T.L. (2009). A Fitness Distance Correlation Measure for Evolutionary Trees. In: Rajasekaran, S. (eds) Bioinformatics and Computational Biology. BICoB 2009. Lecture Notes in Computer Science(), vol 5462. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00727-9_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00727-9_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00726-2

  • Online ISBN: 978-3-642-00727-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics