Skip to main content
Log in

Improved approximation algorithm for maximum agreement forest of two rooted binary phylogenetic trees

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

Abstract

Given two rooted binary phylogenetic trees with identical leaf label-set, the maximum agreement forest (MAF) problem asks for a largest common subforest of the two trees. This problem has been studied extensively in the literature, and has been known to be NP-complete and MAX SNP-hard. The previously best ratio of approximation algorithms for this problem is 3. In this paper, we make full use of the special relations among leaves in phylogenetic trees and present an approximation algorithm with ratio 2.5 for the MAF problem on two rooted binary phylogenetic trees.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. The definitions for the study of MAFs have been kind of confusing. If size denotes the number of edges in a forest, then for a forest, the size is equal to the number of vertices minus the order. In particular, when the number of vertices is fixed, a forest of a large size means a small order of the forest.

  2. Allen and Steel (2001) proved that the TBR distance between two unrooted binary phylogenetic trees is equal to the order of their MAF minus 1.

References

  • Allen B, Steel M (2001) Subtree transfer operations and their induced metrics on evolutionary trees. Ann Comb 5(1):1–15

    Article  MathSciNet  MATH  Google Scholar 

  • Baroni M, Grnewald S, Moulton V, Semple C (2005) Bounding the number of hybridisation events for a consistent evolutionary history. J Math Biol 51(2):171–182

    Article  MathSciNet  MATH  Google Scholar 

  • Bonet M, John R, Mahindru R, Amenta N (2006) Approximating subtree distances between phylogenies. J Comput Biol 13(8):1419–1434

    Article  MathSciNet  Google Scholar 

  • Bordewich M, McCartin C, Semple C (2008) A 3-approximation algorithm for the subtree distance between phylogenies. J Discret Algorithms 6(3):458–471

    Article  MathSciNet  MATH  Google Scholar 

  • Bordewich M, Semple C (2005) On the computational complexity of the rooted subtree prune and regraft distance. Ann Comb 8(4):409–423

    Article  MathSciNet  MATH  Google Scholar 

  • Buneman P (1971) The recovery of trees from measures of dissimilarity. In: Hodson F, Kendall D, Tauta P (eds) Mathematics in the archaeological and historical sciences. Edinburgh University Press, Edinburgh, pp 387–395

    Google Scholar 

  • Chataigner F (2005) Approximating the maximum agreement forest on \(k\) trees. Inf Process Lett 93:239–244

    Article  MathSciNet  MATH  Google Scholar 

  • Dudas G, Bedford T, Lycett S et al (2015) Reassortment between influenza B lineages and the emergence of a coadapted PB1-PB2-HA gene complex. Mol Biol Evol 32(1):162–172

    Article  Google Scholar 

  • Hein J, Jiang T, Wang L, Zhang K (1996) On the complexity of comparing evolutionary trees. Discret Appl Math 71:153–169

    Article  MathSciNet  MATH  Google Scholar 

  • Hallett M, McCartin C (2007) A faster FPT algorithm for the maximum agreement forest problem. Theory Comput Syst 41(3):539–550

    Article  MathSciNet  MATH  Google Scholar 

  • Lersel LV, Kelk S, Lekic N, Stougie L (2014) Approximation algorithms for nonbinary agreement forests. SIAM J Discret Math 28(1):49–66

    Article  MathSciNet  MATH  Google Scholar 

  • Li M, Tromp J, Zhang L (1996) On the nearest neighbour interchange distance between evolutionary trees. J Theor Biol 182(4):463–467

    Article  Google Scholar 

  • Robinson D, Foulds L (1981) Comparison of phylogenetic trees. Math Biosci 53(1–2):131–147

    Article  MathSciNet  MATH  Google Scholar 

  • Rodrigues E, Sagot M, Wakabayashi Y (2007) The maximum agreement forest problem: approximation algorithms and computational experiments. Theor Comput Sci 374(1–3):91–110

    Article  MathSciNet  MATH  Google Scholar 

  • Rodrigues M, Sagot M, Wakabayashi Y (2001) Some approxiamtion results for the maximum agreement forest problem. In: Proceedings of RANDOM 2001 and APPROX 2001, LNCS, vol 2129, pp 159–169

  • Swofford D, Olsen G, Waddell P, Hillis D (1996) Phylogenetic inference. In: Molecular Systematics, 2nd edn. Sinauer, Associates, pp 407–513

  • Shi F, Chen J, Feng Q, Wang J (2014) Approximation algorithms for maximum agreement forest on multiple trees. In: Proceedings of 20th international computing and combinatorics conference, LNCS, vol 8591, pp 381–392

  • Whidden C, Beiko R, Zeh N (2011) Fixed-parameter and approximation algorithms for maximum agreement forests. CoRR. abs/1108.2664

  • Whidden C, Beiko R, Zeh N (2013) Fixed-parameter and approximation algorithms for maximum agreement forests of multifurcating trees. arXiv preprint arXiv:1305.0512

  • Whidden C, Zeh N (2009) A unifying view on approximation and FPT of agreement forests. In: Proceedings of the 9th workshop on algorithms in bioinformatics, LNCS, vol 5724, pp 390–401

  • Whidden C, Beiko R, Zeh N (2013) Fixed-parameter algorithms for maximum agreement forests. SIAM J Comput 42(4):1431–1466

    Article  MathSciNet  MATH  Google Scholar 

  • Whidden C, Zeh N, Beiko R (2014) Supertrees based on the subtree prune-and-regraft distance. Syst Biol 63(4):566–581. doi:10.1093/sysbio/syu023

    Article  Google Scholar 

  • Whidden C, Frederick A, Matsen I (2015) Quantifying MCMC exploration of phylogenetic tree space. Syst Biol. doi:10.1093/sysbio/syv006

Download references

Acknowledgments

A preliminary version of this work was reported in the Proceedings of the 8th International Frontiers of Algorithmics Workshop, Lecture Notes in Computer Science, vol. 8497, pp. 205–215, 2014. This work is supported by the National Natural Science Foundation of China under Grant (61232001, 61472449, 61420106009), the Major Science & Technology Research Program for Strategic Emerging Industry of Hunan (Grant No. 2012GK4054), and the Research Fund for the Doctoral Program of Higher Education of China (No. 20130162130001).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianxin Wang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shi, F., Feng, Q., You, J. et al. Improved approximation algorithm for maximum agreement forest of two rooted binary phylogenetic trees. J Comb Optim 32, 111–143 (2016). https://doi.org/10.1007/s10878-015-9921-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10878-015-9921-7

Keywords

Navigation