Abstract
Given two rooted binary phylogenetic trees with identical leaf label-set, the maximum agreement forest (MAF) problem asks for a largest common subforest of the two trees. This problem has been studied extensively in the literature, and has been known to be NP-complete and MAX SNP-hard. The previously best ratio of approximation algorithms for this problem is 3. In this paper, we make full use of the special relations among leaves in phylogenetic trees and present an approximation algorithm with ratio 2.5 for the MAF problem on two rooted binary phylogenetic trees.







Similar content being viewed by others
Notes
The definitions for the study of MAFs have been kind of confusing. If size denotes the number of edges in a forest, then for a forest, the size is equal to the number of vertices minus the order. In particular, when the number of vertices is fixed, a forest of a large size means a small order of the forest.
Allen and Steel (2001) proved that the TBR distance between two unrooted binary phylogenetic trees is equal to the order of their MAF minus 1.
References
Allen B, Steel M (2001) Subtree transfer operations and their induced metrics on evolutionary trees. Ann Comb 5(1):1–15
Baroni M, Grnewald S, Moulton V, Semple C (2005) Bounding the number of hybridisation events for a consistent evolutionary history. J Math Biol 51(2):171–182
Bonet M, John R, Mahindru R, Amenta N (2006) Approximating subtree distances between phylogenies. J Comput Biol 13(8):1419–1434
Bordewich M, McCartin C, Semple C (2008) A 3-approximation algorithm for the subtree distance between phylogenies. J Discret Algorithms 6(3):458–471
Bordewich M, Semple C (2005) On the computational complexity of the rooted subtree prune and regraft distance. Ann Comb 8(4):409–423
Buneman P (1971) The recovery of trees from measures of dissimilarity. In: Hodson F, Kendall D, Tauta P (eds) Mathematics in the archaeological and historical sciences. Edinburgh University Press, Edinburgh, pp 387–395
Chataigner F (2005) Approximating the maximum agreement forest on \(k\) trees. Inf Process Lett 93:239–244
Dudas G, Bedford T, Lycett S et al (2015) Reassortment between influenza B lineages and the emergence of a coadapted PB1-PB2-HA gene complex. Mol Biol Evol 32(1):162–172
Hein J, Jiang T, Wang L, Zhang K (1996) On the complexity of comparing evolutionary trees. Discret Appl Math 71:153–169
Hallett M, McCartin C (2007) A faster FPT algorithm for the maximum agreement forest problem. Theory Comput Syst 41(3):539–550
Lersel LV, Kelk S, Lekic N, Stougie L (2014) Approximation algorithms for nonbinary agreement forests. SIAM J Discret Math 28(1):49–66
Li M, Tromp J, Zhang L (1996) On the nearest neighbour interchange distance between evolutionary trees. J Theor Biol 182(4):463–467
Robinson D, Foulds L (1981) Comparison of phylogenetic trees. Math Biosci 53(1–2):131–147
Rodrigues E, Sagot M, Wakabayashi Y (2007) The maximum agreement forest problem: approximation algorithms and computational experiments. Theor Comput Sci 374(1–3):91–110
Rodrigues M, Sagot M, Wakabayashi Y (2001) Some approxiamtion results for the maximum agreement forest problem. In: Proceedings of RANDOM 2001 and APPROX 2001, LNCS, vol 2129, pp 159–169
Swofford D, Olsen G, Waddell P, Hillis D (1996) Phylogenetic inference. In: Molecular Systematics, 2nd edn. Sinauer, Associates, pp 407–513
Shi F, Chen J, Feng Q, Wang J (2014) Approximation algorithms for maximum agreement forest on multiple trees. In: Proceedings of 20th international computing and combinatorics conference, LNCS, vol 8591, pp 381–392
Whidden C, Beiko R, Zeh N (2011) Fixed-parameter and approximation algorithms for maximum agreement forests. CoRR. abs/1108.2664
Whidden C, Beiko R, Zeh N (2013) Fixed-parameter and approximation algorithms for maximum agreement forests of multifurcating trees. arXiv preprint arXiv:1305.0512
Whidden C, Zeh N (2009) A unifying view on approximation and FPT of agreement forests. In: Proceedings of the 9th workshop on algorithms in bioinformatics, LNCS, vol 5724, pp 390–401
Whidden C, Beiko R, Zeh N (2013) Fixed-parameter algorithms for maximum agreement forests. SIAM J Comput 42(4):1431–1466
Whidden C, Zeh N, Beiko R (2014) Supertrees based on the subtree prune-and-regraft distance. Syst Biol 63(4):566–581. doi:10.1093/sysbio/syu023
Whidden C, Frederick A, Matsen I (2015) Quantifying MCMC exploration of phylogenetic tree space. Syst Biol. doi:10.1093/sysbio/syv006
Acknowledgments
A preliminary version of this work was reported in the Proceedings of the 8th International Frontiers of Algorithmics Workshop, Lecture Notes in Computer Science, vol. 8497, pp. 205–215, 2014. This work is supported by the National Natural Science Foundation of China under Grant (61232001, 61472449, 61420106009), the Major Science & Technology Research Program for Strategic Emerging Industry of Hunan (Grant No. 2012GK4054), and the Research Fund for the Doctoral Program of Higher Education of China (No. 20130162130001).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shi, F., Feng, Q., You, J. et al. Improved approximation algorithm for maximum agreement forest of two rooted binary phylogenetic trees. J Comb Optim 32, 111–143 (2016). https://doi.org/10.1007/s10878-015-9921-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10878-015-9921-7