Abstract
Incomplete lineage sorting (ILS) gives rise to stochastic variation in the topology of a gene tree and hence introduces false duplication events when gene tree and species tree reconciliation method is used for inferring the duplication history of a gene family. We quantify the effect of ILS on inference of gene duplication by examining the expected number of false duplication events inferred from reconciling a random gene tree, which occurs with a probability predicted in coalescent theory, and the given species tree. We computationally analyze the relationships between the number of false duplication events inferred on a branch and its length in a species tree, and the relationships between the expected number of false duplication events in a species tree and its topological parameters. This study provides evidence that inference of gene duplication based on tree reconciliation was affected by ILS to a greater extent on an asymmetric species tree than on a symmetric one. Our findings also suggest that the bias caused by ILS in reconciliation-based inference of gene duplication might not be negligible. Hence, when gene duplication is inferred via tree reconciliation or any other method that takes gene tree topology into account, the ILS-induced bias should be examined cautiously.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Åkerborg, Ö., Sennblad, B., Arvestad, L., Lagergren, J.: Simultaneous bayesian gene tree reconstruction and reconciliation analysis. Proc. Natl. Acad. Sci. U. S. A. 106(14), 5714–5719 (2009)
Bansal, M.S., Alm, E.J., Kellis, M.: Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss. Bioinformatics 28(12), i283–i291 (2012)
Berglund-Sonnhammer, A.C., Steffansson, P., Betts, M.J., Liberles, D.A.: Optimal gene trees from sequences and species trees using a soft interpretation of parsimony. J. Mol. Evol. 63(2), 240–250 (2006)
Cann, R.L., Stoneking, M., Wilson, A.C.: Mitochondrial DNA and human evolution. Nature 325(6099), 31–36 (1987)
Chauve, C., El-Mabrouk, N.: New perspectives on gene family evolution: Losses in reconciliation and a link with supertrees. In: Batzoglou, S. (ed.) RECOMB 2009. LNCS, vol. 5541, pp. 46–58. Springer, Heidelberg (2009)
Chen, K., Durand, D., Farach-Colton, M.: Notung: a program for dating gene duplications and optimizing gene family trees. J. Comput. Biol. 7(3-4), 429–447 (2000)
Degnan, J.H., Salter, L.A.: Gene tree distributions under the coalescent process. Evolution 59(1), 24–37 (2005)
Doyle, J.J.: Gene trees and species trees: molecular systematics as one-character taxonomy. Syst. Botany 17, 144–163 (1992)
Doyon, J.-P., Scornavacca, C., Gorbunov, K.Y., Szöllősi, G.J., Ranwez, V., Berry, V.: An Efficient Algorithm for Gene/Species Trees Parsimonious Reconciliation with Losses, Duplications and Transfers. In: Tannier, E. (ed.) RECOMB-CG 2010. LNCS, vol. 6398, pp. 93–108. Springer, Heidelberg (2010)
Edwards, S.V.: Is a new and general theory of molecular systematics emerging? Evolution 63(1), 1–19 (2008)
Edwards, S.V., Beerli, P.: Perspective: gene divergence, population divergence, and the variance in coalescence time in phylogeographic studies. Evolution 54(6), 1839–1854 (2000)
Goodman, M., Czelusniak, J., Moore, G.W., Romero-Herrera, A.E., Matsuda, G.: Fitting the gene lineage into its species lineage, a parsimony strategy illustrated by cladograms constructed from globin sequences. Syst. Biol. 28(2), 132–163 (1979)
Górecki, P., Tiuryn, J.: DLS-trees: a model of evolutionary scenarios. Theor. Comput. Sci. 359(1), 378–399 (2006)
Hahn, M.W.: Bias in phylogenetic tree reconciliation methods: implications for vertebrate genome evolution. Genome Biol. 8(7), R141 (2007)
Hahn, M.W., Han, M.V., Han, S.G.: Gene family evolution across 12 Drosophila genomes. PLoS Genetics 3(11), e197 (2007)
Hey, J., Nielsen, R.: Multilocus methods for estimating population sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics 167(2), 747–760 (2004)
Keeling, P.J., Palmer, J.D.: Horizontal gene transfer in eukaryotic evolution. Nat. Rev. Genet. 9(8), 605–618 (2008)
Knowles, L.L., Kubatko, L.S.: Estimating Species Trees: Practical and Theoretical Aspects. Wiley-Blackwel, New Jersey (2010)
Liu, L., Yu, L., Kubatko, L., Pearl, D.K., Edwards, S.V.: Coalescent methods for estimating phylogenetic trees. Mol. Phylogenet. Evol. 53(1), 320–328 (2009)
Ma, B., Li, M., Zhang, L.X.: From gene trees to species trees. SIAM J. Comput. 30(3), 729–752 (2000)
Maddison, W.P.: Gene trees in species trees. Syst. Biol. 46(3), 523–536 (1997)
Maddison, W.P., Knowles, L.L.: Inferring phylogeny despite incomplete lineage sorting. Syst. Biol. 55(1), 21–30 (2006)
Page, R.D.M.: Maps between trees and cladistic analysis of historical associations among genes, organisms, and areas. Syst. Biol. 43(1), 58–77 (1994)
Pamilo, P., Nei, M.: Relationships between gene trees and species trees. Mol. Biol. Evol. 5(5), 568–583 (1988)
Pollard, D.A., Iyer, V.N., Moses, A.M., Eisen, M.B.: Widespread discordance of gene trees with species tree in Drosophila: evidence for incomplete lineage sorting. PLoS Genet. 2(10), e173 (2006)
Rasmussen, M.D., Kellis, M.: Unified modeling of gene duplication, loss, and coalescence using a locus tree. Genome Research 22(4), 755–765 (2012)
Rosenberg, N.A.: The probability of topological concordance of gene trees and species trees. Theor. Popul. Biol. 61(2), 225–247 (2002)
Sackin, M.J.: Good and bad phenograms. Syst. Zool. 21, 225–226 (1972)
Sang, T., Zhong, Y.: Testing hybridization hypotheses based on incongruent gene trees. Syst. Biol. 49(3), 422–434 (2000)
Stolzer, M., Lai, H., Xu, M., Sathaye, D., Vernot, B., Durand, D.: Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees. Bioinformatics 28(18), i409–i415 (2012)
Takahata, N.: Gene genealogy in three related populations: consistency probability between gene and population trees. Genetics 122(4), 957–966 (1989)
Wehe, A., Bansal, M.S., Burleigh, J.G., Eulenstein, O.: Duptree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics 24(13), 1540–1541 (2008)
Wong, K.M., Suchard, M.A., Huelsenbeck, J.P.: Alignment uncertainty and genomic analysis. Science 319(5862), 473–476 (2008)
Wu, C.I.: Inferences of species phylogeny in relation to segregation of ancient polymorphisms. Genetics 127(2), 429–435 (1991)
Wu, Y.: Coalescent-based species tree inference from gene tree topologies under incomplete lineage sorting by maximum likelihood. Evolution 66, 763–775 (2012)
Yu, Y., Than, C., Degnan, J.H., Nakhleh, L.: Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting. Syst. Biol. 60(2), 138–149 (2011)
Zhang, L.X.: From gene trees to species trees ii: Species tree inference by minimizing deep coalescence events. IEEE-ACM Trans. Comput. Biol. Bioinform. 8(6), 1685–1691 (2011)
Zheng, Y., Wu, T., Zhang, L.X.: Reconciliation of gene and species trees with polytomies. arXiv preprint, arXiv:1201.3995 (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zheng, Y., Zhang, L. (2013). Effect of Incomplete Lineage Sorting on Tree-Reconciliation-Based Inference of Gene Duplication. In: Cai, Z., Eulenstein, O., Janies, D., Schwartz, D. (eds) Bioinformatics Research and Applications. ISBRA 2013. Lecture Notes in Computer Science(), vol 7875. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38036-5_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-38036-5_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38035-8
Online ISBN: 978-3-642-38036-5
eBook Packages: Computer ScienceComputer Science (R0)