Abstract
We introduce and analyse a simple discrete probabilistic model of genome evolution. It is based on four fundamental evolutionary events: gene duplication, loss, change and innovation, and it is called DLCI model. This is the first such model rigorously analysed. The focus of the paper is around the size distribution of gene families. The formulas for equilibrium gene family sizes are derived showing that they follow a logarithmic distribution. We consider also a disjoint union of DLCI models and we present the result of this study. Some empirical results for microbial genomes are presented.
This work was partially supported by the State Committee for Scientific Research (Poland) Grants: 3 T11F 021 28 and 3 TF11 016 28
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dokholyan, N.V., Shakhnovich, B., Shakhnovich, B.E.I.: Expanding protein universe and its origin from the biological Big Bang. Proc. Natl. Acad. Sci. USA 99, 14132–14136 (2002)
Dujon, B., et al.: Genome evolution in yeasts. Nature 430, 35–44 (2004)
Enright, A.J., Van Dongen, S., Ouzounis, C.A.: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Research 30(7), 1575–1584 (2002)
Feller, W.: An introduction to probability theory and its applications. John Wiley and Sons, Inc., New York (1961)
Fitch, W.M.: Homology, a personal view on some of the problems. Trends in Genetics 16(5), 227–321 (2000)
Huynen, M.A., van Nimwegen, E.: The Frequency Distribution of Gene Family Size in Complete Genomes. Molecular Biology Evolution 15(5), 583–589 (1998)
Jordan, K., Makarova, K.S., Spouge, J.L., Wolf, Y.I., Koonin, E.V.: Lineage-Specific Gene Expansions in Bacterial and Archeal Genomes. Genome Research 11, 555–565 (2001)
Karev, G.P., Wolf, Y.I., Rzhetsky, A.Y., Berezovskaya, F.S., Koonin, E.V.: Birth and death of protein domains: A simple model of evolution explains power law behavior. BMC Evolutionary Biology 2(18) (2002)
Karev, G.P., Wolf, Y.I., Berezovskaya, F.S., Koonin, E.V.: Gene family evolution: an in-depth theoretical and simulation analysis of non-linear birth-death-innovation models. BMC Evolutionary Biology 4/32 (2004)
Karev, G.P., Wolf, Y.I., Berezovskaya, F.S., Koonin, E.V.: Modelling genome evolution with a diffusion approximation of a birth-and-death process. Bioinformatics 21/3, iii12–iii19 (2005)
Kimura, M.: The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge (1983)
Wen-Hsiung, L.: Molecular Evolution. Sinauer Associates, Inc., Publishers, Sunderland Massachusetts (1997)
Luz, H., Vingron, M.: Family specific rates of protein evolution. Bioinformatics 22(10), 1166–1171 (2006)
Ohno, S.: Evolution by Gene Duplication. Springer, Berlin (1970)
Peterson, J.D., Umayam, L.A., Dickinson, T.M., Hickey, E.K., White, O.: The Comprehensive Microbial Resource. Nucleic Acids Research 29(1), 123–125 (2001)
Reed, W.J., Hughes, B.D.: A model explaining the size distribution of gene and protein families. Math. Biosci. 189(1), 97–102 (2004)
Slonimski, P.P., Mosse, M.O., Golik, P., Henaût, A., Diaz, Y., Risler, J.L., Comet, J.P., Aude, J.C., Wozniak, A., Glemet, E., Codani, J.J.: The first laws of genomics. Microbial and Comparative Genomics 3(46) (1998)
Slonimski, P.P.: Comparison of complete genomes: Organisation and evolution. In: Proceedings of the Third Annual Conference on Computational Molecular Biology. RECOMB 1999 Stanislaw Ulam Memorial Lecture, vol. 310. ACM Press, New York (1999)
Tiuryn, J., Rudnicki, R., Wójtowicz, D.: A case study of genome evolution: From continuous to discrete time model. In: Fiala, J., Koubek, V., Kratochvíl, J. (eds.) MFCS 2004. LNCS, vol. 3153, pp. 1–24. Springer, Heidelberg (2004)
Tiuryn, J., Wójtowicz, D., Rudnicki, R.: A Model of Evolution of Small Paralog Families in Genomes (submitted for publication, 2006)
Wolf, Y.I., Grishin, N.V., Koonin, E.V.: Estimating the number of protein folds and families from complete genome data. J. Molecular Biology 299, 897–905 (2000)
Wójtowicz, D., Tiuryn, J.: On genome evolution with innovation. Mathematical Foundations of Computer Science (to appear, 2006)
Yanai, I., Camacho, C.J., DeLisi, C.: Predictions of Gene Family Distributions in Microbial Genomes: Evolution by Gene Duplication and Modification. Physical Review Letters 85(12), 2641–2644 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wójtowicz, D., Tiuryn, J. (2006). On Genome Evolution with Accumulated Change and Innovation. In: Bourque, G., El-Mabrouk, N. (eds) Comparative Genomics. RCG 2006. Lecture Notes in Computer Science(), vol 4205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11864127_4
Download citation
DOI: https://doi.org/10.1007/11864127_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44529-6
Online ISBN: 978-3-540-44530-2
eBook Packages: Computer ScienceComputer Science (R0)