Abstract
Metagenomic data enables the study of microbes and viruses through their DNA as retrieved directly from the environment in which they live. Functional analysis of metagenomes explores the abundance of gene families, pathways, and systems, rather than their taxonomy. Through such analysis researchers are able to identify those functional capabilities most important to organisms in the examined environment. Recently, a statistical framework for the functional analysis of metagenomes was described that focuses on gene families. Here we describe two pathway level computational models for functional analysis that take into account important, yet unaddressed issues such as pathway size, gene length and overlap in gene content among pathways. We test our models over carefully designed simulated data and propose novel approaches for performance evaluation. Our models significantly improve over current approach with respect to pathway ranking and the computations of relative abundance of pathways in environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
DeLong, E.F., Preston, C.M., Mincer, T., Rich, V., Hallam, S.J., Frigaard, N., Martinez, A., Sullivan, M.B., Edwards, R., Brito, B.R., Chisholm, S.W., Karl, D.M.: Community Genomics Among Stratified Microbial Assemblages in the Ocean’s Interior. Science 311(5760), 496–503 (2006)
Rusch, D.B., Halpern, A.L., Sutton, G., Heidelberg, K.B., Williamson, S., et al.: The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. 5(3), e77 (2007)
Yooseph, S., Sutton, G., Rusch, D.B., Halpern, A.L., Williamson, S.J., et al.: The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families. PLoS Biol. 5(3), e16 (2007)
Gill, S.R., Pop, M., Deboy, R.T., Eckburg, P.B., Turnbaugh, P.J., Samuel, B.S., Gordon, J.I., Relman, D.A., Fraser-Liggett, C.M., Nelson, K.E.: Metagenomic Analysis of the Human Distal Gut Microbiome. Science 312(5778), 1355–1359 (2006)
Warnecke, F., Luginbuhl, P., Ivanova, N., Ghassemian, M., Richardson, T.H., et al.: Metagenomic and functional analysis of hindgut microbiota of a wood feeding higher termite. Nature 450, 560–565 (2007)
Tyson, G.W., Chapman, J., Hugenholtz, P., Allen, E.E., Ram, R.J., et al.: Community structure and metabolism through reconstruction of microbial genomes from the environment. Nature 428(6978), 37–43 (2004)
Béjà, O., Aravind, L., Koonin, E.V., Suzuki, M.T., Hadd, A., Nguyen, L.P., Jovanovich, S.B., Gates, C.M., Feldman, R.A., Spudich, J.L., Spudich, E.N., DeLong, E.F.: Bacterial rhodopsin: evidence for a new type of phototrophy in the sea. Science 289(5486), 1902–1906 (2000)
Sharon, I., Alperovitch, A., Rohwer, F., Haynes, M., Glaser, F., et al.: Photosystem-I gene cassettes are present in marine virus genomes. Nature 461, 258–262 (2009)
Raes, J., Foerstner, K.U., Bork, P.: Get the most out of your metagenome: computational analysis of environmental sequence data. Curr. Opin. Microbiol. 10(5), 490–498 (2007)
Tatusov, R.L., Fedorova, N.D., Jackson, J.D., Jacobs, A.R., Kiryutin, B., et al.: The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4, 41 (2003)
Finn, R.D., Mistry, J., Schuster-Böckler, B., Griffiths-Jones, S., Hollich, V., Lassmann, T., Moxon, S., Marshall, M., Khanna, A., Durbin, R., Eddy, S.R., Sonnhammer, E.L.L., Bateman, A.: Pfam: clans, web tools and services. Nucleic Acids Res. 34(Database Issue), D247–D251 (2006)
Haft, D.H., Selengut, J.D., White, O.: The TIGRFAMs database of protein families. Nucleic Acids Res. 31, 371–373 (2003)
Kanehisa, M., Goto, S.: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000)
Caspi, R., Foerster, H., Fulcher, C.A., Kaipa, P., Krummenacker, M., et al.: The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 36(Database issue), D623–D631 (2008)
Overbeek, R., Begley, T., Butler, R.M., Choudhuri, J.V., Chuang, H.Y., et al.: The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res. 33, 5691–5702 (2005)
Rodriguez-Brito, B., Rohwer, F., Edwards, R.A.: An application of statistics to compatative metagenomics. BMC Bioinformatics 20(7), 162 (2006)
Markowitz, V.M., Szeto, E., Palaniappan, K., Grechkin, Y., Chu, K., Chen, I.A., Dubchak, I., Anderson, I., Lykidis, A., Mavromatis, K., Ivanova, N.N., Kyrpides, N.C.: The integrated microbial genomes (IMG) system in 2007: data content and analysis tool extensions. Nucleic Acids Res. 36(Database Issue), D528–D533 (2008)
Sharon, I., Pati, A., Markowitz, V.M., Pinter, R.Y.: A statistical framework for the functional analysis of metagenomes. In: Batzoglou, S. (ed.) RECOMB 2009. LNCS, vol. 5541, pp. 496–511. Springer, Heidelberg (2009)
Lander, E.S., Waterman, M.S.: Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics 2(3), 231–239 (1988)
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
Mollet, C., Drancourt, M., Raoult, D.: rpoB sequence analysis as a novel basis for bacterial identification. Mol. Microbiol. 26(5), 1005–1011 (1997)
Venter, J.C., Remington, K., Heidelberg, J.F., Halpern, A.L., Rusch, D., et al.: Environmental genome shotgun sequencing of the Sargasso Sea. Science 304(5667), 66–74 (2004)
Loy, A., Duller, S., Baranyi, C., Mußmann, M., Ott, J., et al.: Reverse dissimilatory sulfite reductase and other Dsr Proteins in sulfur-oxidizing bacteria: evolutionary history and suitability as phylogenetic markers. Environ. Microbiol. 11, 289–299 (2009)
Yutin, N., Suzuki, M.T., Teeling, H., Weber, M., Venter, J.C., et al.: Assessing diversity and biogeography of aerobic anoxygenic phototrophic bacteria in surface waters of the Atlantic and Pacific Oceans using the Global Ocean Sampling expedition metagenomes. Environ. Microbiol. 9, 1464–1475 (2007)
Howard, E.C., Henriksen, J.R., Buchan, A., Reisch, C.R., Bürgmann, H., et al.: Bacterial taxa that limit sulfur flux from the ocean. Science 314(5799), 649–652 (2006)
Edwards, R.A., Rodriguez-Brito, B., Wegley, L., Haynes, M., Breitbart, M., et al.: Using pyrosequencing to shed light on deep mine microbial ecology. BMC Genomics 7, 57 (2006)
Feingersch, R., Suzuki, M.T., Shmoish, M., Sharon, I., Sabehi, G., et al.: Microbial community genomics in eastern Mediterranean Sea surface waters. ISME J. (2009) doi:10.1038/ismej.2009.92
Dinsdale, E.A., Edwards, R.A., Hall, D., Angly, F., Breitbart, M., et al.: Functional metagenomic profiling of nine biomes. Nature 452, 629–632 (2008)
Ye, Y., Doak, T.G.: A parsimony approach to biological pathway reconstruction/inference for genomes and metagenomes. PLoS Comput. Biol. 5(8), e1000465 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bercovici, S., Sharon, I., Pinter, R.Y., Shlomi, T. (2010). Pathway-Based Functional Analysis of Metagenomes. In: Berger, B. (eds) Research in Computational Molecular Biology. RECOMB 2010. Lecture Notes in Computer Science(), vol 6044. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12683-3_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-12683-3_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12682-6
Online ISBN: 978-3-642-12683-3
eBook Packages: Computer ScienceComputer Science (R0)