ABSTRACT
Inferring the genetic network architecture in cells is of great importance to biologists as it can lead to the understanding of cell signaling and metabolic dynamics underlying cellular processes, onset of diseases, and potential discoveries in drug development. The focus today has shifted to genome scale inference approaches using information theoretic metrics such as mutual information over the gene expression data. In this paper, we propose two classes of inference algorithms using scoring schemes on complex interactions which are primarily based on information theoretic metrics. The central idea is to go beyond pair-wise interactions and utilize more complex structures between any node (gene or transcription factor) and its possible multiple regulators (only transcription factors). While this increases the network inference complexity over pair-wise interaction based approaches, it achieves much higher accuracy. We restricted the complex interactions considered in this paper to 3-node structures (any node and its two regulators) to keep our schemes scalable to genome-scale inference and yet achieve higher accuracy than other state of the art approaches. Detailed performance analyses based on benchmark precision and recall metrics over the known Escherichia coli transcriptional regulatory network, indicated that the accuracy of the proposed algorithms (sCoIn, aCoIn and its variants) is consistently higher in comparison to popular algorithms such as context likelihood of relatedness (CLR), relevance networks (RN) and GEneNetwork Inference with Ensemble of trees (GENIE3).
- Buchanan M, Caldarelli G, De Los Rios P, Rao Francesco and Vendruscolo M. "Networks in cell biology". Cambridge University Press, 2010, ISBN 978-O-521-88273-6.Google ScholarCross Ref
- Babu MM, Luscombe NM, Aravind L, Gerstein M, Teichmann SA. "Structure and evolution of transcriptional regulatory networks". Curr Opin Struct Biol, 2004, Vol.14(3):283--91.Google ScholarCross Ref
- Babu MM, Lang B, Aravind L. "Methods to reconstruct and compare transcriptional regulatory networks". Methods Mol Biol, 2009, 541:163--80.Google ScholarCross Ref
- Hecker, S. Lambeck, S. Toepfer, E. van Someren and R. Guthke. "Gene regulatory network inference: data integration in dynamic models---a review". Biosystems, 2009, 96(1):86--103.Google ScholarCross Ref
- Imoto S, Goto T, Miyano S. "Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression". Pacific Symposium on Biocomputing, 2002, 175--186.Google Scholar
- Murphy K, Mian S. "Modelling gene expression data using dynamic Bayesian networks". In Technical report 1999, Computer Science Division University of California, Berkeley, CA.Google Scholar
- Kauffman SA. "Metabolic stability and epigenesis in randomly constructed genetic nets". J Theor Biol, 1969, 22:437--467.Google ScholarCross Ref
- Schmulevich I, Dougherty ER, Kim S, Zhang W. "Probabilistic Boolean Networks: a rule-based uncertainty model for gene regulatory networks". Bioinformatics, 2002, 18(2):261--274.Google ScholarCross Ref
- Chen T, He HL, Church GM. "Modeling gene expression with differential equations". Pacific Symposium on Biocomputing, 1999, 4:29--40.Google Scholar
- Eisen MB, Spellman PT, Brown PO, Botstein D. "Cluster analysis and display of genome-wide expression patterns". Proc Natl Acad Sci USA, 1998, 95: 14863--14868.Google ScholarCross Ref
- Butte AJ, Kohane IS. "Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements". Pacific Symposium on Biocomputing, 2000: 418--429.Google Scholar
- Liang S, Fuhrman S, Somogyi R. REVEAL. "A general reverse engineering algorithm for inference of genetic network architectures". Pacific Symposium on Biocomputing. 1998;3:18--29.Google Scholar
- Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Dalla Favera R, Califano A. "ARACNE: An algorithm for reconstruction of genetic networks in a mammalian cellular context". BMC Bioinformatics, 2006, 7 Suppl 1:S7.Google ScholarCross Ref
- Faith J. J, Hayete B, Thaden J. T, Mogno I, Wierzbowski J, Cottarel G, Kasif S, Collins J. J, and Gardner T. S. "Large-scale mapping and validation of escherichia coli transcriptional regulation from a compendium of expression profiles". PLoS Biology, 2007, 5(1):e8.Google ScholarCross Ref
- Chaitankar V, Ghosh P, Elasri MO, Perkins EJ. "sREVEAL: Scalable extensions of REVEAL towards regulatory network inference". IEEE ISDA, 2011, pp. 1365--1370.Google ScholarCross Ref
- Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P. "Inferring regulatory networks from expression data using tree-based methods". PLoS ONE, 2010, 5(9): e12776.Google ScholarCross Ref
- Prill RJ, Saez-Rodriguez J, Alexopoulos LG, Sorger PK, Stolovitzky G. "Crowdsourcing Network Inference: The DREAM4 predictive signaling network challenge". Science Signaling, 2011, Sep 6;4(189):mr7.Google Scholar
- Marbach D, Schaffter T, Mattiussi C, and Floreano D. "Generating realistic in silico gene networks for performance assessment of reverse engineering methods". Journal of Computational Biology, 2009, 16(2):229--239.Google ScholarCross Ref
- Cover T M, Thomas J A. "Elements of information theory". Wiley-Interscience, New York, 1991. Google ScholarDigital Library
- A. Kraskov, H. Stögbauer, and P. Grassberger. "Estimating Mutual information", Phys. Rev, 2004, E 69 (6) 066138.Google ScholarCross Ref
- C. Daub et al., "Estimating mutual information using B-spline functions--an improved similarity measure for analyzing gene expression data". BMC Bioinformatics, 2004, 5:118.Google ScholarCross Ref
- Chaitankar V et al. "A scalable information theory based gene regulatory network inference method from time series and knock-out data" BICoB, 2011, pp.74--79.Google Scholar
- Chaitankar et al. "Gene regulatory network inference using predictive minimum description length principle and conditional mutual information". IJCBS, 2009, pp.487--490. Google ScholarDigital Library
- Chaitankar et al. "Effects of cDNA microarray time-series data size on gene regulatory network inference accuracy". ACM-BCB, 2010, pp.410--413. Google ScholarDigital Library
- Chaitankar et al. "A novel gene regulatory inference algorithm using predictive minimum description length approach". BMC Systems Biology, 2010, 4 Suppl 1:S7.Google ScholarCross Ref
- Chaitankar et al. "Time lagged information theoretic approaches to the reverse engineering of gene regulatory networks". BMC Bioinformatics, 2010, 11 Suppl 6:S19.Google ScholarCross Ref
- Chaitankar et al. "Predictive minimum description length principle approach to inferring gene regulatory networks". Adv Exp Med Biol, 2011, 696:37--43.Google ScholarCross Ref
- Lee, T. I. et al. "Transcriptional regulatory networks in Saccharomyces cerevisiae". Science, 2002, 298:799--804.Google ScholarCross Ref
Index Terms
- Genome scale inference of transcriptional regulatory networks using mutual information on complex interactions
Recommendations
A causal inference approach for constructing transcriptional regulatory networks
Motivation: Transcriptional regulatory networks specify the interactions among regulatory genes and between regulatory genes and their target genes. Discovering transcriptional regulatory networks helps us to understand the underlying mechanism of ...
sCoIn: A scoring algorithm based on complex interactions for reverse engineering regulatory networks
BIBE '12: Proceedings of the 2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)Structural analysis over well studied transcriptional regulatory networks indicates that these complex networks are made up of small set of reoccurring patterns called motifs. While information theoretic approaches have been immensely popular, these ...
Comments