ABSTRACT
Systems biology aims to understand the behavior of and interaction between various components of the living cell, such as genes, proteins, and metabolites. A large number of components are involved in these complex systems and the diversity of relationships between the components can be overwhelming, and there is therefore a need for analysis methods incorporating data integration. We here present a method for exploring gene regulatory mechanisms which integrates various types of data to assist the identification of important components in gene regulation mechanisms. By first analyzing gene expression data, a set of differentially expressed genes is selected. These genes are then further investigated by combining various types of biological information, such as clustering results, promoter sequences, binding sites, transcription factors and other previously published information regarding the selected genes. Inspired by Information Fusion research, we also mapped functions of the proposed method to the well-known OODA-model to facilitate application of this data integration method in other research communities. We have successfully applied the method to genes identified as differentially expressed in human embryonic stem cells at different stages of differentiation towards cardiac cells. We identified 15 novel motifs that may represent important binding sites in the cardiac cell linage.
- GuhaThakurta, D. Computational identification of transcriptional regulatory elements in DNA sequence. 2006. Nucleic Acids Res. 34, 3585--3598Google Scholar
- Rani, V. 2007. Computational methods to dissect cis-regulatory transcriptional network. J. Biosci. 32, 1325--1330.Google ScholarCross Ref
- Datta, S., Sokhansanj, B. A. 2007. Accelerated search for biomolecular network models to interpret high-throughput experimental data. BMC Bioinformatics. 8, 258.Google ScholarCross Ref
- Wang, Y., Joshi, T., Zhang, X. S., Xu, D., and Chen L. 2006. Inferring gene regulatory networks from multiple microarray datasets. Bioinformatics. 22 (19), 2413--2420. Google ScholarDigital Library
- Laurent, L. C., Chen, J., Ulitsky, I., Mueller, F. J., Lu, C., Shamir, R., Fan, J. B., and Loring, J. F. 2008. Comprehensive MicroRNA Profiling Reveals a Unique Human Embryonic Stem Cell Signature Dominated by a Single Seed Sequence. Stem Cells, 26, 1506--1516.Google ScholarCross Ref
- Stark, C., Breitkreutz, B. J., Reguly, T., Boucher, L., Breitkreutz, A., Tyers, M. 2006. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 34, 535--539.Google ScholarCross Ref
- Kanehisa, M., Goto, S., Hattori, M., Aoki-Kinoshita, K. F., Itoh, M., Kawashima, S., Katayama, T., Araki, M., Hirakawa, M. 2006. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 34, 354--357.Google ScholarCross Ref
- Eisen, M., Spellman, P. L., Brown, P. O., Botsein, D. 1998. Cluster analysis and display of genomewide expression patterns. Proc Natl Acad Sci USA 95. 14863--14868.Google Scholar
- Tamayo, P., Slonim, D., Mesirov, J., Zhu, Q., Kitareewan, S., Dmitrovsky, E., Launder, E. S., Golub, T. R. 1999. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoetic differentiation. Proc Natl Acad Sci USA 96. 2907--2912.Google ScholarCross Ref
- The Gene Ontology Consortium. 2000. Gene Ontology: tool for the unification of biology. Nature Genet. 25, 25--29.Google ScholarCross Ref
- Gawronska, B., Erlendsson, B., Olsson., B. 2006. Towards an Automated Analysis of Biomedical Abstracts. DILS 2006, 50--65. Google ScholarDigital Library
- McNeish J. 2004. Embryonic stem cells in drug discovery. Nat Rev Drug Discov. 3, 70--80.Google ScholarCross Ref
- Beqqali, A., Kloots, J., Ward-van Oostwaard, D., Mummery, C., Passier, R. 2006. Genome-wide transcriptional profiling of human embryonic stem cells differentiating to cardiomyocytes. Stem Cells. 24, 1956--1967.Google ScholarCross Ref
- Synnergren, J., Adak, S., Englund, M. C., Giesler, T. L., Noaksson, K., Lindahl, A., Nilsson, P., Nelson, D., Abbot, S., Olsson, B., and Sartipy, P. 2008. Cardiomyogenic gene expression profiling of differentiating human embryonic stem cells. J Biotechnol. 134, 162--170.Google ScholarCross Ref
- Ernst, J., Nau, G., Bar-Joseph, Z. 2005. Clustering short time series gene expression data. Bioinformatics. 21, 59--68. Google ScholarDigital Library
- Birnbaum, K., Benfey, P. N. and Shasha, D. E. 2001. Cis element/transcription factor analysis (cis/TF): a method for discovering transcription factor/cis element relationships. Genome Res. 11, 1583--1590.Google ScholarCross Ref
- Zhu, Z., Yitzhak, P. and George, M. C. 2002. Computational identification of transcription factor binding sites via a transcription-factorcentric clustering (TFCC) algorithm. J. Mol. Biol. 318, 71--81.Google ScholarCross Ref
- Han J. D, Bertin N., Hao T., et al. Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature. 2004;430:88--93.Google Scholar
- Harris, B.,S., Jay, P., Y., Rackley, M., S., et al. Transcriptional regulation of cardiac conduction system development: 2004 FASEB cardiac conduction system minimeeting, Washington, DC. Anat Rec A Discov Mol Cell Evol Biol. 2004;280:1036--1045.Google ScholarCross Ref
- Firouzi, M., Bierhuizen, M.,F., Kok, B., et al. The human Cx40 promoter polymorphism -44G->A differentially affects transcriptional regulation by Sp1 and GATA4. Biochim Biophys Acta. 2006;1759:491--496.Google Scholar
- Linhares, V.,L., Almeida, N.,A., Menezes, D., C., et al. Transcriptional regulation of the murine Connexin40 promoter by cardiac factors Nkx2-5, GATA4 and Tbx5. Cardiovasc Res. 2004;64:402--411.Google Scholar
- Yang, W., M., Inouye, C., J., Seto, E. Cyclophilin A and FKBP12 interact with YY1 and alter its transcriptional activity.J Biol Chem. 1995;270:15187--1519.Google Scholar
- Coram, R. 2002. Boyd: The Fighter Pilot Who changed the Art of War. ISBN 0-316-88146-5. Little Brown.Google Scholar
Index Terms
- A data integration method for exploring gene regulatory mechanisms
Recommendations
Biological Data Warehousing System for Identifying Transcriptional Regulatory Sites From Gene Expressions of Microarray Data
Identification of transcriptional regulatory sites plays an important role in the investigation of gene regulation. For this propose, we designed and implemented a data warehouse to integrate multiple heterogeneous biological data sources with data ...
Integration of epigenetic data in Bayesian network modeling of gene regulatory network
PRIB'11: Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformaticsThe reverse engineering of gene regulatory network (GRN) is an important problem in systems biology. While gene expression data provide a main source of insights, other types of data are needed to elucidate the structure and dynamics of gene regulation. ...
Integration of Multiple Data Sources for Gene Network Inference using Genetic Perturbation Data: Extended Abstract
BCB '18: Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health InformaticsThe inference of gene networks from large-scale human genomic data is challenging due to the difficulty in identifying correct regulators for each gene in a high-dimensional search space. We present a Bayesian approach integrating external data sources ...
Comments