Research ArticleTranscriptional master regulator analysis in breast cancer genetic networks
Graphical abstract
Introduction
Cancer is a pathway-disease (Hanahan and Weinberg, 2000). The main hallmarks of cancer are associated to the action of pathways related to cell proliferation, apoptosis evasion, cell differentiation and in general, to the dysregulation of cell cycle and the alteration of DNA-repairing processes (Hanahan and Weinberg, 2000, Hanahan and Weinberg, 2011). The phenotype of a cell is determined by the activity of a large number of genes and proteins (Basso et al., 2005). Hence, transcriptional regulation lies at the heart of many of the intricate molecular relationships driving the activity of biological pathways (Emmert-Streib et al., 2014).
It has been observed that a number of large scale transcriptional cascades behind such complex cellular processes are actually triggered by the action of a relatively small number of transcription factor molecules that have been called Transcriptional Master Regulators (TMRs) (Han et al., 2004, Sun-Kin Chan and Kyba, 2013, Mullen et al., 2011). It has been argued that these genes control the entire transcriptional regulatory program for specific cellular phenotypes (in eukaryotic cells; Han et al., 2004, Basso et al., 2005, Affara et al., 2013). However, TMRs are also able to act on general cellular processes at the same time (Hinnebusch and Natarajan, 2002, Medvedovic et al., 2011, Affara et al., 2013). A proper understanding of the organization of these TMR-mediated highly-regulated events is thus crucial to elucidate normal cell physiology as well as complex pathological phenotypes (Basso et al., 2005).
Given the complex mechanisms underlying transcriptional regulations on eukaryotes, the identification of TMRs is often based on the (inferred or observed) relationship among them and their cascade of RNA targets in gene regulatory networks (Hernández-Lemus and Siqueiros-García, 2013). Being a primary upstream event in the cell regulatory program, dysregulation of TMRs may have a high impact on cancer-related phenotypes, since under genetic instability conditions, uncontrolled synthesis of these molecules could originate the activation/amplification of several transcriptional cascades (Basso et al., 2005, Baca-Lopez et al., 2014, Baca-López et al., 2012).
A TMR is a transcription factor (TF) that is expressed at the early onset of the development of a particular phenotype or cell type (Sun-Kin Chan and Kyba, 2013). It also participates in the specifications of such a phenotype by regulating multiple downstream genes, either directly or by means of genetic cascades. Transcription factors are hence key cellular components that control gene expression: their activities may determine how cells function and respond to the environment (Vaquerizas et al., 2009).
Transcription factors may act in two opposite directions: either activating or repressing transcriptional activity of their targets. Based on the initial estimations of the whole human genome sequence, it was calculated that the transcriptional machinery could be composed of 200 to 300 genes and there could exist between 2000 to 3000 specific union sites for transcription factors (Lander et al., 2001, Venter et al., 2001). In Vaquerizas et al. (2009) it is stated that in the http://amigo.geneontology.orgGene Ontology database 1052 TFs were defined and just 6% (62 cases) of them had experimental corroboration. Six years later, the same database recognized 1846 TFs and 14% (260) of them had experimental evidence. This is indicative of the fast progress on documenting the transcription mechanisms, but this also points to the overwhelming complexity of the mechanisms of genomic control.
Implementation of computational methods to identify and analyze TMRs is relevant in the context of breast cancer, particularly at its earliest stages. We have probabilistically inferred the gene regulatory network associated with this phenotype, then a computational analysis has uncovered its active TMRs in the context of primary breast cancer. In our study we have considered such an analysis, as well as the resulting TMR-related phenomena in the context of transcriptional regulatory programs. We also discuss here some of the implications of our results in breast cancer biology. The article is structured as follows: Section 2 presents an overview of the materials and methods used in this work. This includes both the experimental datasets used, the network inference strategy and the molecular signature analysis, as well as the algorithm for the discovery of transcriptional master regulators. Section 3 presents some of the main results of the application of this pipeline in primary breast cancer microarray gene expression data. Finally, Section 4 presents some conclusions mainly related with the advantages of implementing a method such as MARINa (Lefebvre et al., 2010) in order to unveil some aspects of regulatory control that may lie behind the establishment of tumor phenotypes.
Section snippets
Experimental datasets
For the analysis presented here, we obtained 880 microarray expression profiles from several experimental datasets that are available on the Gene Expression Omnibus site (http://www.ncbi.nlm.nih.gov/geo/GEO) (Edgar et al., 2002). All experiments were performed by using total mRNA on the microarray platform Affymetrix HGU133A (GPL96), which consists of 18,400 transcripts and variants, including 14,500 well-characterized human genes (Liu et al., 2003). From the total 880 samples, 819 correspond
Results and discussion
In this work we inferred a transcriptional regulatory network that is based on mutual information for 14,500 genes and 880 microarray gene expression samples corresponding to biopsy-captured tissue in breast cancer patients and controls (a comparative table of inferred networks with the four threshold values is presented in Supplementary Material 3). The direction of the regulation of the Transcription Factors (TFs) was inferred based on the expression of the target genes (a comparative table
Conclusions
In this study, we implemented a method based on the combination of gene regulatory network inference and gene set enrichment analysis algorithms. We did this across a set of gene expression experiments capable of inducing a molecular signature that distinguishes cases from controls. In this approach, cases were samples belonging to biopsy-captured primary breast cancer tissue while controls were healthy breast tissue. This algorithm called Master Regulator Inference Analysis (MARINa) has
Acknowledgements
Funding statement: This work was supported by CONACYT (grant no. 179431/2012), as well as by federal funding from the National Institute of Genomic Medicine (Mexico). This work has been submitted to comply with the requirements of the Ph.D. program in Biological Sciences at the Universidad Nacional Autónoma de México of Hugo Antonio Tovar-Romero (HATR-Hugo Tovar). HATR is grateful to CONACYT for the financial support provided via a PhD Scholarship (grant no. 202668). Authors are grateful to the
References (70)
- et al.
A hypoxia-inducible factor (HIF)-3 splicing variant, HIF-34 impairs angiogenesis in hypervascular malignant meningiomas with epigenetically silenced HIF-34
Biochem. Biophys. Res. Commun.
(2013) - et al.
Cross-species regulatory network analysis identifies a synergistic interaction between FOXM1 and CENPF that drives prostate cancer malignancy
Cancer Cell
(2014) - et al.
RUNX2 is overexpressed in melanoma cells and mediates their migration and invasion
Cancer Lett.
(2014) - et al.
T-brain-1: a homolog of Brachyury whose expression defines molecularly distinct domains within the cerebral cortex
Neuron
(1995) - et al.
A protein-protein interaction network of transcription factors acting during liver cell proliferation
Genomics
(2008) - et al.
The hallmarks of cancer
Cell
(2000) - et al.
Hallmarks of Cancer: The Next Generation
Cell
(2011) - et al.
TFDP3 was expressed in coordination with E2F1 to inhibit E2F1-mediated apoptosis in prostate cancer
Gene
(2014) - et al.
Pax5: a master regulator of B cell development and leukemogenesis
Adv. Immunol.
(2011) - et al.
Master transcription factors determine cell-type-specific responses to TGF-β signaling
Cell
(2011)
Gene expression patterns distinguish breast carcinomas from normal breast tissues: the Malaysian context
Pathol. Res. Pract.
Human TFDP3, a novel DP protein, inhibits DNA binding and transactivation by E2F
J. Biol. Chem.
TFDP3 inhibits E2F1-induced, p53-mediated apoptosis
Biochem. Biophys. Res. Commun.
Downregulation of zinc finger protein 132 in prostate cancer is associated with aberrant promoter hypermethylation and poor prognosis
Int. J. Cancer
Vasohibin-1 is identified as a master-regulator of endothelial cell apoptosis using gene network analysis
BMC Genomics
ssmarina: Single sample-optimized Master Regulator Analysis. R package version 1.01
Computing topological parameters of biological networks
Bioinformatics
The role of master regulators in the metabolic/transcriptional coupling in breast carcinomas
PLoS ONE
A 3-state model for multidimensional genomic data integration
Syst. Biomed.
Faster cyclic loess: normalizing RNA arrays via linear models
Bioinformatics
How to infer gene networks from expression profiles
Mol. Syst. Biol.
Reverse engineering of regulatory networks in human B cells
Nat. Genet.
The transcriptional network for mesenchymal transformation of brain tumours
Nature
Removing batch effects in analysis of expression microarray data: an evaluation of six batch adjustment methods
PLoS ONE
Genetic and functional analyses of ZIC3 variants in congenital heart disease
Hum. Mutat.
Increased angiotensin II type-2 receptor density in hyperplasia, DCIS and invasive carcinoma of the breast is paralleled with increased iNOS expression
Histochem. Cell Biol.
Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series
Clin. Cancer Res.
Gene Expression Omnibus: NCBI gene expression and hybridization array data repository
Nucleic Acids Res.
The gene regulatory network for breast cancer: integrated regulatory landscape of cancer hallmarks
Stat. Genet. Methodol.
Genome-wide expression analysis suggests a crucial role of dysregulation of Matrix Metalloproteinases Pathway in Undifferentiated Thyroid Carcinoma
BMC Genomics
Identification of molecular apocrine breast tumours by microarray analysis
Oncogene
Experimental design.
Evidence for dynamically organized modularity in the yeast protein–protein interaction network
Nature
[Hypoxia-inducible factor-3alpha as a negative regulator of tumorigenesis]
Seikagaku
Roles of the human hypoxia-inducible factor (HIF)-3 variants in the hypoxia response
Cell. Mol. Life Sci.
Cited by (43)
A meta-study on transcription factor networks in the suicidal brain
2021, Journal of Psychiatric ResearchCitation Excerpt :Consequently, little is known about the role of other regulatory mechanisms in the transcriptional alterations found in the suicidal brain, i.e. transcription factors (TF). TF control the initiation of transcriptional cascades (Tovar et al., 2015), regulating hundreds to thousands of downstream genes (Johnson et al., 2007). Furthermore, mutual regulation among TF, i.e., TF regulating other TF, underlies major complex biological functions, such as cell proliferation and differentiation (Neph et al., 2012).
Networks of transcription factors
2020, Genome Plasticity in Health and DiseaseThe network structure of hematopoietic cancers
2023, Scientific ReportsKruppel-family zinc finger proteins as emerging epigenetic biomarkers in head and neck squamous cell carcinoma
2023, Journal of Otolaryngology - Head and Neck SurgeryMolecular mechanisms of multi-omic regulation in breast cancer
2023, Frontiers in OncologyThe regulatory landscape of retinoblastoma: A pathway analysis perspective
2022, Royal Society Open Science