Research Article
Transcriptional master regulator analysis in breast cancer genetic networks

https://doi.org/10.1016/j.compbiolchem.2015.08.007Get rights and content

Abstract

Gene regulatory networks account for the delicate mechanisms that control gene expression. Under certain circumstances, gene regulatory programs may give rise to amplification cascades. Such transcriptional cascades are events in which activation of key-responsive transcription factors called master regulators trigger a series of gene expression events. The action of transcriptional master regulators is then important for the establishment of certain programs like cell development and differentiation. However, such cascades have also been related with the onset and maintenance of cancer phenotypes. Here we present a systematic implementation of a series of algorithms aimed at the inference of a gene regulatory network and analysis of transcriptional master regulators in the context of primary breast cancer cells. Such studies were performed in a highly curated database of 880 microarray gene expression experiments on biopsy-captured tissue corresponding to primary breast cancer and healthy controls. Biological function and biochemical pathway enrichment analyses were also performed to study the role that the processes controlled – at the transcriptional level – by such master regulators may have in relation to primary breast cancer. We found that transcription factors such as AGTR2, ZNF132, TFDP3 and others are master regulators in this gene regulatory network. Sets of genes controlled by these regulators are involved in processes that are well-known hallmarks of cancer. This kind of analyses may help to understand the most upstream events in the development of phenotypes, in particular, those regarding cancer biology.

Introduction

Cancer is a pathway-disease (Hanahan and Weinberg, 2000). The main hallmarks of cancer are associated to the action of pathways related to cell proliferation, apoptosis evasion, cell differentiation and in general, to the dysregulation of cell cycle and the alteration of DNA-repairing processes (Hanahan and Weinberg, 2000, Hanahan and Weinberg, 2011). The phenotype of a cell is determined by the activity of a large number of genes and proteins (Basso et al., 2005). Hence, transcriptional regulation lies at the heart of many of the intricate molecular relationships driving the activity of biological pathways (Emmert-Streib et al., 2014).

It has been observed that a number of large scale transcriptional cascades behind such complex cellular processes are actually triggered by the action of a relatively small number of transcription factor molecules that have been called Transcriptional Master Regulators (TMRs) (Han et al., 2004, Sun-Kin Chan and Kyba, 2013, Mullen et al., 2011). It has been argued that these genes control the entire transcriptional regulatory program for specific cellular phenotypes (in eukaryotic cells; Han et al., 2004, Basso et al., 2005, Affara et al., 2013). However, TMRs are also able to act on general cellular processes at the same time (Hinnebusch and Natarajan, 2002, Medvedovic et al., 2011, Affara et al., 2013). A proper understanding of the organization of these TMR-mediated highly-regulated events is thus crucial to elucidate normal cell physiology as well as complex pathological phenotypes (Basso et al., 2005).

Given the complex mechanisms underlying transcriptional regulations on eukaryotes, the identification of TMRs is often based on the (inferred or observed) relationship among them and their cascade of RNA targets in gene regulatory networks (Hernández-Lemus and Siqueiros-García, 2013). Being a primary upstream event in the cell regulatory program, dysregulation of TMRs may have a high impact on cancer-related phenotypes, since under genetic instability conditions, uncontrolled synthesis of these molecules could originate the activation/amplification of several transcriptional cascades (Basso et al., 2005, Baca-Lopez et al., 2014, Baca-López et al., 2012).

A TMR is a transcription factor (TF) that is expressed at the early onset of the development of a particular phenotype or cell type (Sun-Kin Chan and Kyba, 2013). It also participates in the specifications of such a phenotype by regulating multiple downstream genes, either directly or by means of genetic cascades. Transcription factors are hence key cellular components that control gene expression: their activities may determine how cells function and respond to the environment (Vaquerizas et al., 2009).

Transcription factors may act in two opposite directions: either activating or repressing transcriptional activity of their targets. Based on the initial estimations of the whole human genome sequence, it was calculated that the transcriptional machinery could be composed of 200 to 300 genes and there could exist between 2000 to 3000 specific union sites for transcription factors (Lander et al., 2001, Venter et al., 2001). In Vaquerizas et al. (2009) it is stated that in the http://amigo.geneontology.orgGene Ontology database 1052 TFs were defined and just 6% (62 cases) of them had experimental corroboration. Six years later, the same database recognized 1846 TFs and 14% (260) of them had experimental evidence. This is indicative of the fast progress on documenting the transcription mechanisms, but this also points to the overwhelming complexity of the mechanisms of genomic control.

Implementation of computational methods to identify and analyze TMRs is relevant in the context of breast cancer, particularly at its earliest stages. We have probabilistically inferred the gene regulatory network associated with this phenotype, then a computational analysis has uncovered its active TMRs in the context of primary breast cancer. In our study we have considered such an analysis, as well as the resulting TMR-related phenomena in the context of transcriptional regulatory programs. We also discuss here some of the implications of our results in breast cancer biology. The article is structured as follows: Section 2 presents an overview of the materials and methods used in this work. This includes both the experimental datasets used, the network inference strategy and the molecular signature analysis, as well as the algorithm for the discovery of transcriptional master regulators. Section 3 presents some of the main results of the application of this pipeline in primary breast cancer microarray gene expression data. Finally, Section 4 presents some conclusions mainly related with the advantages of implementing a method such as MARINa (Lefebvre et al., 2010) in order to unveil some aspects of regulatory control that may lie behind the establishment of tumor phenotypes.

Section snippets

Experimental datasets

For the analysis presented here, we obtained 880 microarray expression profiles from several experimental datasets that are available on the Gene Expression Omnibus site (http://www.ncbi.nlm.nih.gov/geo/GEO) (Edgar et al., 2002). All experiments were performed by using total mRNA on the microarray platform Affymetrix HGU133A (GPL96), which consists of 18,400 transcripts and variants, including 14,500 well-characterized human genes (Liu et al., 2003). From the total 880 samples, 819 correspond

Results and discussion

In this work we inferred a transcriptional regulatory network that is based on mutual information for 14,500 genes and 880 microarray gene expression samples corresponding to biopsy-captured tissue in breast cancer patients and controls (a comparative table of inferred networks with the four threshold values is presented in Supplementary Material 3). The direction of the regulation of the Transcription Factors (TFs) was inferred based on the expression of the target genes (a comparative table

Conclusions

In this study, we implemented a method based on the combination of gene regulatory network inference and gene set enrichment analysis algorithms. We did this across a set of gene expression experiments capable of inducing a molecular signature that distinguishes cases from controls. In this approach, cases were samples belonging to biopsy-captured primary breast cancer tissue while controls were healthy breast tissue. This algorithm called Master Regulator Inference Analysis (MARINa) has

Acknowledgements

Funding statement: This work was supported by CONACYT (grant no. 179431/2012), as well as by federal funding from the National Institute of Genomic Medicine (Mexico). This work has been submitted to comply with the requirements of the Ph.D. program in Biological Sciences at the Universidad Nacional Autónoma de México of Hugo Antonio Tovar-Romero (HATR-Hugo Tovar). HATR is grateful to CONACYT for the financial support provided via a PhD Scholarship (grant no. 202668). Authors are grateful to the

References (70)

  • I.B. Pau Ni et al.

    Gene expression patterns distinguish breast carcinomas from normal breast tissues: the Malaysian context

    Pathol. Res. Pract.

    (2010)
  • H. Qiao et al.

    Human TFDP3, a novel DP protein, inhibits DNA binding and transactivation by E2F

    J. Biol. Chem.

    (2007)
  • C. Tian et al.

    TFDP3 inhibits E2F1-induced, p53-mediated apoptosis

    Biochem. Biophys. Res. Commun.

    (2007)
  • M.O. Abildgaard et al.

    Downregulation of zinc finger protein 132 in prostate cancer is associated with aberrant promoter hypermethylation and poor prognosis

    Int. J. Cancer

    (2012)
  • M. Affara et al.

    Vasohibin-1 is identified as a master-regulator of endothelial cell apoptosis using gene network analysis

    BMC Genomics

    (2013)
  • M.J. Alvarez

    ssmarina: Single sample-optimized Master Regulator Analysis. R package version 1.01

    (2013)
  • Y. Assenov et al.

    Computing topological parameters of biological networks

    Bioinformatics

    (2008)
  • K. Baca-López et al.

    The role of master regulators in the metabolic/transcriptional coupling in breast carcinomas

    PLoS ONE

    (2012)
  • K. Baca-Lopez et al.

    A 3-state model for multidimensional genomic data integration

    Syst. Biomed.

    (2014)
  • K.V. Ballman et al.

    Faster cyclic loess: normalizing RNA arrays via linear models

    Bioinformatics

    (2004)
  • M. Bansal et al.

    How to infer gene networks from expression profiles

    Mol. Syst. Biol.

    (2007)
  • K. Basso et al.

    Reverse engineering of regulatory networks in human B cells

    Nat. Genet.

    (2005)
  • M.S. Carro et al.

    The transcriptional network for mesenchymal transformation of brain tumours

    Nature

    (2010)
  • C. Chen et al.

    Removing batch effects in analysis of expression microarray data: an evaluation of six batch adjustment methods

    PLoS ONE

    (2011)
  • J. Cowan et al.

    Genetic and functional analyses of ZIC3 variants in congenital heart disease

    Hum. Mutat.

    (2014)
  • B. De Paepe et al.

    Increased angiotensin II type-2 receptor density in hyperplasia, DCIS and invasive carcinoma of the breast is paralleled with increased iNOS expression

    Histochem. Cell Biol.

    (2002)
  • C. Desmedt et al.

    Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series

    Clin. Cancer Res.

    (2007)
  • R. Edgar et al.

    Gene Expression Omnibus: NCBI gene expression and hybridization array data repository

    Nucleic Acids Res.

    (2002)
  • F. Emmert-Streib et al.

    The gene regulatory network for breast cancer: integrated regulatory landscape of cancer hallmarks

    Stat. Genet. Methodol.

    (2014)
  • J. Espinal-Enríquez et al.

    Genome-wide expression analysis suggests a crucial role of dysregulation of Matrix Metalloproteinases Pathway in Undifferentiated Thyroid Carcinoma

    BMC Genomics

    (2015)
  • P. Farmer et al.

    Identification of molecular apocrine breast tumours by microarray analysis

    Oncogene

    (2005)
  • P. Grass

    Experimental design.

  • J.-D.J. Han et al.

    Evidence for dynamically organized modularity in the yeast protein–protein interaction network

    Nature

    (2004)
  • S. Hara et al.

    [Hypoxia-inducible factor-3alpha as a negative regulator of tumorigenesis]

    Seikagaku

    (2011)
  • M. Heikkila et al.

    Roles of the human hypoxia-inducible factor (HIF)-3 variants in the hypoxia response

    Cell. Mol. Life Sci.

    (2011)
  • Cited by (43)

    • A meta-study on transcription factor networks in the suicidal brain

      2021, Journal of Psychiatric Research
      Citation Excerpt :

      Consequently, little is known about the role of other regulatory mechanisms in the transcriptional alterations found in the suicidal brain, i.e. transcription factors (TF). TF control the initiation of transcriptional cascades (Tovar et al., 2015), regulating hundreds to thousands of downstream genes (Johnson et al., 2007). Furthermore, mutual regulation among TF, i.e., TF regulating other TF, underlies major complex biological functions, such as cell proliferation and differentiation (Neph et al., 2012).

    • Networks of transcription factors

      2020, Genome Plasticity in Health and Disease
    View all citing articles on Scopus
    View full text