Computational pan-cancer characterization of model-based quantitative transcription regulations dysregulated in regional lymph node metastasis

https://doi.org/10.1016/j.compbiomed.2021.104571Get rights and content

Highlights

  • Transcription regulation (TR) is an essential biological process.

  • TR was mostly regarded as yes/no between single mRNAs and single TFs.

  • We formulated TR as a regression model between a single mRNA and multiple TFs.

  • The proposed mqTrans models quantified the TR machinery in primary cancers.

  • Pan-cancer study received non-differentially expressed mRNAs with metastasis-changed TRs.

Abstract

Cancer is one of the major causes of mortality worldwide. Regional lymph node metastasis is an important mechanism during the spread of human cancers, in which transcription regulation plays an essential role. This study formulated a regression-model-based quantitative transcription regulation (mqTrans) between one mRNA gene and multiple transcription factors (TFs). Computational pan-cancer screening was carried out to detect the quantitative dysregulation of transcription regulation in the regional lymph node metastasis of 18 cancer types. Only a few metastasis-dysregulated mqTrans models were shared among the cancer types. The mRNA genes of the metastasis-dysregulated mqTrans models were not differentially expressed in regional lymph node metastasis. The experimental data suggested that mqTrans technology provided a complementary approach to the evaluation of transcription regulation mechanisms and may facilitate its quantitative investigation in other phenotypes.

Introduction

Cancer is the second leading cause of death worldwide, especially in patients diagnosed in the late stages [1]. The projected estimations in 2021 for the United States are 1,898,160 new cases and 608,570 cancer-related deaths [2]. The breast cancer incidence rate has seen a yearly increase of 0.3% during 2012–2016 [3]. Although the overall death rates continue to decline due to new technology in the diagnosis and treatment of cancer, incidence rates are leveling off among males and increasing slightly among females [4]. The current estimated 5-year relative survival rates of men and women for all cancer types combined are 68.5% and 70.1%, respectively [5].

Lymphatic metastasis is an important mechanism for the spread of human cancers [6]. Tumor lymphatics play an essential role in cancer progression and are solely responsible for transporting malignant cells to regional lymph nodes, preceding the systemic lethal spread [7]. For example, breast cancer metastasizes through the lymphovascular system to the regional lymph nodes in the axilla and to both visceral and non-visceral sites [8]. Local lymph node-positive or metastatic diagnosis was observed in most men who eventually died from prostate cancers [9].

Regional lymph node metastasis is thus an important prognostic factor in many cancer types [10,11], which has been used as a risk indicator for gastric cancer patients [12] and has negatively affected the prognosis of prostate cancer [13]. Regional lymph node metastasis is also considered during the treatment decision process [14], whether to remove the nodes as part of the primary cancer resection and prevent future risks or treat distant metastasis in vital organs [15]. Melanoma patients with regional lymph node metastasis usually require systemic and intensive adjuvant therapies [16]. Further, lymph node metastasis is also directly associated with treatment response, local recurrence, and long-term survival of gastric cancer patients [17]. Therefore, the investigation of lymph node metastasis across various cancer types has important clinical significance.

This study formulated the quantitative transcription regulation relationship between an mRNA and multiple transcription factors (TFs) as a regression (mqTrans) model. The mqTrans model of an mRNA was trained in the primary cancer samples, and evaluated for its fluctuations in the regional lymph node metastatic samples. This study established the quantitative landscape of transcription regulation across 18 cancer types, and then screened the changes in regional lymph node metastasis. Only a few metastasis-dysregulated mqTrans models were shared across, suggesting the inherent metastatic heterogeneity in different cancer types. Support from the literature was found for only three of the top-10 ranked metastasis-dysregulated genes in the 18 cancer types.

Section snippets

Datasets and preprocessing

The Cancer Genome Atlas (TCGA) is one of the most comprehensive genomic datasets of multiple cancer types [18]. The RNAseq-based transcriptomic datasets of 18 TCGA cancer types were used in this study. The detailed summary and preprocessing details are summarized in the supplementary section “Dataset summary and preprocessing” and Supplementary Table S1.

Feature selection for the regression models

Feature selection algorithms may substantially reduce the number of features used to train a classification or regression model [19,20]. To

Optimizing the mqTrans models of three cancer types

We selected three datasets, COAD/LUAD/LUSC, to optimize the parameter pThreshold of the mqTrans models, as shown in Fig. 1. The parameter pThreshold was tuned from 0.0 to 1.0 with a step size 0.1, and only the TF features with weights no smaller than pThreshold times the maximal TF weight in the same regression model were kept for further analysis.

Fig. 1 (a) shows that the mqTrans models fluctuate radically when pThreshold <0.3. The dataset COAD increases MeanMAE from 5.46 at pThreshold = 0.1

Conclusion

This study quantitatively investigated the transcription regulations using regression models, and comprehensively screened the genes whose model-based quantitative transcription regulations (mqTrans) were significantly altered in the lymph node metastasis of 18 cancer types. These cancer types only shared a limited number of metastasis-dysregulated mqTrans models, even between subtypes. This suggests an inherent heterogeneity of cancers.

The mqTrans technology also provided a new perspective to

Declaration of competing interest

The authors declared no competing interests.

Acknowledgements

This work was supported by the Jilin Provincial Key Laboratory of Big Data Intelligent Computing (20180622002JC), the Education Department of Jilin Province (JJKH20180145KJ), and a startup grant from Jilin University. This work was also partially supported by the Bioknow MedAI Institute (BMCPP-2018-001), the High-Performance Computing Center of Jilin University, and the Fundamental Research Funds for the Central Universities, JLU.

Insightful comments from the editor-in-chief and the two

References (50)

  • R.L. Siegel et al.

    Cancer statistics, 2021

    CA: Canc. J. Clinc.

    (2021)
  • C.E. DeSantis et al.

    Breast cancer statistics

    CA: Canc. J. Clinc.

    (2019)
  • S.J. Henley et al.

    Annual report to the nation on the status of cancer, part I: national cancer statistics

    Cancer

    (2020)
  • R. Yang et al.

    Trends in cancer incidence and mortality rates in the United States from 1975 to 2016

    Ann. Transl. Med.

    (2020)
  • I. Carr

    Lymphatic metastasis

    Canc. Metastasis Rev.

    (1983)
  • S. Ran et al.

    Lymphatic endothelial cell progenitors in the tumor microenvironment

    Adv. Exp. Med. Biol.

    (2020)
  • S.D. Nathanson et al.

    Breast cancer metastasis through the lympho-vascular system

    Clin. Exp. Metastasis

    (2018)
  • L.F. Oñate-Ocaña et al.

    Treatment of the adenocarcinoma of the esophagogastric junction at a single institution in Mexico

    Ann. Surg Oncol.

    (2007)
  • H. Morita et al.

    Histopathological predictor for regional lymph node metastasis in gastric cancer

    Virchows Arch. : Int. J. Pathol.

    (2009)
  • N. Xu et al.

    Development and validation of hub genes for lymph node metastasis in patients with prostate cancer

    J. Cell Mol. Med.

    (2020)
  • S.A. Vantyghem et al.

    A new model for lymphatic metastasis: development of a variant of the MDA-MB-468 human breast cancer cell line that aggressively metastasizes to lymph nodes

    Clin. Exp. Metastasis

    (2005)
  • B. Cady

    Regional lymph node metastases; a singular manifestation of the process of clinical metastases in cancer: contemporary animal research and clinical reports suggest unifying concepts

    Ann. Surg Oncol.

    (2007)
  • R.R. White et al.

    Long-term survival in 2,505 patients with melanoma with regional lymph node metastasis

    Ann. Surg.

    (2002)
  • T. Zhu et al.

    Molecular background of the regional lymph node metastasis of gastric cancer

    Oncol. Lett.

    (2018)
  • K.A. Hoadley et al.

    Cell-of-Origin patterns dominate the molecular classification of 10,000 tumors from 33 types of cancer

    Cell

    (2018)
  • Cited by (7)

    • Deep learning-based transcription factor activity for stratification of breast cancer patients

      2022, Biochimica et Biophysica Acta - Gene Regulatory Mechanisms
      Citation Excerpt :

      The transcription factor activity patterns have been widely used to characterize the functional status of transcriptional regulatory circuits [5] and genomic aberrations in many types of tumors in recent years [6–9]. In addition, transcription factor activity was used to characterize driver somatic mutations and identify new markers of drug response in tumors [10–12]. Thus, transcription factor activities have attracted more and more attention in the area of tumor research [13].

    View all citing articles on Scopus
    View full text