ABSTRACT
Understanding the molecular mechanisms underlying early cancer development is still a challenge. To address this, we developed an interpretable, data-driven machine learning approach to identify the gene biomarkers that predict the clinical outcomes of early cancer patients. As a demonstration, we applied this approach into large-scale pan-cancer datasets including TCGA to find out how effective it would be at identifying the developmental gene expression biomarkers across tumor stages for various cancer types. Results confirmed that artificial neural network prediction embedding nonlinear feature selection outperforms other classifiers. Moreover, and more relevant to the goal of machine learning interpretable classifiers, we found that early cancer patient groups clustered by the biomarkers selected have significantly more survival differences than ones by early TNM stages, suggesting that this method identified novel early cancer molecular biomarkers. Furthermore, using lung cancer as a study case, we leveraged the hierarchical architectures of neural network to identify the developmental regulatory networks controlling the expression of early cancer biomarkers, providing mechanistic insights of functional genomics driving the onset of cancer development. Finally, we reported the drugs targeting early cancer biomarkers, revealing potential genomic medicine affecting the early cancer development.
Index Terms
- Interpretable Machine Learning Approach Reveals Developmental Gene Expression Biomarkers for Cancer Patient Outcomes at Early Stages
Recommendations
Cancer adjuvant chemotherapy strategic classification by artificial neural network with gene expression data
Display Omitted Adjuvant chemotherapy for non-small cell lung cancer (NSCLC) is still controversial.Prediction of adjuvant chemotherapy benefit in NSCLC using ANN with gene expression data.Using gene signature profiles to predict ACT benefit in NSCLC is ...
Machine learning analysis of lung squamous cell carcinoma gene expression datasets reveals novel prognostic signatures
Abstract BackgroundLung squamous cell carcinoma (LUSC) patients are often diagnosed at an advanced stage and have poor prognoses. Thus, identifying novel biomarkers for the LUSC is of utmost importance.
MethodsMultiple datasets from the NCBI-GEO ...
Graphical abstractDisplay Omitted
Highlights- Meta-analysis of microarray gene expression profiles for developing ML-based models for LUSC prognosis.
- The end-to-end analysis pipeline in the study is usable for other disorders, thereby expanding its general applicability.
- ...
MicroRNA Expression Profiling of Pancreatic Cancer Histological Subtypes in the Identification of Potential Biomarkers for Diagnosis and Tumor Prognosis
ICBRA '23: Proceedings of the 2023 10th International Conference on Bioinformatics Research and ApplicationsPancreatic cancer (PC) is the seventh leading cause of cancer-related deaths and 12th in malignant frequency globally. It has a significantly high incidence rate in countries with high human development index (HDI) and, generally, regions able to use ...
Comments