Abstract
In the field of bioinformatics, the classification of tumors is a difficult and time-consuming task. When diagnosing cancer, gene expression levels are typically one of the most useful tools. However, the biological noise present in microarray data leads to unsatisfactory precision and accuracy. The utilization of thousands of genes in the process of diagnosing tumors is an important task. The two levels of feature selection have been proposed in order to determine the genes that are the most informative to diagnose cancer. Using three different statistical methods, the first level of selection reveals the prognostic genes. In the second level, the differential evolution algorithm considers the prognostic genes that were obtained from statistical measures as initial members to identify the most relevant features. The scaling factor in the modified differential evolution algorithm was made to vary in a dynamic manner in order to evolve the mutant member of the population. The proposed model is a hybrid of statistical approach and evolutionary computation with modified differential evolution algorithm that identifies the candidate genes from thousands of genes from gene expression data. The findings obtained through this hybrid approach upon testing five gene expression datasets provide evidence that it has outperformed when compared to the existing systems for DLBCL outcome, prostate outcome, prostate, and colon tumor datasets with improved classification accuracies of 14%, 4%, 0.62%, and 0.13%, respectively.
Similar content being viewed by others
Availability of datasets
The gene expression datasets used in this work are available in Open Repositories.
References
Sung, H., Ferlay, J., Siegel, R.L., Laversanne, M., Soerjomataram, I., Jemal, A., et al.: Global cancer Statistics 2020: GLOBOCAN Estimates of incidence and mortality worldwide for 36 cancers in 185 countries. Cancer J. Clinic. 71, 209–249 (2021)
Manceau, Cécile., Fromont, Gaëlle., Beauval, Jean-Baptiste., Barret, Eric, Brureau, Laurent, Créhange, Gilles, Dariane, Charles, et al.: Biomarker in active surveillance for prostate cancer: a systematic review. Cancers 13(17), 4251 (2021)
Nyberg, Tommy, Tischkowitz, Marc, Antoniou, Antonis C.: BRCA1 and BRCA2 pathogenic variants and prostate cancer risk: systematic review and meta-analysis. British J. Cancer 126(7), 1067–1081 (2022)
Wiebringhaus, R., Pecoraro, M., Neubauer, H.A., Trachtová, K., Trimmel, B., Wieselberg, M., Pencik, J., et al.: Proteomic Analysis Identifies NDUFS1 and ATP5O as Novel Markers for Survival Outcome in Prostate Cancer. Cancers 13(23), 6036 (2021)
Meng, Jialin, Guan, Yu., Wang, Bijun, Chen, Lei, Chen, Junyi, Zhang, Meng, Liang, Chaozhao: Risk subtyping and prognostic assessment of prostate cancer based on consensus genes. Commun. Biology 5, 233 (2022)
Bundy, Joseph L., Judson, Richard, Williams, Antony J., Grulke, Chris, Shah, Imran: Predicting molecular initiating events using chemical target annotations and gene expression. BioData Min. 15(7), 1–27 (2022)
Vijaya Lakshmi, T.R., Sastry, P.N., Rajinikanth, T.V.: Feature selection to recognize text from palm leaf manuscripts. Signal, Image and Video process. 12(2), 223–229 (2018)
Gunavathi, C., Premalatha, K.: Performance analysis of genetic algorithm with kNN and SVM for feature selection in tumor classification. Int. J. Comput. Electr. Automat. Control Informat. Eng. 8(08), 1490–1497 (2019)
Wang, X., Gotoh, O.: Accurate molecular classification of cancer using simple rules. BMC Med. Genom. 2, 64 (2009)
Wang, X., Simon, R.: Microarray-based cancer prediction using single Genes. BMC Bioinformat. 12, 391 (2011)
Chandra, B., Gupta, M.: An efficient statistical feature selection for classification of gene expression data. J. Biomed. Informat. 44, 529–535 (2011)
Alonso, G.C.J., Moro-Sancho, I.Q., Simon-Hurtado, A., Varela- Arrabal, R.: Microarray gene expression classification with few genes: criteria to combine attribute selection and classification methods. Expert Syst. Appl. 39, 7270–7280 (2018)
Huang, Qinghua, Huang, Q., Huang, X., Kong, Z., Li, X., Tao, D.: Bi-phase evolutionary searching for biclusters in gene expression data. IEEE Trans. Evolut. Computat. 23(5), 803–814 (2018)
Cheng, Qing, Butler, William, Zhou, Yinglu, Hong Zhang, Lu., Tang, Kathryn Perkinson, Chen, Xufeng, McCall, Shannon J., Inman, Brant A., Huang, Jiaoti: Pre-existing castration-resistant prostate cancer-like cells in primary prostate cancer promote resistance to hormonal therapy. European urology 81(5), 446–455 (2022)
Alsadoon, Abeer, Al-Naymat, Ghazi, Alsadoon, Omar Hisham, Prasad, P. W. C. DDV: A Taxonomy for deep learning methods in detecting prostate cancer. Neul. Process. Lett. 53(4), 2665–2685 (2021)
Lakshmi, T.V.: Reduction of features to identify characters from degraded historical manuscripts. Alex. Eng. J. 57(4), 2393–2399 (2018)
Vijaya Lakshmi, T.R., Sastry, P.N., Rajinikanth, T.V.: Feature optimization to recognize Telugu handwritten characters by implementing DE and PSO techniques. In: Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications. Springer, Singapore, pp. 397–405 (2017)
Castillo, T., Jose, M., Arif, Muhammad, Starmans Martijn, P.A., Niessen, Wiro J., Bangma, Chris H., Schoots, Ivo G., Veenland, Jifke F.: Classification of clinically significant prostate cancer on multi-parametric MRI: a validation study comparing deep learning and radiomics. Cancers. 14(1), 12 (2021)
Wei, Ziwei, et al.: Deep learning-based multi-omics integration robustly predicts relapse in prostate cancer. Front. Oncology. 12, 109 (2022)
Funding
No funding was provided to carry out this work.
Author information
Authors and Affiliations
Contributions
T.R.Vijaya Lakshmi planned the experiments, designed the two level computational framework and took the lead in writing the manuscript. Ch.Venkata Krishna Reddy has carried out the simulations, aided in interpreting the results and worked on the proof outline.
Corresponding author
Ethics declarations
Conflict of Interest
The authors have no relevant financial or non-financial interests to disclose. No funds, grants, or other support was received for conducting this study. The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Vijaya Lakshmi, T.R., Krishna Reddy, C.V. Cancer prediction with gene expression profiling and differential evolution. SIViP 17, 1855–1861 (2023). https://doi.org/10.1007/s11760-022-02396-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02396-9