Abstract
Several authors have shown that hematological parameters can be used to detect poor prognosis in patients with cancer. Such features could therefore serve as inputs to artificial intelligence (AI)-based models that predict mortality among these patients. This work aimed to develop and compare several AI-based models to predict the prognosis (death vs. survival) of patients with cancer, using blood tests and patient data as inputs. In total, 908 cancer patients were included in a prospective study. Four AI models were compared: artificial neural networks (ANN), support vector machines (SVM), decision trees and neuro-fuzzy networks. Four input strategies were also tested, using 49, 45, 22 and 14 inputs. The ANN and the SVM gave the best results, both with 45 inputs. The ANN was the best model overall because it presented better statistical values for the positive (death) and negative (survival) classes. Blood parameters used as inputs for AI-based models could therefore predict death in patients with cancer, and this methodology can be extended to other diseases.
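As an illustration of what comparing models on both the positive (death) and negative (survival) classes can involve, the sketch below computes per-class statistics from a binary confusion matrix. The abstract does not name the statistics used, so sensitivity, specificity, and the positive and negative predictive values are assumed here; the function name `binary_class_metrics` and the toy labels are hypothetical and are not data from the study.

```python
from typing import Dict, Sequence


def binary_class_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Per-class statistics for an outcome coded 1 = death, 0 = survival."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # deaths predicted as deaths
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # survivors predicted as survivors
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # survivors predicted as deaths
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # deaths predicted as survivors

    def ratio(num: int, den: int) -> float:
        # Return NaN instead of raising when a class is absent.
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),  # recall on the death class
        "specificity": ratio(tn, tn + fp),  # recall on the survival class
        "ppv": ratio(tp, tp + fp),          # precision on the death class
        "npv": ratio(tn, tn + fn),          # precision on the survival class
        "accuracy": ratio(tp + tn, len(pairs)),
    }


# Hypothetical toy labels for illustration only (not study data).
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]
metrics = binary_class_metrics(y_true, y_pred)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Reporting sensitivity and specificity side by side matters when the two outcomes are imbalanced, because a model can reach high accuracy while performing poorly on the less frequent class.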


Data availability
The data are not publicly available because they contain information that could compromise the privacy of research participants.
Acknowledgements
São Paulo Research Foundation (FAPESP), #2016/14172-6.
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This study was approved by the ethical committee of the University of Campinas (CAAE: 87212317.7.0000.5404).
About this article
Cite this article
Martins, T.D., Maciel-Filho, R., Montalvão, S.A.L. et al. Predicting mortality of cancer patients using artificial intelligence, patient data and blood tests. Neural Comput & Applic 36, 15599–15616 (2024). https://doi.org/10.1007/s00521-024-09915-4
DOI: https://doi.org/10.1007/s00521-024-09915-4