Skip to main content
Log in

Predicting mortality of cancer patients using artificial intelligence, patient data and blood tests

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Several authors have shown that hematological parameters can be used to detect poor prognosis in patients with cancer. Thus, such features could be used in artificial intelligence (AI-based) models to predict mortality among these patients. This work aimed to develop and compare several AI-based models to predict the prognosis (death vs. survival) of cancer in patients using blood tests and patient data as inputs. At total, 908 cancer patients were assisted in a prospective study. Four artificial intelligence models were compared: artificial neural networks (ANN), supporting vector machines (SVM), decision trees and neuro-fuzzy networks. Also, four different input strategies were tested, considering the use of 49, 45, 22 and 14 inputs. The results of this study showed that the ANN and the SVM presented the best results, using 45 inputs. The ANN was the best model since it presented better statistical values for the positive (death) and negative (survival) classes. The use of blood parameters as inputs for AI-based models could be used to predict death in patients with cancer, and this methodology can be expanded to other diseases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The data are not publicly available because they contain information that could compromise the privacy of research participants.

References

  1. Huang S, Yang J, Fong S et al (2020) Artificial intelligence in cancer diagnosis and prognosis: opportunities and challenges. Cancer Lett 471:61–71

    Google Scholar 

  2. Ferlay JEM, Lam F, Colombet M, Mery L, Piñeros M, Znaor A, Soerjomataram I, Bray F (2020) Global cancer observatory: cancer today. International Agency for Research on Cancer, Lyon

    Google Scholar 

  3. de Oliveira Santos M, de Lima FCS, Martins LFL et al (2023) Estimativa de incidência de câncer no Brasil, 2023–2025. Rev Bras Cancerol 69:e213700

    Google Scholar 

  4. Batlle JF, Pinto A, Basterretxea L et al (2020) Development and validation of an early death risk score for older patients treated with chemotherapy for cancer. J Clin Oncol 38:12030–12030

    Google Scholar 

  5. Bibault J-E, Hancock S, Buyyounouski MK et al (2021) Development and validation of an interpretable artificial intelligence model to predict 10-year prostate cancer mortality. Cancers 13:3064

    Google Scholar 

  6. Karnes RJ, Choeurng V, Ross AE et al (2018) Validation of a genomic risk classifier to predict prostate cancer-specific mortality in men with adverse pathologic features. Eur Urol 73:168–175

    Google Scholar 

  7. Paredes-Aracil E, Palazón-Bru A, Folgado-de la Rosa DM et al (2017) A scoring system to predict breast cancer mortality at 5 and 10 years. Sci Rep 7:415

    Google Scholar 

  8. Soares M, Fontes F, Dantas J et al (2004) Performance of six severity-of-illness scores in cancer patients requiring admission to the intensive care unit: a prospective observational study. Crit Care (Lond Engl) 8:R194–R203

    Google Scholar 

  9. Williams C, Brunskill S, Altman D et al (2006) Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy. Health Technol Assess (Winch Engl) 10:1–204

    Google Scholar 

  10. Biswas N, Uddin KMM, Rikta ST et al (2022) A comparative analysis of machine learning classifiers for stroke prediction: a predictive analytics approach. Healthc Anal 2:100116

    Google Scholar 

  11. Mishra A, Ashraf MZ (2020) Using artificial intelligence to manage thrombosis research, diagnosis, and clinical management. In: Seminars in thrombosis and hemostasis. Thieme Medical Publishers, pp 410–418

  12. Kumar Y, Koul A, Singla R et al (2022) Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Humaniz Comput 14:8459–8486

    Google Scholar 

  13. Alhazmi A, Alhazmi Y, Makrami A et al (2021) Application of artificial intelligence and machine learning for prediction of oral cancer risk. J Oral Pathol Med 50:444–450

    Google Scholar 

  14. Bhardwaj P, Bhandari G, Kumar Y et al (2022) An Investigational approach for the prediction of gastric cancer using artificial intelligence techniques: a systematic review. Arch Comput Methods Eng 29:4379–4400

    Google Scholar 

  15. Khanagar SB, Naik S, Al Kheraif AA et al (2021) Application and performance of artificial intelligence technology in oral cancer diagnosis and prediction of prognosis: a systematic review. Diagnostics 11:1004

    Google Scholar 

  16. Qureshi TA, Javed S, Sarmadi T et al (2022) Artificial intelligence and imaging for risk prediction of pancreatic cancer. Chin Clin Oncol 11:1

    Google Scholar 

  17. Yang Y, Zhao Y, Liu X et al (2022b) Artificial intelligence for prediction of response to cancer immunotherapy. Semin Cancer Biol 87:137–147

    Google Scholar 

  18. Park K, Ali A, Kim D et al (2013) Robust predictive model for evaluating breast cancer survivability. Eng Appl Artif Intell 26:2194–2205

    Google Scholar 

  19. He T, Li J, Wang P et al (2022) Artificial intelligence predictive system of individual survival rate for lung adenocarcinoma. Comput Struct Biotechnol J 20:2352–2359

    Google Scholar 

  20. Li X, Zhai Z, Ding W et al (2022) An artificial intelligence model to predict survival and chemotherapy benefits for gastric cancer patients after gastrectomy development and validation in international multicenter cohorts. Int J Surg 105:106889

    Google Scholar 

  21. Chen J, Li Y, Cui H (2021) Preoperative low hematocrit is an adverse prognostic biomarker in ovarian cancer. Arch Gynecol Obstet 303:767–775

    Google Scholar 

  22. Peng F, Hu D, Lin X et al (2017) The monocyte to red blood cell count ratio is a strong predictor of postoperative survival in colorectal cancer patients: the Fujian prospective investigation of cancer (FIESTA) study. J Cancer 8:967

    Google Scholar 

  23. Xie X, Yao M, Chen X et al (2015) Reduced red blood cell count predicts poor survival after surgery in patients with primary liver cancer. Medicine 94:e577

    Google Scholar 

  24. Zhang X, Zhang F, Qiao W et al (2018) Low hematocrit is a strong predictor of poor prognosis in lung cancer patients. BioMed Res Int 2018:6804938

    Google Scholar 

  25. Syed-Abdul S, Firdani R-P, Chung H-J et al (2020) Artificial intelligence based models for screening of hematologic malignancies using cell population data. Sci Rep 10:4583

    Google Scholar 

  26. Yavuz E, Eyupoglu C (2020) An effective approach for breast cancer diagnosis based on routine blood analysis features. Med Biol Eng Comput 58:1583–1601

    Google Scholar 

  27. Martins TD, Annichino-Bizzacchi JM, Romano AVC et al (2020) Artificial neural networks for prediction of recurrent venous thromboembolism. Int J Med Inform 141:104221

    Google Scholar 

  28. Ottaiano GY, Annichino-Bizzacchi JM, Filho RM et al (2021) Development of neuro-fuzzy networks for venous thromboembolism recurrence prediction. In: Virtual congress of the international society on thrombosis and haemostasis Philadelphia

  29. Carvalho LP, Colella MP, Annichino-Bizzacchi JM et al (2020) Artificial neural network for prediction of hemorrhagic severity in patients with immune thrombocytopenia purpura. In: Virtual congress of the international society on thrombosis and haemostasis—ISTH 2020, Milão

  30. Martins TD, Martins SD, Montalvao SAL et al (2021a) Combining artificial neural networks and blood tests to diagnose covid-19 infection. In: Virtual congress of the international society on thrombosis and haemostasis, Philadelphia

  31. Martins TD, Martins SD, Montalvao SAL et al (2021b) Artificial neural networks to predict covid-19 progression of moderate hospitalized patients using early clinical parameters and blood tests. In: Virtual congress of the international society on thrombosis and haemostasis, Philadelphia

  32. Martins TD, Martins SD, Montalvão S, Al Bannoud M, Ottaiano GY, Silva LQ, Huber SC, Diaz TSP, Wroclawski C, Filho CC, Maciel-Filho R, Annichino-Bizzacchi JM (2024) Combining artificial neural networks and hematological data to diagnose Covid-19 infection in Brazilian population. Neural Comput Appl 36:4387–4399. https://doi.org/10.1007/s00521-023-09312-3

  33. Hotelling H (1933a) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417

    Google Scholar 

  34. Hotelling H (1933b) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:498–520

    Google Scholar 

  35. Härdle W, Simar L (2007) Applied multivariate statistical analysis, 2nd edn. Springer, Berlin

    Google Scholar 

  36. Wold H (1966) Estimation of principal components and related models by iterative least squares. In: Krishnajah PR (ed) Multivariate analysis. Academic Press, NewYork, pp 391–420

    Google Scholar 

  37. Abdi H (2010) Partial least squares regression and projection on latent structure regression (PLS regression). Wiley Interdiscip Rev Comput Stat 2:97–106

    Google Scholar 

  38. Xu Y, Hu W, Yang Z et al (2016) A multivariate partial least squares approach to joint association analysis for multiple correlated traits. Crop J 4:21–29

    Google Scholar 

  39. De Jong S (1993) SIMPLS: an alternative approach to partial least squares regression. Chemom Intell Lab Syst 18:251–263

    Google Scholar 

  40. Haykin S (2005) Neural networks—a comprehensive foundation. Prentice Hall, Delhi

    Google Scholar 

  41. Marquardt DW (1963) An algorithm for least-squares estimation of nonlinear parameters. J Soc Ind Appl Math 11:431–441

    MathSciNet  Google Scholar 

  42. Beale E (1972) A derivation of conjugate gradients. In: Numerical methods for nonlinear optimization, pp 39–43

  43. Powell MJD (1964) An efficient method for finding the minimum of a function of several variables without calculating derivatives. Comput J 7:155–162

    MathSciNet  Google Scholar 

  44. Riedmiller M, Braun H (1992) RPROP-a fast adaptive learning algorithm. In: Proceedings of ISCIS VII. Citeseer

  45. Møller MF (1993) A scaled conjugate gradient algorithm for fast supervised learning. Neural Netw 6:525–533

    Google Scholar 

  46. Rokach L (2008) Data mining with decision trees: theory and applications. World Scientific, Singapore

    Google Scholar 

  47. Breiman L, Friedman J, Olshen R et al (1984) Cart. Classification and regression trees. Chapman and Hall/CRC, New York

    Google Scholar 

  48. Czogala E, Leski J (2012) Fuzzy and neuro-fuzzy intelligent systems. Physica-Verlag, Heidelberg

    Google Scholar 

  49. Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp 249–256

  50. Rakitianskaia A, Engelbrecht A (2015) Measuring saturation in neural networks. In: 2015 IEEE symposium series on computational intelligence. IEEE, pp 1423–1430

  51. Mandrekar JN (2010) Receiver operating characteristic curve in diagnostic test assessment. J Thorac Oncol 5:1315–1316

    Google Scholar 

  52. Monaghan TF, Rahman SN, Agudelo CW et al (2021) Foundational statistical principles in medical research: sensitivity, specificity, positive predictive value, and negative predictive value. Medicina 57:503

    Google Scholar 

  53. Skaik YAE-W (2008) Understanding and using sensitivity, specificity and predictive values. Indian J Ophthalmol 56:341

    Google Scholar 

  54. Jun TJ, Kang S-J, Lee J-G et al (2019) Automated detection of vulnerable plaque in intravascular ultrasound images. Med Biol Eng Comput 57:863–876

    Google Scholar 

  55. Xiong X-l, Zhang R-x, Bi Y et al (2019) Machine learning models in type 2 diabetes risk prediction: results from a cross-sectional retrospective study in Chinese adults. Curr Med Sci 39:582–588

    Google Scholar 

  56. Dahal KR, Gautam Y (2020) Argumentative comparative analysis of machine learning on coronary artery disease. Open J Stat 10:694–705

    Google Scholar 

  57. Artoni F, Martelli D, Monaco V et al (2016) Principal component analysis can decrease neural networks performance for incipient falls detection: a preliminary study with hands and feet accelerations. In: 2016 38th Annual international conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE. pp 6194–6197

  58. O’Donoghue J, Roantree M, McCarren A (2017) Detecting feature interactions in agricultural trade data using a deep neural network. In: Bellatreche L, Chakravarthy S (eds) Big data analytics and knowledge discovery. Springer, Cham, pp 449–458

    Google Scholar 

  59. Yang Y, Xu L, Sun L et al (2022a) Machine learning application in personalised lung cancer recurrence and survivability prediction. Comput Struct Biotechnol J 20:1811–1820

    Google Scholar 

  60. Wang Y, Ji C, Wang Y et al (2021) Predicting postoperative liver cancer death outcomes with machine learning. Curr Med Res Opin 37:629–634

    Google Scholar 

  61. Nazari M, Shiri I, Zaidi H (2021) Radiomics-based machine learning model to predict risk of death within 5-years in clear cell renal cell carcinoma patients. Comput Biol Med 129:104135

    Google Scholar 

Download references

Acknowledgements

São Paulo Research Foundation (FAPESP)—(#2016/14172-6).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tiago D. Martins.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This study was approved by the ethical committee of the University of Campinas (CAAE: 87212317.7.0000.5404).

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 23 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Martins, T.D., Maciel-Filho, R., Montalvão, S.A.L. et al. Predicting mortality of cancer patients using artificial intelligence, patient data and blood tests. Neural Comput & Applic 36, 15599–15616 (2024). https://doi.org/10.1007/s00521-024-09915-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-024-09915-4

Keywords