Skip to main content

A Robust Machine Learning Protocol for Prediction of Prostate Cancer Survival at Multiple Time-Horizons

  • Conference paper
  • First Online:
Computational Science – ICCS 2023 (ICCS 2023)

Abstract

Prostate cancer is one of the leading causes of cancer death in men in Western societies. Predicting patients’ survival using clinical descriptors is important for stratification in the risk classes and selecting appropriate treatment. Current work is devoted to developing a robust Machine Learning (ML) protocol for predicting the survival of patients with metastatic castration-resistant prostate cancer. In particular, we aimed to identify relevant factors for survival at various time horizons. To this end, we built ML models for eight different predictive horizons, starting at three and up to forty-eight months. The model building involved the identification of informative variables with the help of the MultiDimensional Feature Selection (MDFS) algorithm, entire modelling procedure was performed in multiple repeats of cross-validation. We evaluated the application of 5 popular classification algorithms: Random Forest, XGBoost, logistic regression, k-NN and naive Bayes, for this task. Best modelling results for all time horizons were obtained with the help of Random Forest. Good prediction results and stable feature selection were obtained for six horizons, excluding the shortest and longest ones. The informative variables differ significantly for different predictive time horizons. Different factors affect survival rates over different periods, however, four clinical variables: ALP, LDH, HB and PSA, were relevant for all stable predictive horizons. The modelling procedure that involves computationally intensive multiple repeats of cross-validated modelling, allows for robust prediction of the relevant features and for much-improved estimation of uncertainty of results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abdallah, K., Hugh-Jones, C., Norman, T., Friend, S., Stolovitzky, G.: The prostate cancer dream challenge: a community-wide effort to use open clinical trial data for the quantitative prediction of outcomes in metastatic prostate cancer (2015)

    Google Scholar 

  2. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)

    Article  Google Scholar 

  3. Chen, T., Guestrin, C.: XGBoost: a scalable tree boosting system, pp. 785–794 (2016). https://doi.org/10.1145/2939672.2939785

  4. Costello, J., Stolovitzky, G.: Seeking the wisdom of crowds through challenge-based competitions in biomedical research. Clini. Pharmacol. Ther. 93 (2013). https://doi.org/10.1038/clpt.2013.36

  5. Fernández-Delgado, M., et al.: Do we need hundreds of classifiers to solve real world classification problems. J. Mach. Learn. Res. 15(1), 3133–3181 (2014)

    MathSciNet  Google Scholar 

  6. Fizazi, K., et al.: Phase iii, randomized, placebo-controlled study of docetaxel in combination with Zibotentan in patients with metastatic castration-resistant prostate cancer. J. Clin. Oncol. 31(14), 1740–1747 (2013)

    Article  Google Scholar 

  7. Grönberg, H.: Prostate cancer epidemiology. Lancet 361(9360), 859–864 (2003)

    Article  Google Scholar 

  8. Guinney, J., et al.: Prediction of overall survival for patients with metastatic castration-resistant prostate cancer: development of a prognostic model through a crowdsourced challenge with open clinical trial data. Lancet Oncol. 18(1), 132–142 (2017). https://doi.org/10.1016/S1470-2045(16)30560-5. https://www.sciencedirect.com/science/article/pii/S1470204516305605

  9. Halabi, S., et al.: Prognostic model for predicting survival in men with hormone-refractory metastatic prostate cancer. J. Clin. Oncol. Official J. Am. Soc. Clin. Oncol. 21, 1232–1237 (2003). https://doi.org/10.1200/JCO.2003.06.100

  10. Halabi, S., et al.: Updated prognostic model for predicting overall survival in first-line chemotherapy for patients with metastatic castration-resistant prostate cancer. J. Clin. Oncol. Official J. Am. Soc. Clin. Oncol. 32 (2014). https://doi.org/10.1200/JCO.2013.52.3696

  11. He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009)

    Article  Google Scholar 

  12. Mahapatra, S., et al.: Global methylation profiling for risk prediction of prostate cancer. Clin. Cancer Res. 18(10), 2882–2895 (2012). https://doi.org/10.1158/1078-0432.CCR-11-2090

    Article  Google Scholar 

  13. Matthews, B.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochem. Biophys. Acta. 405(2), 442–451 (1975)

    Google Scholar 

  14. Mnich, K., Rudnicki, W.R.: All-relevant feature selection using multidimensional filters with exhaustive search. Inf. Sci. 524, 277–297 (2020). https://doi.org/10.1016/j.ins.2020.03.024

    Article  MathSciNet  Google Scholar 

  15. Petrylak, D., et al.: Docetaxel and prednisone with or without lenalidomide in chemotherapy-Naive patients with metastatic castration-resistant prostate cancer (mainsail): a randomised, double-blind, placebo-controlled phase 3 trial. Lancet Oncol. 16(4), 417–425 (2015)

    Article  Google Scholar 

  16. Piliszek, R., et al.: MDFS: multidimensional feature selection in R. R J. (2019). https://doi.org/10.32614/RJ-2019-019

  17. Tannock, I.F., et al.: Aflibercept versus placebo in combination with docetaxel and prednisone for treatment of men with metastatic castration-resistant prostate cancer (VENICE): a phase 3, double-blind randomised trial. Lancet Oncol. 14(8), 760–768 (2013)

    Article  Google Scholar 

  18. Yu, M., et al.: Individual prediction in prostate cancer studies using a joint longitudinal survival-cure model. J. Am. Stat. Assoc. 103(481), 178–187 (2008)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wojciech Lesiński .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lesiński, W., Rudnicki, W.R. (2023). A Robust Machine Learning Protocol for Prediction of Prostate Cancer Survival at Multiple Time-Horizons. In: Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2023. ICCS 2023. Lecture Notes in Computer Science, vol 10475. Springer, Cham. https://doi.org/10.1007/978-3-031-36024-4_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-36024-4_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-36023-7

  • Online ISBN: 978-3-031-36024-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics