Abstract
The objective of the present paper is to model the genetic influence on prostate cancer using multivariate adaptive regression splines (MARS) and artificial neural networks (ANNs) as classification techniques. These models are able to classify subjects that have cancer according to the values of the proteins from the genes that the models select as most relevant. Subjects are selected as cases and controls from the MCC-Spain database and represent a heterogeneous group. The MARS models allow the selection of a set of the most relevant proteins from the database. These models were trained with nine different degrees and chosen according to their performance and complexity. The ANN models were trained on data restricted to the most significant variables. The performance of both types of models was analyzed in terms of the area under the receiver operating characteristic curve (AUC). The ANN technique resulted in a model with an AUC of 0.62006, while for the MARS technique the value was 0.569312 in the best case. Thus, the artificial neural network model obtained can determine whether a patient suffers from prostate cancer significantly better than the MARS models and with a high rate of success. The best model presented was based on support vector machines, reaching an AUC of 0.65212.
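As an illustration of the performance measure used above, the following minimal sketch computes the area under the ROC curve as the probability that a randomly chosen case receives a higher score than a randomly chosen control (the Mann-Whitney formulation). The function name `roc_auc` and the toy labels and scores are assumptions made for this example; they are not the authors' code and are not drawn from the MCC-Spain data.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise case/control comparisons.

    labels: iterable of 0 (control) / 1 (case).
    scores: model outputs, higher meaning "more likely a case".
    A tie between a case and a control counts as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data for illustration only.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]
toy_auc = roc_auc(toy_labels, toy_scores)  # 7 of 9 case/control pairs ranked correctly
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of cases from controls, which is the scale on which the values reported in the abstract should be read.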



Ethics declarations
Conflict of interest
The authors report no conflicts of interest in this work.
Cite this article
Sánchez Lasheras, J.E., González Donquiles, C., García Nieto, P.J. et al. A methodology for detecting relevant single nucleotide polymorphism in prostate cancer with multivariate adaptive regression splines and backpropagation artificial neural networks. Neural Comput & Applic 32, 1231–1238 (2020). https://doi.org/10.1007/s00521-018-3503-4
DOI: https://doi.org/10.1007/s00521-018-3503-4