Abstract
In this work we describe a rule-based classifier (DEQAR-CC), which employs a combination of selected rules after a two-phase training process, and without the need of a previous discretization for the numerical variables. It was compared in the application to a real imbalanced dataset regarding the toxicity during and after radiation therapy for prostate cancer. In this comparison with other predictive methods (rule-based, artificial neural networks, trees, Bayesian and logistic regression), DEQAR-CC showed a better global prediction performance than the rest of classifiers, in an evaluation regarding several performance measures and by using cross-validation. Finally, it was employed to obtain a predictive model for genitourinary toxicity, obtaining an interpretable classification scheme which simply combines two rules with two variables.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proceedings of ACM SIGMOD ICMD, pp. 207–216 (1993)
Zhang, M., He, C.: Survey on association rules mining algorithms. In: Luo, Q. (ed.) Advancing Computing, Communication, Control and Management. Lecture Notes in Electrical Engineering, vol. 56, pp. 111–118. Springer, Berlin (2010). https://doi.org/10.1007/978-3-642-05173-9_15
Wrobel, S.: An algorithm for multi-relational discovery of subgroups. In: Komorowski, J., Zytkow, J. (eds.) PKDD 1997. LNCS, vol. 1263, pp. 78–87. Springer, Heidelberg (1997). https://doi.org/10.1007/3-540-63223-9_108
Grosskreutz, H., Rüping, S.: On subgroup discovery in numerical domains. Data Min. Knowl. Disc. 19(2), 210–226 (2009)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York (2009)
Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: Proceedings of Fourth International Conference on Knowledge Discovery and Data Mining, pp. 80–86 (1998)
Sun, Y., Wong, A.K.C., Kamel, M.S.: Classification of imbalanced data: a review. Int. J. Pattern Recognit. Artif. Intell. 23(4), 687–719 (2009)
Domínguez-Olmedo, J.L., Mata, J., Pachón, V., Lopez Guerra, J.L.: A rule-based method applied to the imbalanced classification of radiation toxicity. In: Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies, vol. 5, HEALTHINF, pp. 147–155 (2018)
Ferlay, J., et al.: Cancer incidence and mortality patterns in Europe: estimates for 40 countries in 2012. Eur. J. Cancer 49(6), 1374–1403 (2013)
Domínguez-Olmedo, J.L., Mata Vázquez, J.: Comparison of standard discretization with a new method for quantitative association rules. IEEE Lat. Am. Trans. 14(4), 1879–1885 (2016)
Domínguez-Olmedo, J.L., Mata, J., Pachón, V., Maña, M.J.: A deterministic approach to association rule mining without attribute discretization. In: Snasel, V., Platos, J., El-Qawasmeh, E. (eds.) ICDIPC 2011. CCIS, vol. 188, pp. 140–150. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22389-1_13
Domínguez-Olmedo, J.L., Mata Vázquez, J.: Obtaining significant and interpretable rules for subgroup discovery tasks. IEEE Lat. Am. Trans. 15(10), 2012–2016 (2017)
Domínguez-Olmedo, J.L., Vázquez, J.M., Pachón, V.: Deterministic extraction of compact sets of rules for subgroup discovery. In: Jackowski, K., Burduk, R., Walkowiak, K., Woźniak, M., Yin, H. (eds.) IDEAL 2015. LNCS, vol. 9375, pp. 138–145. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24834-9_17
Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: Proceedings of the Fifteenth International Conference on Machine Learning, pp. 144–151 (1998)
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Parallel Distributed Processing: Explorations in the Microstructure of Cognition, pp. 318–362. MIT Press, Cambridge (1986)
Quinlan, R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo (1993)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Pearl, J.: Bayesian networks: a model of self-activated memory for evidential reasoning. UCLA Computer Science Department (1985)
John, G.H., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, San Mateo, pp. 338–345 (1995)
Le-Cessie, S., van Houwelingen, J.C.: Ridge estimators in logistic regression. Appl. Stat. 41(1), 191–201 (1992)
Lopez Guerra, J.L., et al.: Ethnic difference in risk of toxicity in prostate cancer patients treated with dynamic arc radiation therapy. Tumori 101(4), 461–468 (2015)
Cohen, J.: A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 20(1), 37–46 (1960)
Matthews, B.W.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta (BBA)-Protein Structure 405(2), 442–451 (1975)
Arlot, S., Celisse, A.: A survey of cross-validation procedures for model selection. Stat. Surv. 4, 40–79 (2010)
Frank, E., Hall, M.A., Witten, I.H.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington (2016)
Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Int. J. Mach. Learn. Technol. 2(1), 37–63 (2011)
Acevedo-Henao, C.M., Lopez, J.L., Matute, R., Azinovic, I.: Image-guided radiation therapy based on helical tomotherapy in prostate cancer: minimizing toxicity. Oncol. Res. Treat. 37(6), 324–330 (2014)
Ahmed, A.A., et al.: A novel method for predicting late genitourinary toxicity after prostate radiation therapy and the need for age-based risk-adapted dose constraints. Int. J. Radiat. Oncol. Biol. Phys. 86(4), 709–715 (2013)
Acknowledgments
The research presented in this paper was partially funded by the Regional Government of Andalusia (Junta de Andalucía) under grant number TIC-7629 and Spanish Ministry of Education and Science (Grant Number: TIN2009-14057-C03-03).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Domínguez-Olmedo, J.L., Mata, J., Pachón, V., Lopez Guerra, J.L. (2019). Application of a Rule-Based Classifier to Data Regarding Radiation Toxicity in Prostate Cancer Treatment. In: Cliquet Jr., A., et al. Biomedical Engineering Systems and Technologies. BIOSTEC 2018. Communications in Computer and Information Science, vol 1024. Springer, Cham. https://doi.org/10.1007/978-3-030-29196-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-29196-9_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29195-2
Online ISBN: 978-3-030-29196-9
eBook Packages: Computer ScienceComputer Science (R0)