Abstract
In a computational medical model, diagnosis is the classification of disease status in the terms of abnormal or positive, normal or negative or intermediate stages. Different Machine learning techniques such as artificial neural networks (ANNs) are extensively and successfully used in disease diagnosis. However, there is no single classifier that can solve all classification problems. Selecting an optimal classifier for a problem is difficult, and it has become a relevant subject in the area. This paper focuses on grammar-based automatic programming (GAP) to build optimized discriminant functions for medical data classification in any arbitrary language. These techniques have an implicit power of automatic feature selection and feature extraction. This work carries out an in-depth investigation of the use of different GAP algorithms in the medical data classification problem. The objective is to identify the benefits and limitations of algorithms of this nature in the current problem. Classical classifiers were also considered for comparison purposes. Fourteen medical data sets were used in the experiments, and seven performance measures such as accuracy, sensitivity, specificity, precision, geometric-mean, F-measure, and false-positive rate are used to evaluate the performance of the produced classifier. The multiple criteria decision analysis (MCDA) demonstrates that GAP approaches are able to produce suitable classifiers for a given problem, and the GS performs better than other classical classifiers in medical data classification.















Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Assunçao F, Lourenço N, Machado P, Ribeiro B (2017) Automatic generation of neural networks with structured grammatical evolution. In: 2017 IEEE congress on evolutionary computation (CEC). IEEE, pp 1557–1564
Asuncion A, Newman D (2007) Uci machine learning repository. University of California, School of Information and Computer Science, Irvine, CA. http://archive.ics.uci.edu/ml/datasets.php
Barandela R, Sánchez JS, Garca V, Rangel E (2003) Strategies for learning in class imbalance problems. Pattern Recogn 36(3):849–851
Brameier M, Banzhaf W (2001) A comparison of linear genetic programming and neural networks in medical data mining. IEEE Trans Evolut Comput 5(1):17–26
Cerri R, Barros RC, de Carvalho AC, Freitas AA (2013) A grammatical evolution algorithm for generation of hierarchical multi-label classification rules. In: 2013 IEEE congress on evolutionary computation. IEEE, pp 454–461
Chareka T, Pillay N (2016) A study of fitness functions for data classification using grammatical evolution. In: 2016 Pattern recognition association of South Africa and robotics and mechatronics international conference (PRASA-RobMech). IEEE, pp. 1–4
Contreras, I., Bertachi, A., Biagi, L., Vehí, J., Oviedo, S.: Using grammatical evolution to generate short-term blood glucose prediction models. In: KHD@ IJCAI, pp 91–96
Dehuri S, Roy R, Cho SB, Ghosh A (2012) An improved swarm optimized functional link artificial neural network (iso-flann) for classification. J Syst Softw 85(6):1333–1345
Elleuch S, Jarboui B (2018) Variable neighborhood programming for evolving discriminent functions with dynamic thresholds. In: 2018 international conference on machine learning and cybernetics (ICMLC), vol 1. IEEE, pp 263–268
Espejo PG, Ventura S, Herrera F (2009) A survey on the application of genetic programming to classification. IEEE Trans Syst, Man, Cybern, Part C (Appl Rev) 40(2):121–144
Gray H, Maxwell R, Martinez-Perez I, Arus C, Cerdan S (1996) Genetic programming for classification of brain tumours from nuclear magnetic resonance biopsy spectra. Genet Progr 424:1–6
Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier, Amsterdam
Hope D, Munday E, Smith S (2007) Evolutionary algorithms in the classification of mammograms. In: 2007 IEEE symposium on computational intelligence in image and signal processing. IEEE, pp 258–265
Karaboga D (2005) An idea based on honey bee swarm for numerical optimization. In: Technical Report-TR06, Erciyes University. Engineering Faculty, Computer Engineering Department, Kayseri/Türkiye
Kou G, LU Y, Peng Y, Shi Y (2012) Evaluation of classification algorithms using mcdm and rank correlation. Int J Inf Technol Decis Mak 11:197–225. https://doi.org/10.1142/S0219622012500095
Lennartsson D, Nordin P (2004) A genetic programming method for the identification of signal peptides and prediction of their cleavage sites. EURASIP J Adv Signal Process 1:153697
Lin JY, Ke HR, Chien BC, Yang WP (2008) Classifier design with feature selection and feature extraction using layered genetic programming. Expert Syst Appl 34(2):1384–1393
López-Vázquez G, Ornelas-Rodriguez M, Espinal A, Soria-Alcaraz JA, Rojas-Domínguez A, Puga-Soberanes H, Carpio JM (2019) Rostro-Gonzalez H (2019) Evolutionary spiking neural networks for solving supervised classification problems. Comput Intell Neurosci 2019:4182639. https://doi.org/10.1155/2019/4182639
Luz EJS, Schwartz WR, Cámara-Chávez G, Menotti D (2016) Ecg-based heartbeat classification for arrhythmia detection: a survey. Comput Methods Progr Biomed 127:144–164
Mckay RI, Hoai NX, Whigham PA, Shan Y, O’neill M (2010a) Grammar-based genetic programming: a survey. Genet Progr Evolvable Mach 11(3–4):365–396
McKay RI, Hoai NX, Whigham PA, Shan Y, O’Neill M (2010b) Grammar-based genetic programming: a survey. Genet Progr Evolvable Mach 11:365–396
Motsinger-Reif AA, Deodhar S, Winham SJ, Hardison NE (2010) Grammatical evolution decision trees for detecting gene-gene interactions. BioData Min 3(1):8
Nag K, Pal NR (2015) A multiobjective genetic programming-based ensemble for simultaneous feature selection and classification. IEEE Trans Cybern 46(2):499–510
Neill M, Brabazon A (2004) Grammatical swarm. Genet Evolut Comput Conf 1:163–174
O’Neill M, Brabazon A (2006) Grammatical swarm: the generation of programs by social programming. Nat Comput 5(4):443–462
O’Neill M, Ryan C (2001) Grammatical evolution. IEEE Trans Evolut Comput 5(4):349–358
O’Neill M, Ryan C (2003) Grammatical evolution. In: Grammatical evolution. Springer, US, pp 33–47
Paul TK, Iba H (2006) Classification of scleroderma and normal biopsy data and identification of possible biomarkers of the disease. In: 2006 IEEE symposium on computational intelligence and bioinformatics and computational biology. IEEE, pp 1–6
Rivero D, Dorado J, Rabuñal J, Pazos A (2010) Generation and simplification of artificial neural networks by means of genetic programming. Neurocomputing 73(16–18):3200–3223
Si T (2016) Grammatical evolution using fireworks algorithm. In: Pant M, Deep K, Bansal J, Nagar A, Das K (eds) Proceedings of 5th international conference on soft computing for problem solving. Advances in intelligent systems and computing, vol 436, pp. 43–55. Springer, New York
Si T, De A, Bhattacharjee AK (2013) Grammatical bee colony. In: Panigrahi BK, Suganthan PN, Das S, Dash SS (eds) Swarm, evolutionary, and memetic computing. SEMCCO, Lecture Notes in computer science, vol 8297. Springer, New York, pp 436–445
Si T, De A, Bhattacharjee AK (2018) Segmentation of brain mri using wavelet transform and grammatical bee colony. J Circuits, Syst Comput 27(07):1850108
Si T, Dutta R (2019) Partial opposition-based particle swarm optimizer in artiżcial neural network training for medical data classiżcation. Int J Inf Technol Decis Mak 18(5):1717–1750
Si T, Sk S (2016) A comparison of grammatical bee colony and neural networks in medical data mining. Int J Comput Appl 134(6):1–4
Tan Y, Zhu Y (2010) Fireworks algorithm for optimization. International conference in swarm intelligence. Springer, New York, pp 355–364
Tharwat A (2018) Classification assessment methods. Appl Comput Inf. https://doi.org/10.1016/j.aci.2018.08.003
Triantaphyllou E (2000) Multi-criteria decision making methods: a comparative study 44. https://doi.org/10.1007/978-1-4757-3157-6
Valencia-Ramírez JM, Raya JA, Cedeno JR, Suárez RR, Escalante HJ, Graff M (2014) Comparison between genetic programming and full model selection on classification problems. In: 2014 IEEE international autumn meeting on power, electronics and computing (ROPEC). IEEE, pp 1–6
Winkler S, Affenzeller M, Wagner S (2007) Advanced genetic programming based machine learning. J Math Model Algor 6(3):455–480
Yasodha P, Ananthanarayanan N (2018) Detecting the ovarian cancer using big data analysis with effective model
Yeh YC, Wang WJ, Chiou CW (2009) Cardiac arrhythmia diagnosis method using linear discriminant analysis on ecg signals. Measurement 42(5):778–789
Zhang M, Wong P (2008) Genetic programming for medical classification: a program simplification approach. Genet Progr Evolvable Mach 9(3):229–255
Zhao H (2007) A multi-objective genetic programming approach to developing pareto optimal decision trees. Decis Support Syst 43(3):809–826
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Si, T., Miranda, P., Galdino, J.V. et al. Grammar-based automatic programming for medical data classification: an experimental study. Artif Intell Rev 54, 4097–4135 (2021). https://doi.org/10.1007/s10462-020-09949-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-020-09949-9