Abstract
Diagnosis of Parkinson's disease (PD) is a difficult undertaking that requires a variety of testing and clinical trials. Despite all these testing and trials, there is a considerable risk of misdiagnosis of Parkinson's disease, which causes a delay in making decisions about the best treatment for the patients. Computer aided diagnosis can improve the experiences and helps the doctors to predict PD patients. Four sorts of classification algorithms with various types of feature selection approaches are proposed in this paper for successful PD diagnosis. Support Vector Machine-SVM, Nave Bayes-NB, K-Nearest Neighbor-KNN, and Random Forest-RF are the categorization techniques employed. All the dataset's features are not necessary for classification. To choose the relevant characteristics, four feature selection approaches are used. Least Absolute Shrinkage and Selection Operator (LASSO), Backward-forward, rough set, and tree-based feature selection techniques are used. Experimentation is carried out using the Parkinson's Progressive Markers Initiative (PPMI) dataset. Four classification algorithms are constructed, together with and without feature selection procedures, and comparative research is conducted. To measure the performance of the classifiers, various evaluation methodologies are used. Overall Random Forest classifier with four feature selection methods’ gives best results with average accuracy is 96% and rough set feature selection for all three classifiers’ average accuracy is 97%.






Similar content being viewed by others
Data availability
Based on the request to corresponding author data will be provided.
References
Almuallim H, Dietterich TG (1991) Learning with many irrelevant features. In: Proc. AAAI-91, Anaheim, CA, pp 547–552
Beal MF (2001) Experimental models of Parkinson’s disease. Nat Rev Neurosci 2:325–334
Betarbet R, Sherer TB, Greenamyre JT (2002) Animal models of Parkinson’s disease. BioEssays 24:308–318
Blum AI, Langley P (1997) Selection of relevant features and examples in machine learning. Artif Intell 97:245–271
Bron EE, Smits M, Niessen WJ, Klein S (2015) Feature selection based on the SVM weight vector for classification of dementia. IEEE J Biomed Health Inform 19(5):1617–1626
Chandrashekar G, Sahin F (2014) A survey on feature selection methods. Comput Electr Eng 40(1):16–28
Das R (2010) A comparison of multiple classification methods for diagnosis of Parkinson disease. Expert Syst Appl 37(2):1568–1572
Dash S, Verma S, Khan MS, Wozniak M, Shafi J, Ijaz MF (2021) A hybrid method to enhance thick and thin vessels for blood vessel segmentation. Diagnostics 11(11):2017
Dash S, Verma S, Bevinakoppa S, Wozniak M, Shafi J, Ijaz MF (2022) Guidance image-based enhanced matched filter with modified thresholding for blood vessel extraction. Symmetry 14(2):194
Grifoni P, Caschera MC, Ferri F (2021) Evaluation of a dynamic classification method for multimodal ambiguities based on Hidden markov models. Evol Syst 12(2):377–395
Guo H, Zhang F, Chen J, Xu Y, Xiang J (2017) Machine learning classification combining multiple features of a hyper-network of fMRI data in Alzheimer’s disease. Front Neurosci 11:615
Gupta I, Sharma V, Kaur S, Singh AK (2022) PCA-RF: an efficient Parkinson's Disease prediction model based on random forest classification. arXiv preprint arXiv:2203.11287
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3(Mar):1157–1182
Hassanien AE, Ali JM (2004) Rough set approach for generation of classification rules of breast cancer data. Informatica 15(1):23–38
Ijaz MF, Alfian G, Syafrudin M, Rhee J (2018) Hybrid prediction model for type 2 diabetes and hypertension using DBSCAN-based outlier detection, synthetic minority over sampling technique (SMOTE), and random forest. Appl Sci 8(8):1325
Ijaz MF, Attique M, Son Y (2020) Data-driven cervical cancer prediction model with outlier detection and over-sampling methods. Sensors 20(10):2809
Khemphila A, Boonjing V (2011) Heart disease classification using neural network and feature selection. In: 2011 21st international conference on systems engineering. IEEE, pp 406–409
Khemphila A, Boonjing V (2012) Parkinsons disease classification using neural network and feature selection. World Acad Sci Eng Technol 64:15–18
Kira K, Rendell LA (1992) The feature selection problem: tradional methods and a new algorithm. In: Proc. AAAI-92, San Jose, CA, pp 122–126
Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97:273–324
Koller D, Sahami M (1996) Toward optimal feature selection. Stanford InfoLab.
Kumar Y, Koul A, Singla R, Ijaz MF (2022) Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda. J Ambient Intell Humaniz Comput:1–28
Lee SH, Lim JS (2012) Parkinson’s disease classification using gait characteristics and wavelet-based feature extraction. Expert Syst Appl 39(8):7338–7344
Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, Liu H (2017) Feature selection: a data perspective. ACM Comput Surv 50(6):1–45
Mafarja M, Mirjalili S (2018) Whale optimization approaches for wrapper feature selection. Appl Soft Comput 62:441–453
Mafarja MM, Eleyan D, Jaber I, Hammouri A, Mirjalili S (2017) Binary dragonfly algorithm for feature selection. In: 2017 International Conference on New Trends in Computing Sciences (ICTCS). IEEE, pp 12–17
Mafarja M, Aljarah I, Heidari AA, Hammouri AI, Faris H, Ala’M AZ, Mirjalili S (2018) Evolutionary population dynamics and grasshopper optimization approaches for feature selection problems. Knowl-Based Syst 145:25–45
Mandal M, Singh PK, Ijaz MF, Shafi J, Sarkar R (2021) A tri-stage wrapper-filter feature selection framework for disease classification. Sensors 21(16):5571
Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24(3):301–312
Mittal V, Sharma RK (2021) Machine learning approach for classification of Parkinson disease using acoustic features. J Reliable Intell Environ 7(3):233–239
Mostafa SA, Mustapha A, Mohammed MA, Hamed RI, Arunkumar N, Abd Ghani MK … Khaleefah SH (2019) Examining multiple feature evaluation and classification methods for improving the diagnosis of Parkinson’s disease. Cognit Syst Res 54:90–99
Nilashi M, Ibrahim O, Ahmadi H, Shahmoradi L, Farahmand M (2018) A hybrid intelligent system for the prediction of Parkinson’s Disease progression using machine learning techniques. Biocybern Biomed Eng 38(1):1–15
Novaković J (2016) Toward optimal feature selection using ranking methods and classification algorithms. Yugosl J Oper Res 21(1):119–135
Polat K (2012) Classification of Parkinson’s disease using feature weighting method on the basis of fuzzy C-means clustering. Int J Syst Sci 43(4):597–609
Prashanth R, Roy SD (2018) Early detection of Parkinson’s disease through patient questionnaire and predictive modelling. Int J Med Informatics 119:75–87
Priya SJ, Rani AJ, Subathra MSP, Mohammed MA, Damaševičius R, Ubendran N (2021) Local pattern transformation based feature extraction for recognition of Parkinson’s disease based on gait signals. Diagnostics 11(8):1395
Sharanyaa S, Renjith PN, Ramesh K (2022) An exploration on feature extraction and classification techniques for dysphonic speech disorder in Parkinson’s Disease. In: Inventive communication and computational technologies. Springer, Singapore, pp 33–48
Singh N, Pillay V, Choonara YE (2007) Advances in the treatment of Parkinson’s disease. Prog Neurobiol 81:29–44
Srinivasu PN, Ahmed S, Alhumam A, Kumar AB, Ijaz MF (2021) An AW-HARIS based automated segmentation of human liver using CT images. Comput Mater Contin 69(3):3303–3319
Srinivasu PN, SivaSai JG, Ijaz MF, Bhoi AK, Kim W, Kang JJ (2021) Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM. Sensors 21(8):2852
Sujatha J, Rajagopalan SP (2017) Performance evaluation of machine learning algorithms in the classification of Parkinson disease using voice attributes. Int J Appl Eng Res 12(21):10669–10675
Tahir NM, Manap HH (2012) Parkinson Disease gait classification based on machine learning approach. J Appl Sci Faisalabad (Faisalabad) 12:180–185
Vulli A, Srinivasu PN, Sashank MSK, Shafi J, Choi J, Ijaz MF (2022) Fine-tuned DenseNet-169 for breast cancer metastasis prediction using FastAI and 1-cycle policy. Sensors 22(8):2988
Weston J, Mukherjee S, Chapelle O, Pontil M, Poggio T, Vapnik V (2001) Feature selection for SVMs. In: Advances in neural information processing systems, pp 668–674
Win TZ, Kham NSM (2019) Information gain measured feature selection to reduce high dimensional data. Seventeenth International Conference on Computer Applications (ICCA 2019)
Yadav G, Kumar Y, Sahoo G (2012) Predication of Parkinson's disease using data mining methods: a comparative analysis of tree, statistical and support vector machine classifiers. In: 2012 national conference on computing and communication systems. IEEE, pp 1–8
Yasar A, Saritas I, Sahman MA, Cinar AC (2019) Classification of Parkinson disease data with artificial neural networks. In: IOP conference series: materials science and engineering, vol 675, no 1. IOP Publishing, pp 012031
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors whose names are listed immediately below certify that they have NO affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers’ bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hema, M.S., Maheshprabhu, R., Reddy, K.S. et al. Prediction analysis for Parkinson disease using multiple feature selection & classification methods. Multimed Tools Appl 82, 42995–43012 (2023). https://doi.org/10.1007/s11042-023-15280-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15280-6