Abstract
This study leverages microarray data together with statistical and machine learning techniques to investigate the best set of biomarkers in diagnosing multiple sclerosis (MS). In this work to build an automated system to detect MS two phases are approached. The first phase which is of emphasis is to reduce the dimension of features space and select the most discriminative set of features; biomarkers for MS diagnosis. Principal Component Analysis (PCA) was used as a dimension reduction method. Meanwhile, various feature selection methods were used (Fisher score, chi-square, relief, and MRMR). The second phase of this work is the classification phase, where the output of the first phase were assessed. This phase comprises three different classifiers: LDA, SVM, and KNN. Relief feature selection method achieved 100% accuracy with KNN, using 38 differentially expressed genes (DEGs). Out of this set of these DEGs, Key biomarker genes were identified by studying the gene annotation for all. The genes MIF, PTGES3, CYLD, and JAK1 are realized to be associated with immune and neurological functions. Which is of great relevance to MS.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ropper, A.H., Samuels, M.A., Klein, J.P.: Multiple Sclerosis and Allied Demyelinative Disease. The McGraw-Hill Companies, New York (2014)
The Multiple Sclerosis International Federation (MSIF). Mapping Multiple Sclerosis around the World Key Epidemiology Findings, Atlas of MS, 3rd ed. London, UK (2020). www.atlasofms.org. Accessed 1 Dec 2020
Sadovnick, A.D., Baird, P.A.: Sex ratio in offspring of patients with multiple sclerosis. N. Engl. J. Med. 306(18), 1114–1115 (1982)
van der Mei, I.A., et al.: Past exposure to sun, skin phenotype, and risk of multiple sclerosis: case–control study. BMJ 327(7410), 316 (2003). https://doi.org/10.1136/bmj.327.7410.316
Birnbaum, G.: Making the diagnosis of multiple sclerosis. Adv. Neurol. 98, 111–124 (2006)
Tang, Y., et al.: Gene expression in blood changes rapidly in neutrophils and monocytes after ischemic stroke in humans: a microarray study. J. Cereb. Blood Flow Metab. 26, 1089–1102 (2006)
Lublin, F.D., Reingold, S.C.: Defining the clinical course of multiple sclerosis: results of an international survey. Neurology 46(4), 907–911 (1996)
Keller, A., et al.: Multiple sclerosis: microRNA expression profiles accurately differentiate patients with relapsing-remitting disease from healthy controls. PLoS ONE 4(10), e7440 (2009)
Calcagno, G., et al.: A multilayer perceptron neural network-based approach for the identification of responsiveness to interferon therapy in multiple sclerosis patients. Inf. Sci. 180(21), 4153–4163 (2010)
Ratzer, R., et al.: Gene expression analysis of relapsing–remitting, primary progressive and secondary progressive multiple sclerosis. Mult. Scler. J. 19(14), 1841–1848 (2013)
Zhao, C., Deshwar, A.G., Morris, Q.: Relapsing-remitting multiple sclerosis classification using elastic net logistic regression on gene expression data. Syst. Biomed. 1(4), 247–253 (2013)
Guo, P., Zhang, Q., Zhu, Z., Huang, Z., Li, K.: Mining gene expression data of multiple sclerosis. PLoS ONE 9(6), e100052 (2014)
DeMarshall, C., et al.: Autoantibodies as diagnostic biomarkers for the detection and subtyping of multiple sclerosis. J. Neuroimmunol. 309, 51–57 (2017)
Ponce de Leon-Sanchez, E.R., Dominguez-Ramirez, O.A., Herrera-Navarro, A.M., Rodriguez-Resendiz, J., Paredes-Orta, C., Mendiola-Santibañez, J.D.: A deep learning approach for predicting multiple sclerosis. Micromachines 14(4), 749 (2023)
Ikram, S.T., Cherukuri, A.K.: Intrusion detection model using fusion of chi-square feature selection and multi-class SVM. . King Saudi Saud University Comput. Inf. Sci (2016). https://doi.org/10.1016/j.jksuci.2015.12.004
Rachburee, N., Punlumjeak, W.: A comparison of feature selection approach between greedy, IG-ratio, Chi-square, and mRMR in educational mining. In: 2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE). IEEE (2015)
Wu, D., Guo, S.Z.: An improved fisher score feature selection method and its application. Chinese J. Liaoning Tech. Univ. 38(5), 472–479 (2019)
Kira, K., Rendell, L.: The feature selection problem: traditional methods and a new algorithm. In: Proceedings of AAAI-92 (1992)
Alcaraz, J., Labbé, M., Landete, M.: Support vector machine with feature selection: a multiobjective approach. Expert Syst. Appl. 204, 117485 (2022)
Araveeporn, A., Banditvilai, S.: A classification study in high-dimensional data of linear discriminant analysis and regularized discriminant analysis. WSEAS Trans. Math. 22, 315–323 (2023)
Ayyad, S.M., Saleh, A.I., Labib, L.M.: Gene expression cancer classification using modified K-Nearest Neighbors technique. Biosystems 176, 41–51 (2019)
Nickles, D., et al.: Blood RNA profiling in a large cohort of multiple sclerosis patients and healthy controls. Hum. Mol. Genet. 22(20), 4194–4205 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ahmed, A.D., Hadhoud, M.M.A., Ghoneim, V.F. (2024). Automatic Detection of Multiple Sclerosis Using Genomic Expression. In: Mosbah, M., et al. Advances in Model and Data Engineering in the Digitalization Era. MEDI 2023. Communications in Computer and Information Science, vol 2071. Springer, Cham. https://doi.org/10.1007/978-3-031-55729-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-031-55729-3_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55728-6
Online ISBN: 978-3-031-55729-3
eBook Packages: Computer ScienceComputer Science (R0)