Skip to main content

Advertisement

Log in

An integrated network topology and deep learning model for prediction of Alzheimer disease candidate genes

  • Application of soft computing
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Alzheimer’s disease (AD) is a neurological illness that causes short-term memory loss. There are currently no viable therapeutic therapies for this condition that can cure it. The source of Alzheimer’s disease is unknown. However, genetic factors are thought to have a role in the illness’s development, with about 70% of the disease’s risk attributed to the vast number of genes associated. Despite discovering several potential AD susceptibility genes through genetic association studies, there is a more significant challenge to identify unidentified AD-associated genes and drug targets to gain a good insight into the disease-causing mechanisms of Alzheimer’s disease and develop effective AD therapeutics. The proposed DC-GC (Degree Centrality- Graph Colouring) model brings an accuracy of 96% for (Artificial Neural Network) ANN model, 87.3% for KNN (K-Nearest Neighbourhood classifier) classifier, 86% for SVM (Support Vector Machine) classifier, 85.3% than Decision Tree. It is visible; the network topology model performs well for ANN classifier than other existing models. Similarly, the model also brings a sensitivity measure of 97% for the ANN model, 84% for KNN (K-Nearest Neighbourhood classifier), 84.2% for SVM (Support Vector Machine) classifier and 84% for the Decision tree classifier. In this research work, a novel network topology measure DC-GC (Degree Centrality- Graph Colouring) and intelligent-based machine learning models are used for identifying candidate genes from protein–protein interaction and sequence features of genes. The integrated method helps to identify the target gene for Alzheimer’s disease by evaluating the connectivity between the genes and the physicochemical properties of the genes. The approach helps to rank the genes according to the property that adjacency genes should not share the same colour. The DC-GC (Degree Centrality-Graph Colouring)-based network topology measure provides remarkable improvement over existing centrality measures. The integration of network topology measure with the SVM (Support Vector Machine) model gave promising results of 96% accuracy, 97% sensitivity, 98% specificity, 96% PPV (Positive Predictive Value), 95% NPV (Negative Predictive Value) and 97% F-score.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

The data used to support the findings of this study are available from the corresponding author upon request.

Abbreviations

AD:

Alzheimer’s disease

PPI:

Protein–protein interaction

DC:

GC degree centrality—graph colouring

DC:

Degree centrality

BC:

Betweenness centrality

CC:

Closeness centrality

EC:

Eigen vector centrality

NC:

Network centrality

SC:

Subgraph centrality

IC:

Information centrality

KNN:

K-nearest neighbourhood

SVM:

Support vector machine

PPV:

Positive predictive value

NPV:

Negative predictive value

References

  • Ahmadi M, Dashti Ahangar F, Astaraki N, Abbasi M, Babaei B (2021) FWNNet: presentation of a new classifier of brain tumor diagnosis based on fuzzy logic and the wavelet-based neural network using machine-learning methods. Comput Intell Neurosci 2021:13. https://doi.org/10.1155/2021/8542637

  • Ahmadi M, Taghavirashidizadeh A, Javaheri D, Masoumian A, Ghoushchi SJ, Pourasad Y (2022) DQRE-SCnet: a novel hybrid approach for selecting users in federated learning with deep-Q-reinforcement learning based on spectral clustering. J King Saud Univer-Comput Inf Sci 34(9):7445–7458

    Google Scholar 

  • Ali L, Khan SU, Golilarz NA, Yakubu I, Qasim I, Noor A, Nour R (2019) A feature-driven decision support system for heart failure prediction based on statistical model and Gaussian naive bayes. Comput Math Methods Med 2019:8. https://doi.org/10.1155/2019/6314328

  • Anand R, Gill KD, Mahdi AA (2014) Therapeutics of Alzheimer’s disease: past, present and future. Neuropharmacology 76:27–50

    Google Scholar 

  • Attari MYN, Ahmadi M, Ala A, Moghadamnia E (2022) RSDM-AHSnet: designing a robust stochastic dynamic model to allocating health service network under disturbance situations with limited capacity using algorithm NSGA-II and PSO. Comput Biol Med 147:105649

  • Attrill H, Gaudet P, Huntley RP, Lovering RC, Engel SR, Poux S, Van Auken KM, Georghiou G, Chibucos MC, Berardini TZ, Wood V (2019) Annotation of gene product function from high-throughput studies using the Gene Ontology. Database 2019

  • Azizi S, Soleimani R, Ahmadi M, Malekan A, Abualigah L, Dashtiahangar F (2022) Performance enhancement of an uncertain nonlinear medical robot with optimal nonlinear robust controller. Comput Biol Med 146:105567

    Google Scholar 

  • Bondi MW, Edmonds EC, Salmon DP (2017) Alzheimer’s disease: past, present, and future. J Int Neuropsychol Soc 23(9–10):818–831

    Google Scholar 

  • Boutet E, Lieberherr D, Tognolli M, Schneider M, Bairoch A (2007) Uniprotkb/swiss-prot. In: Plant bioinformatics. Humana Press, pp 89–112

  • Braak H, Braak E (1997) Diagnostic criteria for neuropathologic assessment of Alzheimer’s disease. Neurobiol Aging 18(4):S85–S88

    Google Scholar 

  • Cai YD, Liu XJ, Xu XB, Zhou GP (2001) Support vector machines for predicting protein structural class. BMC Bioinformat 2(1):1–5

    Google Scholar 

  • Capriotti E, Altman RB (2011) Improving the prediction of disease-related variants using protein three-dimensional structure. BMC Bioinformat 12(4):1–11

    Google Scholar 

  • Cheng J, Wu W, Zhang Y, Li X, Jiang X, Wei G, Tao S (2013) A new computational strategy for predicting essential genes. BMC Genom 14(1):1–13

    Google Scholar 

  • Cheng WS, Chen GY, Shih XY, Elsisi M, Tsai MH, Dai HJ (2022) Vickers hardness value test via multi-task learning convolutional neural networks and image augmentation. Appl Sci 12(21):10820

    Google Scholar 

  • Damoulas T, Girolami MA (2008) Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection. Bioinformatics 24(10):1264–1270

    Google Scholar 

  • Eftekhari M, Gheibi M, Azizi-Toupkanloo H, Hossein-Abadi Z, Khraisheh M, Fathollahi-Fard AM, Tian G (2021) Statistical optimization, soft computing prediction, mechanistic and empirical evaluation for fundamental appraisal of copper, lead and malachite green adsorption. J Ind Inf Integr 23:100219

    Google Scholar 

  • Elsisi M, Tran MQ (2021) Development of an IoT architecture based on a deep neural network against cyber attacks for automated guided vehicles. Sensors 21(24):8467

    Google Scholar 

  • Emahazion T, Feuk L, Jobs M, Sawyer SL, Fredman D, St Clair D, Prince JA, Brookes AJ (2001) SNP association studies in Alzheimer’s disease highlight problems for complex disease analysis. Trends Genet 17(7):407–413

    Google Scholar 

  • Fatima M, Pasha M (2017) Survey of machine learning algorithms for disease diagnostic. J Intell Learn Syst Appl 9(01):1

    Google Scholar 

  • Georgiou DN, Karakasidis TE, Nieto JJ, Torres A (2009) Use of fuzzy clustering technique and matrices to classify amino acids and its impact to Chou’s pseudo amino acid composition. J Theor Biol 257(1):17–26

    MATH  Google Scholar 

  • Gholizadeh H, Fathollahi-Fard AM, Fazlollahtabar H, Charles V (2022) Fuzzy data-driven scenario-based robust data envelopment analysis for prediction and optimisation of an electrical discharge machine’s parameters. Expert Syst Appl 193:116419

    Google Scholar 

  • Goecks J, Jalili V, Heiser LM, Gray JW (2020) How machine learning will transform biomedicine. Cell 181(1):92–101

    Google Scholar 

  • Goedert M, Spillantini MG (2006) A century of Alzheimer’s disease. Science 314(5800):777–781

    Google Scholar 

  • Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20(8):832–844

    Google Scholar 

  • Hwang YC, Lin CC, Chang JY, Mori H, Juan HF, Huang HC (2009) Predicting essential genes based on network and sequence analysis. Mol BioSyst 5(12):1672–1678

    Google Scholar 

  • Islam MR, Ali SM, Fathollahi-Fard AM, Kabir G (2021) A novel particle swarm optimization-based grey model for the prediction of warehouse performance. J Comput Des Eng 8(2):705–727

    Google Scholar 

  • Jeong H, Mason SP, Barabási AL, Oltvai ZN (2001) Lethality and centrality in protein networks. Nature 411(6833):41–42

    Google Scholar 

  • Khotimah BK, Miswanto M, Suprajitno H (2020) Optimization of feature selection using genetic algorithm in naïve Bayes classification for incomplete data. Int J Intell Eng Syst 13(1):334–343

    Google Scholar 

  • Koschützki D, Schreiber F (2008) Centrality analysis methods for biological networks and their application to gene regulatory networks. Gene Regul Syst Biol 2:GRSB-S702

    Google Scholar 

  • Lei X, Yang X, Fujita H (2019) Random walk based method to identify essential proteins by integrating network topology and biological characteristics. Knowl-Based Syst 167:53–67

    Google Scholar 

  • Letunic I, Copley RR, Schmidt S, Ciccarelli FD, Doerks T, Schultz J, Ponting CP, Bork P (2004) SMART 4.0: towards genomic data integration. Nucl Acids Res 32(Suppl_1):D142–D144

    Google Scholar 

  • Liu S, Liu C, Deng L (2018) Machine learning approaches for protein–protein interaction hot spot prediction: progress and comparative assessment. Molecules 23(10):2535

    Google Scholar 

  • Milo R, Shen-Orr S, Itzkovitz S, Kashtan N, Chklovskii D, Alon U (2002) Network motifs: simple building blocks of complex networks. Science 298(5594):824–827

    Google Scholar 

  • Petrova NV, Wu CH (2006) Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties. BMC Bioinformat 7(1):1–12

    Google Scholar 

  • Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, Klitgord N (2005) Towards a proteome-scale map of the human protein–protein interaction network. Nature 437(7062):1173–1178

    Google Scholar 

  • Ruhnau B (2000) Eigenvector-centrality-a node-centrality? Soc Netw 22(4):357–365

    Google Scholar 

  • Seshadri S, Fitzpatrick AL, Ikram MA, DeStefano AL, Gudnason V, Boada M, Bis JC, Smith AV, Carrasquillo MM, Lambert JC, Harold D (2010) Genome-wide analysis of genetic loci associated with Alzheimer disease. JAMA 303(18):1832–1840

    Google Scholar 

  • Tahir M, Hayat M (2017) Machine learning based identification of protein–protein interactions using derived features of physiochemical properties and evolutionary profiles. Artif Intell Med 78:61–71

    Google Scholar 

  • Tran MQ, Amer M, Abdelaziz AY, Dai HJ, Liu MK, Elsisi M (2022a) Robust fault recognition and correction scheme for induction motors using an effective IoT with deep learning approach. Measurement 207:112398

  • Tran MQ, Elsisi M, Liu MK, Vu VQ, Mahmoud K, Darwish MM, Abdelaziz AY, Lehtonen M (2022b) Reliable deep learning and IoT-based monitoring system for secure computer numerical control machines against cyber-attacks with experimental verification. IEEE Access 10:23186–23197

    Google Scholar 

  • Vassar R (2014) BACE1 inhibitor drugs in clinical trials for Alzheimer’s disease. Alzheimer’s Res Therapy 6(9):1–14

    Google Scholar 

  • Wang LH, Liu J, Li YF, Zhou HB (2004) Predicting protein secondary structure by a support vector machine based on a new coding scheme. Genome Informat 15(2):181–190

    Google Scholar 

  • Wang J, Li M, Wang H, Pan Y (2012) Identification of essential proteins based on edge clustering coefficient. IEEE/ACM Trans Comput Biol Bioinformat (TCBB) 9(4):1070–1080

    Google Scholar 

  • Wang YB, You ZH, Yang S, Yi HC, Chen ZH, Zheng K (2020) A deep learning-based method for drug-target interaction prediction based on long short-term memory neural network. BMC Med Inform Decis Mak 20(2):1–9

    Google Scholar 

  • Weller J, Budson A (2018) Current understanding of Alzheimer’s disease diagnosis and treatment. F1000Research 7

  • Xia JF, Wu M, You ZH, Zhao XM, Li XL (2010) Prediction of β-hairpins in proteins using physicochemical properties and structure information. Prot Pept Lett 17(9):1123–1128

    Google Scholar 

  • Yousef M, Nebozhyn M, Shatkay H, Kanterakis S, Showe LC, Showe MK (2006) Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier. Bioinformatics 22(11):1325–1334

    Google Scholar 

  • Yu C, Helwig EJ (2022) The role of AI technology in prediction, diagnosis and treatment of colorectal cancer. Artif Intell Rev 55(1):323–343

    Google Scholar 

  • Zeng M, Li M, Wu FX, Li Y, Pan Y (2019) DeepEP: a deep learning framework for identifying essential proteins. BMC Bioinformat 20(16):1–10

    Google Scholar 

  • Zhang W, Xu J, Li Y, Zou X (2016a) Detecting essential proteins based on network topology, gene expression data, and gene ontology information. IEEE/ACM Trans Comput Biol Bioinf 15(1):109–116

    Google Scholar 

  • Zhang X, Acencio ML, Lemke N (2016b) Predicting essential genes and proteins based on machine learning and network topological features: a comprehensive review. Front Physiol 7:75

    Google Scholar 

  • Zhang F, Song H, Zeng M, Li Y, Kurgan L, Li M (2019) DeepFunc: a deep learning framework for accurate prediction of protein functions from protein sequences and interactions. Proteomics 19(12):1900019

    Google Scholar 

  • Zhong J, Wang J, Peng W, Zhang Z, Pan Y (2013) Prediction of essential proteins based on gene expression programming. BMC Genom 14(4):1–8

    Google Scholar 

  • Zotenko E, Mestre J, O’Leary DP, Przytycka TM (2008) Why do hubs in the yeast protein interaction network tend to be essential: reexamining the connection between the network topology and essentiality. PLoS Comput Biol 4(8):e1000140

    MathSciNet  Google Scholar 

Download references

Funding

There was no financial support received from any organization for carrying out this work.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed equally for the preparation of the manuscript.

Corresponding author

Correspondence to Suresh Muthusamy.

Ethics declarations

Conflict of interest

The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. To the best of my knowledge and belief any actual, perceived or potential conflicts between my duties as an employee and my private and/or business interests have been fully disclosed in this form in accordance with the requirements of the journal.

Ethical approval

This material is the authors’ own original work, which has not been previously published elsewhere. The paper is not currently being considered for publication elsewhere. The paper reflects the authors’ own research and analysis in a truthful and complete manner.

Consent to participate

I have been informed of the risks and benefits involved, and all my questions have been answered to my satisfaction. Furthermore, I have been assured that any future questions I may have will also be answered by a member of the research team. I voluntarily agree to take part in this study.

Consent to Publish

Individuals may consent to participate in a study, but object to having their data published in a journal article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gnanadesigan, N.S., Dhanasegar, N., Ramasamy, M.D. et al. An integrated network topology and deep learning model for prediction of Alzheimer disease candidate genes. Soft Comput 27, 14189–14203 (2023). https://doi.org/10.1007/s00500-023-08390-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-08390-8

Keywords

Navigation