Abstract
Huntington’s disease is a type of neurodegenerative disease caused by gene HTT. To date, its molecular pathogenesis is still unclear. Clinically, behavior, cognitive, and mental function are affected progressively. With the rapid development of sequencing technologies, it is possible to explore the molecular mechanisms at the genome-wide transcriptomic level using computational methods. Our previous studies have shown that it is difficult to distinguish disease genes from non-disease genes. To understand the molecular pathogenesis under complex clinical phenotypes during the disease progression, it is better to identify biomarkers corresponding to different disease stage. Therefore, in this study, we designed a label propagation based semi-supervised feature selection approach (LPFS) to identify disease-associated genes corresponding to different clinical phenotypes. LPFS selects disease-associated genes corresponding to different disease stage through the alternative iteration of label propagation clustering and feature selection. We then conducted an enrichment analysis to understand gene functions and affected pathways during the disease progression, thus to decode the changes in individual behavioral and mental characteristics during neurodegenerative disease progression at the gene expression level. Our results have shown that LPFS performs better in comparison with the-state-of-art methods. We found that TGF-beta signaling pathway, olfactory transduction, cytokine-cytokine receptor interaction, immune response, and inflammatory response were gradually affected during the disease progression. In addition, we found that the expression of Ccdc33, Capsl, Al662270, and Dlgap5 were seriously changed caused by the development of the disease.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ross, C.A., et al.: Huntington disease: natural history, biomarkers and prospects for therapeutics. Nat. Rev. Neurol. 10(4), 204 (2014)
Appel, S.H., Smith, R.G., Le, W.D.: Immune-mediated cell death in neurodegenerative disease. Adv. Neurol. 69, 153–159 (1996)
Hardy, J.: Pathways to primary neurodegenerative disease. In: Mayo Clinic Proceedings, pp. 835–837. Elsevier (1999)
Gammon, K.: Neurodegenerative disease: brain windfall. Nature 515(7526), 299–300 (2014)
Seredenina, T., LuthiCarter, R.: What have we learned from gene expression proles in huntington’s disease? Neurobiol. Dis. 45(1), 83–98 (2012)
Wang, X., Huang, T., Bu, G., Xu, H.: Dysregulation of protein tracking in neurodegeneration. Mol. Neurodegeneration 9(1), 1–9 (2014)
Diglia, M., et al.: Aggregation of huntingtin in neuronal intranuclear inclusions and dystrophic neurites in brain. Science 277(5334), 1990–1993 (1997)
Waldvogel, H.J., Kim, E.H., Thu, D.C., Tippett, L.J., Faull, R.L.: New perspectives on the neuropathology in huntington’s disease in the human brain and its relation to symptom variation. J. Huntington’s Dis. 1(2), 143–153 (2012)
Ideker, T., Ozier, O., Schwikowski, B., et al.: Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics 18(suppl. 1), S233 (2002)
Jiang, X., Zhang, H., Duan, F., Quan, X.: Identify huntington’s disease associated genes based on restricted boltzmann machine with rna-seq data. BMC Bioinf. 18(1), 447 (2017)
Jiang, X., Zhang, H., Zhang, Z., Quan, X.: Flexible non-negative matrix factorization to unravel disease-related genes. IEEE/ACM Trans. Comput. Biol. Bioinf. 1(99), 1–11 (2018)
Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Robinson, M.D., Smyth, G.K.: Moderated statistical tests for assessing differences in tag abundance. Bioinformatics 23(21), 2881–2887 (2007)
Robinson, M.D., McCarthy, D.J., Smyth, G.K.: edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26(1), 139–140 (2010)
Ritchie, M.E., et al.: LIMMA powers differential expression analyses for rna-sequencing and microarray studies. Nucleic Acids Res. 43(7), 47 (2015)
Hong, F., Breitling, R.: A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments. Bioinformatics 24(3), 374–382 (2008)
Ding, C., Zhou, D., He, X., Zha, H.: R1-PCA: rotational invariant L1-norm principal component analysis for robust subspace factorization. In: International Conference on Machine Learning, pp. 281–288 (2006)
Liu, H., Shao, M., Fu, Y.: Consensus guided unsupervised feature selection. In: Proceedings of the Association for the Advancement of Artificial Intelligence, Phoenix, AZ, USA, pp. 12–17, February 2016
Langfelder, P., et al.: Integrated genomics and proteomics de ne huntingtin cag length-dependent networks in mice. Nat. Neurosci. 19(4), 623 (2016)
Huang, D.W., Sherman, B.T., Lempicki, R.A.: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4(1), 44–57 (2009)
Huang, D.W., Sherman, B.T., Lempicki, R.A.: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37(1), 1–13 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, X., Chen, M., Wang, W., Song, W., Lin, G.N. (2019). Label Propagation Based Semi-supervised Feature Selection to Decode Clinical Phenotype of Huntington’s Disease. In: Huang, DS., Bevilacqua, V., Premaratne, P. (eds) Intelligent Computing Theories and Application. ICIC 2019. Lecture Notes in Computer Science(), vol 11643. Springer, Cham. https://doi.org/10.1007/978-3-030-26763-6_51
Download citation
DOI: https://doi.org/10.1007/978-3-030-26763-6_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26762-9
Online ISBN: 978-3-030-26763-6
eBook Packages: Computer ScienceComputer Science (R0)