Abstract
This paper focuses on the comparison of two different approaches to the analysis of Single Nucleotide Polymorphism (SNP) profiles data regarding Crohn’s Disease; the first one is based on a single SNP analysis, conducted by means of classical statistical tools, to assess the correlation existing between SNP’s profile and phenotype; the second one makes use of classifiers based on Regularized Logistic Regression. The findings of the study show that the machine learning techniques adopted are able to provide statistically significant prediction accuracy of the phenotypic status of the subjects analyzed by SNP data. Moreover, they are poorly influenced by the noise embedded in the data and are suitable for genome-wide analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hugot, J.P., Chamaillard, M., Zouali, H., Lesage, S., Cezard, J.P., et al.: Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn disease. Nature 411, 599–603 (2001)
Peltekova, V.D., Wintle, R.F., Rubin, L.A., Amos, C.I., Huang, Q., et al.: Functional variants of OCTN cation transporter genes are associated with Crohn disease. Nat. Genet. 36, 471–475 (2004)
Stoll, M., Corneliussen, B., Costello, C.M., Waetzig, G.H., Mellgard, B., et al.: Genetic variation in DLG5 is associated with inflammatory bowel disease. Nat. Genet. 36, 476–480 (2004)
Duerr, R.H., Taylor, K.D., Brant, S.R., Rioux, J.D., Silverberg, M.S., et al.: A genome-wide association study identifies IL23R as an inflammatory bowel disease gene. Science 314, 1461–1463 (2006)
Yamazaki, K., McGovern, D., Ragoussis, J., Paolucci, M., Butler, H., et al.: Single nucleotide polymorphisms in TNFSF15 confer susceptibility to Crohn disease. Hum. Mol. Genet. 14, 3499–3506 (2005)
Hampe, J., Franke, A., Rosenstiel, P., Till, A., Teuber, M., et al.: A genomewide association scan of non-synonymous SNPs identifies a susceptibility variant for Crohn disease in ATG16L1. Nat. Genet. 39, 207–211 (2007)
Risch, N.J.: Searcing for genetic determinants in the new millennium. Nature 405(6788), 847–856 (2000)
Sillanpää, M.J., Auranen, K.: Replication in genetic studies of complex traits. Ann. Human. Genet. 68, 646–657 (2004)
D’Addabbo, A., Latiano, A., Palmieri, O., Maglietta, R., Annese, V., Ancona, N.: Regularized Least Squares Cancer Classifiers may predict Crohn’s disease from profiles of Single Nucleotide Polymorphisms. Ann. Hum. Genet. 71(4), 537–549 (2007)
Vapnik, V.: Statistical Learning Theory. John Wiley & Sons, Inc., Chichester (1998)
Subramanian, A., Tamayo, P., et al.: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102(43), 15545–15550 (2005)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning - Data mining, Inference and Prediction. Springer Series in Statistics, pp. 95–99 (2001)
Mukherjee, S., Tamayo, P., Rogers, S., et al.: Estimating dataset size requirements for classifying dna microarray data. J. Comp. Biol. 10, 119–142 (2003)
Good, P.: Permutation tests: a practical guide to resampling methods for testing hypothesis. Springer, Heidelberg (1994)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Colella, R., D’Addabbo, A., Latiano, A., Palmieri, O., Annese, V., Ancona, N. (2008). Prediction of Crohn’s Disease by Profiles of Single Nucleotide Polymorphisms. In: Lovrek, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2008. Lecture Notes in Computer Science(), vol 5179. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85567-5_70
Download citation
DOI: https://doi.org/10.1007/978-3-540-85567-5_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85566-8
Online ISBN: 978-3-540-85567-5
eBook Packages: Computer ScienceComputer Science (R0)