Skip to main content

Analysis of SNP-Complex Disease Association by a Novel Feature Selection Method

  • Conference paper
  • First Online:
Operations Research Proceedings 2010

Part of the book series: Operations Research Proceedings ((ORP))

Abstract

Selecting a subset of SNPs (Single Nucleotide Polymorphism pronounced snip) that is informative and small enough to conduct association studies and reduce the experimental and analysis overhead has become an important step toward effective disease-gene association studies. In this study, we developed a novel methods for selecting Informative SNP subsets for greater association with complex disease by making use of methods of machine learning. We constructed an integrated system that makes use of major public databases to prioritize SNPs according to their functional effects and finds SNPs that are closely associated with genes which are proven to be associated with a particular complex disease. This helped us gain insights for understanding the complex web of SNPs and gene interactions and integrate as much as possible of the molecular level data relevant to the mechanisms that link genetic variation and disease. We tested the validity and accuracy of developed model by applying it to real life case control data set and got promising results. We hope that results of this study will support timely diagnosis, personalized treatments, and targeted drug design, through facilitating reliable identification of SNPs that are involved in the etiology of complex diseases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. V. Bafna, B.V. Halldorsson, R. Schwartz, A.G. Clark, and S. Istrail. Haplotypes and informative SNP selection algorithms: don’t block out information. In Proc RECOMB’ 03, pages 19–27, 2006.

    Google Scholar 

  2. M. C. Byng, J. C. Whittaker, A. P. Cuthbert, C. G. Mathew, and C. M. Lewis. SNP subset selection for genetic association studies. Annals of Human Genetics, 67: 543–556, 2003.

    Article  Google Scholar 

  3. C.S. Carlson, M.A. Eberle, M.J. Rieder, Q. Yi, L. Kruglyak, and D.A. Nickerson. Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet., 74: 106–120, 2004.

    Article  Google Scholar 

  4. M. J. Daly, J. D. Rioux, S. F. Schaffner, T. J. Hudson, and E. S. Lander. Highresolution haplotype structure in the human genome. Nature Genetics, 29(2): 229–232, 2001.

    Article  Google Scholar 

  5. V. Dinu, H. Zhao, and P.L. Miller. Integrating domain knowledge with statistical and data mining methods for high-density genomic SNP disease association analysis. J. of Biomedical Informatics, 40(6): 750–760, 2007.

    Article  Google Scholar 

  6. E. Halperin, G. Kimmel, and R. Shamir. Tag SNP selection in genotype data for maximizing SNP prediction accuracy. Bioinformatics, 21(1): i195–i203, 2005.

    Article  Google Scholar 

  7. B. Horne and N. J. Camp. Principal component analysis for selection of optimal SNP-sets that capture intragenic genetic variation. Genetic Epidemiology, 26(11): 11–21, 2004.

    Article  Google Scholar 

  8. P. H. Lee and H. Shatkay. BNTagger: Improved Tagging SNP Selection using Bayesian Networks. ISMB 2006 Supplement of Bioinformatics, 22(14): e211–219, 2006.

    Google Scholar 

  9. N. Patil, A. J. Berno, D. A. Hinds, W. A. Barrett, J. M. Doshi, C. R. Hacker, C. R. Kautzer, D. H. Lee, C. Marjoribanks, and D. P. McDonough. Blocks of limited haplotype diversity revealed by high resolution scanning of human chromosome 21. Science, 294: 1719–1722, 2001.

    Article  Google Scholar 

  10. S.F. Saccone, R. Bolze, P. Thomas, J. Quan, G. Mehta, E. Deelman, J.A. Tischfield JA, and J.P. Rice. SPOT: a web-based tool for using biological databases to prioritize SNPs after a genome-wide association study. Nucleic Acids Research (Epub ahead of print), 2010.

    Google Scholar 

  11. K. Zhang, M. Deng, T. Chen, M. S. Waterman, and F. Sun. A dynamic programming algorithm for haplotype block partitioning. In In Proceedings of the National Academy of Sciences (PNAS), volume 99, pages 7335–7339, 2002.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to G. Üstünkar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Üstünkar, G., Özöğür-Akyüz, S., Weber, GW., Son, Y.A. (2011). Analysis of SNP-Complex Disease Association by a Novel Feature Selection Method. In: Hu, B., Morasch, K., Pickl, S., Siegle, M. (eds) Operations Research Proceedings 2010. Operations Research Proceedings. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20009-0_4

Download citation

Publish with us

Policies and ethics