What Do We Learn from Network-Based Analysis of Genome-Wide Association Data?

Ayati, Marzieh; Erten, Sinan; Koyutürk, Mehmet

doi:10.1007/978-3-662-45523-4_70

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8602)

Included in the following conference series:

European Conference on the Applications of Evolutionary Computation

Abstract

Network-based analyses are commonly used as powerful tools to interpret the findings of genome-wide association studies (GWAS) in a functional context. In particular, identification of disease-associated functional modules, i.e., highly connected protein-protein interaction (PPI) subnetworks with high aggregate disease association, has been shown to be promising for uncovering the functional relationships among genes and proteins associated with diseases. An important issue in this regard is the scoring of subnetworks by integrating two quantities that are not readily compatible: the disease association of individual gene products and the network connectivity among proteins. Current scoring schemes either disregard the level of connectivity and focus on the aggregate disease association of connected proteins, or use a linear combination of these two quantities. However, such scoring schemes may produce arbitrarily large subnetworks that are often not statistically significant, or require tuning of the parameters used to weigh the contributions of network connectivity and disease association. Here, we propose a parameter-free scoring scheme that scores subnetworks by assessing the disease association of pairwise interactions and incorporating the statistical significance of network connectivity and disease association. We test the proposed scoring scheme on a GWAS dataset for type II diabetes (T2D). Our results suggest that subnetworks identified by commonly used methods may fail tests of statistical significance after correction for multiple hypothesis testing. In contrast, the proposed scoring scheme yields highly significant subnetworks, which contain biologically relevant proteins that cannot be identified by analysis of genome-wide association data alone.
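
The two families of existing scoring schemes mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the parameter-free scheme proposed in the paper, whose details the abstract does not give. The toy network, the z-scores, the function names, the use of edge density as the connectivity term, and the weight `lam` are all assumptions made for illustration. The aggregate score uses the common form of summing per-protein z-scores and dividing by the square root of the subnetwork size.

```python
import math

# Hypothetical toy data: per-protein association z-scores and PPI edges.
z = {"A": 2.0, "B": 2.0, "C": 2.0, "D": 2.0}
edges = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]

def aggregate_score(z, nodes):
    """Aggregate disease association: sum of z-scores scaled by sqrt(k).
    Connectivity inside the subnetwork is ignored."""
    return sum(z[v] for v in nodes) / math.sqrt(len(nodes))

def edge_density(edges, nodes):
    """Fraction of possible protein pairs in the subnetwork that interact."""
    nodes = set(nodes)
    k = len(nodes)
    if k < 2:
        return 0.0
    internal = sum(1 for u, v in edges if u in nodes and v in nodes)
    return internal / (k * (k - 1) / 2)

def linear_score(z, edges, nodes, lam):
    """Linear combination of association and connectivity.
    lam is a tunable weight in [0, 1] (an assumption of this sketch)."""
    return (lam * aggregate_score(z, nodes)
            + (1 - lam) * edge_density(edges, nodes))

triangle = ["A", "B", "C"]      # fully connected subnetwork
larger = ["A", "B", "C", "D"]   # larger, less dense subnetwork

for lam in (0.8, 0.2):
    best = max((triangle, larger),
               key=lambda s: linear_score(z, edges, s, lam))
    print(f"lam={lam}: preferred subnetwork = {best}")
```

In this toy case the aggregate score alone favors the larger subnetwork, and the subnetwork preferred by the linear combination changes with `lam`, which is the kind of parameter sensitivity the abstract refers to.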

Author information

Authors and Affiliations

Department of Electrical Engineering and Computer Science, Case Western Reserve University, 10900 Euclid Ave., Cleveland, OH 44106, United States
Marzieh Ayati, Sinan Erten & Mehmet Koyutürk
Center for Proteomics and Bioinformatics, Case Western Reserve University, 10900 Euclid Ave., Cleveland, OH 44106, United States
Mehmet Koyutürk

Editor information

Editors and Affiliations

Depto. Estadística e Investigación, Universidad Politécnica de Valencia, Valencia, Spain
Anna I. Esparcia-Alcázar
University of Granada, Granada, Spain
Antonio M. Mora

About this paper

Cite this paper

Ayati, M., Erten, S., Koyutürk, M. (2014). What Do We Learn from Network-Based Analysis of Genome-Wide Association Data? In: Esparcia-Alcázar, A., Mora, A. (eds) Applications of Evolutionary Computation. EvoApplications 2014. Lecture Notes in Computer Science, vol 8602. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45523-4_70

DOI: https://doi.org/10.1007/978-3-662-45523-4_70
Published: 29 November 2014
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45522-7
Online ISBN: 978-3-662-45523-4
eBook Packages: Computer Science, Computer Science (R0)
