Abstract
This work addresses a central question in medical genetics – the distinction between disease-causing SNPs and neutral variations. Unlike previous studies that focused mainly on coding SNPs, our efforts were centered around variations in regulatory regions and specifically within transcription factor (TF) binding sites. We have compiled a comprehensive collection of genome wide TF binding sites and developed computational measures to estimate the effects of binding site variations on the expression profiles of the regulated genes. Applying these measures to binding sites of known TFs, we were able to make predictions that were in line with published experimental evidence and with structural data on DNA-protein interactions. We attempted to generalize the properties of expression-altering substitutions by accumulating statistics from many substitutions across multiple binding sites. We found that in the yeast genome substitutions that abolish a G or a C are on average more severe than substitutions that abolish an A or a T. This may be attributed to the low GC content of the yeast genome, in which G and C may be important for conferring specificity. We found additional factors that are correlated with the severity of a substitution. Such factors can be integrated in order to create a set of rules for the prioritization of regulatory SNPs according to their disease-causing potential.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ng, P.C., Henikoff, S.: Predicting deleterious amino acid substitutions. Genome Res. 11, 863–874 (2001)
Ng, P.C., Henikoff, S.: Accounting for human polymorphisms predicted to affect protein function. Genome Res. 12, 436–446 (2002)
Ng, P.C., Henikoff, S.: SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 31, 3812–3814 (2003)
Wang, Z., Moult, J.: SNPs, protein structure, and disease. Hum. Mutat. 17, 263–270 (2001)
Sunyaev, S., Ramensky, V., Bork, P.: Towards a structural basis of human non-synonymous single nucleotide polymorphisms. Trends Genet. 16, 198–200 (2000)
Sunyaev, S., Ramensky, V., Koch, I., Lathe 3rd, W., Kondrashov, A.S., Bork, P.: Prediction of deleterious human alleles. Hum. Mol. Genet. 10, 591–597 (2001)
Vitkup, D., Sander, C., Church, G.M.: The amino-acid mutational spectrum of human genetic disease. Genome Biol. 4, R72 (2003)
Prokunina, L., Castillejo-Lopez, C., Oberg, F., Gunnarsson, I., Berg, L., Magnusson, V., Brookes, A.J., Tentler, D., Kristjansdottir, H., Grondal, G., Bolstad, A.I., Svenungsson, E., Lundberg, I., Sturfelt, G., Jonssen, A., Truedsson, L., Lima, G., Alcocer-Varela, J., Jonsson, R., Gyllensten, U.B., Harley, J.B., Alarcon-Segovia, D., Steinsson, K., Alarcon-Riquelme, M.E.: A regulatory polymorphism in PDCD1 is associated with susceptibility to systemic lupus erythematosus in humans. Nat. Genet. 32, 666–669 (2002)
Zwarts, K.Y., Clee, S.M., Zwinderman, A.H., Engert, J.C., Singaraja, R., Loubser, O., James, E., Roomp, K., Hudson, T.J., Jukema, J.W., Kastelein, J.J., Hayden, M.R.: ABCA1 regulatory variants influence coronary artery disease independent of effects on plasma lipid levels. Clin. Genet. 61, 115–125 (2002)
Rockman, M.V., Wray, G.A.: Abundant raw material for cis-regulatory evolution in humans. Mol. Biol. E 19, 1991–2004 (2002)
Shalgi, R., Lapidot, M., Shamir, R., Pilpel, Y.: A catalog of stability-associated sequence elements in 3’ UTRs of yeast mRNAs. Genome Biol. 6, R86 (2005)
Harbison, C.T., Gordon, D.B., Lee, T.I., Rinaldi, N.J., Macisaac, K.D., Danford, T.W., Hannett, N.M., Tagne, J.B., Reynolds, D.B., Yoo, J., Jennings, E.G., Zeitlinger, J., Pokholok, D.K., Kellis, M., Rolfe, P.A., Takusagawa, K.T., Lander, E.S., Gifford, D.K., Fraenkel, E., Young, R.A.: Transcriptional regulatory code of a eukaryotic genome. Nature 431, 99–104 (2004)
Pilpel, Y., Sudarsanam, P., Church, G.M.: Identifying regulatory networks by combinatorial analysis of promoter elements. Nat. Genet. 29, 153–159 (2001)
Sudarsanam, P., Pilpel, Y., Church, G.M.: Genome-wide Co-occurrence of Promoter Elements Reveals a cis-Regulatory Cassette of rRNA Transcription Motifs in Saccharomyces cerevisiae. Genome Res. 12, 1723–1731 (2002)
Lapidot, M., Pilpel, Y.: Comprehensive quantitative analyses of the effects of promoter sequence elements on mRNA transcription. Nucleic Acids Res. 31, 3824–3828 (2003)
Benjamini, Y., Hochberg, Y.: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. Roy. Stat. Soc. 57, 289–300 (1995)
Lamoureux, J.S., Stuart, D., Tsang, R., Wu, C., Glover, J.N.: Structure of the sporulation-specific transcription factor Ndt80 bound to DNA. Embo. J. 21, 5721–5732 (2002)
Pierce, M., Benjamin, K.R., Montano, S.P., Georgiadis, M.M., Winter, E., Vershon, A.K.: Sum1 and Ndt80 proteins compete for binding to middle sporulation element sequences that control meiotic gene expression. Mol. Cell Biol. 23, 4814–4825 (2003)
Elkon, R., Linhart, C., Sharan, R., Shamir, R., Shiloh, Y.: Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res. 13, 773–780 (2003)
ExpressDB: http://salt2.med.harvard.edu/ExpressDB/
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Lapidot, M., Pilpel, Y. (2007). Characterization of the Effects of TF Binding Site Variations on Gene Expression Towards Predicting the Functional Outcomes of Regulatory SNPs. In: Eskin, E., Ideker, T., Raphael, B., Workman, C. (eds) Systems Biology and Regulatory Genomics. RSB RRG 2005 2005. Lecture Notes in Computer Science(), vol 4023. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-48540-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-48540-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48293-2
Online ISBN: 978-3-540-48540-7
eBook Packages: Computer ScienceComputer Science (R0)