Abstract
Identifying the binding sites of the transcription factors (TFs) that regulate gene transcription is one of the most important and challenging problems in molecular biology and bioinformatics. Here we present an algorithm that, given a set of promoters from co-regulated genes, identifies over-represented binding sites. It uses profiles (position-specific frequency matrices) that define the sequence binding specificity of known TFs, together with matching statistics computed on a whole-genome level, which bypasses the need to define matching thresholds and/or to use homologous sequences. Preliminary tests on experimentally validated sequence sets are very promising; moreover, the same algorithm is also suitable for use with any model of TF binding specificity.
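The general idea of threshold-free over-representation can be illustrated with a small sketch. This is not the authors' algorithm, whose scoring function and statistics are not given in the abstract. The sketch assumes a log-likelihood window score, forward-strand scanning only, a z-score as the comparison statistic, and sequences at least as long as the profile. The names `PROFILE`, `window_score`, `best_match`, and `over_representation_z` are hypothetical, and the toy profile has no zero frequencies (a real matrix would need pseudocounts).

```python
import math
import statistics

# Hypothetical 4-position frequency matrix: one column of base frequencies per position.
PROFILE = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]


def window_score(profile, window):
    """Sum of log frequencies of the observed bases (assumed scoring scheme)."""
    return sum(math.log(column[base]) for column, base in zip(profile, window))


def best_match(profile, sequence):
    """Highest window score in the sequence, so no per-site cutoff is needed."""
    width = len(profile)
    return max(
        window_score(profile, sequence[start:start + width])
        for start in range(len(sequence) - width + 1)
    )


def over_representation_z(profile, gene_set, genome_promoters):
    """Z-score of the gene set's mean best score against genome-wide statistics."""
    background = [best_match(profile, seq) for seq in genome_promoters]
    mu = statistics.mean(background)
    sigma = statistics.pstdev(background)
    if sigma == 0:
        raise ValueError("background scores have zero variance")
    foreground = [best_match(profile, seq) for seq in gene_set]
    standard_error = sigma / math.sqrt(len(foreground))
    return (statistics.mean(foreground) - mu) / standard_error


if __name__ == "__main__":
    genome = ["AGCTAAAA", "TTTTTTTT", "CCCCCCCC", "GGGGGGGG"]
    co_regulated = ["AAAGCTAA", "AGCTTTTT"]
    print(over_representation_z(PROFILE, co_regulated, genome))
```

Taking the best score per promoter and comparing it with the genome-wide distribution is one simple way to avoid fixing a match threshold in advance; ranking many profiles by such a statistic would then point to candidate TFs for the gene set.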
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pavesi, G., Zambelli, F. (2007). Prediction of over Represented Transcription Factor Binding Sites in Co-regulated Genes Using Whole Genome Matching Statistics. In: Masulli, F., Mitra, S., Pasi, G. (eds) Applications of Fuzzy Sets Theory. WILF 2007. Lecture Notes in Computer Science, vol 4578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73400-0_83
DOI: https://doi.org/10.1007/978-3-540-73400-0_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73399-7
Online ISBN: 978-3-540-73400-0
eBook Packages: Computer Science, Computer Science (R0)