Abstract
Proteins able to interact with ribonucleic acids (RNA) are involved in many cellular processes. A detailed knowledge about the binding pairs is necessary to construct computational models which can avoid time consuming biological experiments. This paper addresses the creation of a model based on support vector machines and trained on experimentally validated data. The goal is the identification of RNA molecules binding specifically to a regulatory protein, called CELF1.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
AAAI Press: Fitting a mixture model by expectation maximization to discover motifs in biopolymers. AAAI Press (1994)
Auweter, S., Oberstrass, F., Allain, F.: Sequence-specific binding of single-stranded rna: is there a code for recognition? Nucleic Acid Research 34(17), 4943–4959 (2006)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: Smote: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16, 341–378 (2002)
Cheng, C.W., Chia-Yu, S., Hwang, J., Sung, T., Hsu, W.: Predicting rna-binding sites of proteins using support vector machines and evolutionary information. BMC Bioinformatics 9 (2008)
Dreyfuss, G., Kim, V.N., Kataoka, N.: Messenger-rna-binding proteins and the messages they carry. Nature Reviews Molecular Cell Biology 3, 195–205 (2002)
Green, E., Brenner, S., Regents, U.: motifbs. a program to generate dna or rna position-specific scoring matrices and to search databases of sequences with these matrices (2003), http://compbio.berkeley.edu/people/ed/motifBS.html
Gupta, A., Gribskov, M.: The role of rna sequence and structure in rna–protein interactions. Journal of Molecular Biology 409(4), 574–587 (2011)
Hafner, M., Landthaler, M., Burger, L., Khorshid, M., Hausser, J., Berninger, P., Rothballer, A., Ascano, M.J., Jungkamp, A.C., Munschauer, M., Ulrich, A., Wardle, G.S., Dewell, S., Zavolan, M., Tuschl, T.: Transcriptome-wide identification of rna-binding protein and microrna target sites by par-clip. Cell 141(1), 129–141 (2010)
Hebsgaard, S.M., Korning, P.G., Tolstrup, N., Engelbrecht, J., Rouze, P., Brunak, S.: Splice site prediction in arabidopsis thaliana pre-mrna by combining local and global sequence information. Nucleic Acid Research 24(17), 3439–3452 (1996)
Jeong, E., Chung, I.F., Miyano, S.: A neural network method for identification of rna-interacting residues in protein. Genome Informatics 15(1), 105–116 (2004)
Jones, S., Daley, D.T., Luscombe, N.M., Berman, H.M.: Protein-rna interactions: a structural analysis. Nucleic Acid Research 29(4), 943–954 (2001)
Klug, S.J., Famulok, M.: All you wanted to know about selex. Molecular Biology Reports 20(2), 97–107 (1994)
Liu, Z.P., Wu, L.Y., Wang, Y., Zhang, X.S., Chen, L.: Prediction of protein–rna binding sites by a random forest method with combined features. Bioinformatics 26(13), 1616–1622 (2010)
Maetschke, S., Yuan, Z.: Exploiting structural and topological information to improve prediction of rna-protein binding sites. BMC Bioinformatics 10(341) (2009)
Marquis, J., Paillard, L., Audic, Y., Cosson, B., Danos, O., Bec, C.L., Osborne, H.B.: Cug-bp1/celf1 requires ugu-rich sequences for high-affinity binding. Biochemical Journal 400(2), 291–301 (2006)
Mersch, B., Gepperth, A., Suhai, S., Hotz-Wagenblatt, A.: Automatic detection of exonic splicing enhancers (eses) using svms. BMC Bioinformatics 9(1), 369 (2008)
Segata, N.: Falkm-lib v1.0: a library for fast local kernel machines. Tech. rep., DISI, University of Trento, Italy (2009), Software available at http://disi.unitn.it/~segata/FaLKM-lib
Terribilini, M., Lee, J., Yan, C., Jernigan, R.L., Honavar, V., Dobbs, D.: Prediction of rna binding sites in proteins from amino acid sequences. RNA (12), 1450–1462 (2006)
Le Tonquèze, O., Gschloessl, B., Namanda-Vanderbeken, A., Legagneux, V., Paillard, L., Audic, Y.: Chromosome wide analysis of cugbp1 binding sites identifies the tetraspanin cd9 mrna as a target for cugbp1-mediated down-regulation. Biochemical and Biophysical Research Communications 394(4), 884–889 (2010)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Wang, L., Brown, J.: Bindn: a web-based tool for efficient prediction of dna and rna binding sites in amino acid sequences. Nucleic Acid Research 34, 243–248 (2006)
Zien, A., Raetsch, G., Mika, S., Schoelkopf, B., Lengauer, T., Mueller, K.R.: Engineering support vector machine kernels that recognize translation initiation sites. Bioinformatics (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Livi, C.M., Paillard, L., Blanzieri, E., Audic, Y. (2012). Identification of Regulatory Binding Sites on mRNA Using in Vivo Derived Informations and SVMs. In: Rocha, M., Luscombe, N., Fdez-Riverola, F., RodrÃguez, J. (eds) 6th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent and Soft Computing, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28839-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-28839-5_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28838-8
Online ISBN: 978-3-642-28839-5
eBook Packages: EngineeringEngineering (R0)