Abstract
We contribute a novel, ball-histogram approach to DNA-binding propensity prediction of proteins. Unlike state-of-the-art methods based on constructing an ad-hoc set of features describing the charged patches of the proteins, the ball-histogram technique enables a systematic, Monte-Carlo exploration of the spatial distribution of charged amino acids, capturing joint probabilities of specified amino acids occurring in certain distances from each other. This exploration yields a model for the prediction of DNA binding propensity. We validate our method in prediction experiments, achieving favorable accuracies. Moreover, our method also provides interpretable features involving spatial distributions of selected amino acids.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ohlendorf, D.H., Matthew, J.B.: Electrostatics and flexibility in protein-DNA interactions. Advances in Biophysics 20, 137–151 (1985)
Stawiski, E.W., Gregoret, L.M., Mandel-Gutfreund, Y.: Annotating nucleic acid-binding function based on protein structure. J. Mol. Biol. (2003)
Jones, S., Shanahan, H.P., Berman, H.M., Thornton, J.M.: Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins. Nucleic Acid Research 31(24), 7189–7198 (2003)
Tsuchiya, Y., Kinoshita, K., Nakamura, H.: Structure-based prediction of DNA-binding sites on proteins using the empirical preference of electrostatic potential and the shape of molecular surfaces. Proteins: Structure, Function, and Bioinformatics 55(4), 885–894 (2004)
Ahmad, S., Sarai, A.: Moment-based prediction of DNA-binding proteins. Journal of Molecular Biology 341(1), 65–71 (2004)
Bhardwaj, et al.: Kernel-based machine learning protocol for predicting DNA-binding proteins. Nuc. Acids Res. (2005)
Szilágyi, A., Skolnick, J.: Efficient Prediction of Nucleic Acid Binding Function from Low-resolution Protein Structures. Journal of Molecular Biology 358, 922–933 (2006)
Moreland, J.L., Gramada, A., Buzko, O.V., Zhang, Q., Bourne, P.E.: The Molecular Biology Toolkit (MBT): A Modular Platform for Developing Molecular Visualization Applications. BMC Bioinformatics (2005)
Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)
Caruana, R., Karampatziakis, N., Yessenalina, A.: An empirical evaluation of supervised learning in high dimensions. In: International Conference on Machine Learning (ICML), pp. 96–103 (2008)
Lavrač, N., Flach, P.: An Extended Transformation Approach to Inductive Logic Programming. ACM Transactions on Computational Logic 2, 458–494 (2001)
Pabo, C.O., Sauer, R.T.: Transcription factors: structural families and principles of DNA recognition. Annual Review of Biochemistry 20, 137–151 (1992)
Mandel-Gutfreund, Y., Schueler, O., Margalit, H.: Comprehensive analysis of hydrogen bonds in regulatory protein DNA-complexes: in search of common principles. Journal of Molecular Biology 253, 370–382 (1995)
Jones, S., van Heyningen, P., Berman, H.M., Thornton, J.M.: Protein-DNA interactions: a structural analysis. Journal of Molecular Biology 287, 877–896 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Szabóová, A., Kuželka, O., Morales E., S., Železný, F., Tolar, J. (2011). Prediction of DNA-Binding Propensity of Proteins by the Ball-Histogram Method. In: Chen, J., Wang, J., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2011. Lecture Notes in Computer Science(), vol 6674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21260-4_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-21260-4_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21259-8
Online ISBN: 978-3-642-21260-4
eBook Packages: Computer ScienceComputer Science (R0)