Abstract
We describe a library of molecular fragments designed to model and predict non-bonded interactions between atoms. We apply the Bayesian approach, whereby prior knowledge and uncertainty of the mathematical model are incorporated into the estimated model and its parameters. The molecular interaction data are strengthened by narrowing the atom classification to 14 atom types, focusing on independent molecular contacts that lie within a short cutoff distance, and symmetrizing the interaction data for the molecular fragments. Furthermore, the location of atoms in contact with a molecular fragment are modeled by Gaussian mixture densities whose maximum a posteriori estimates are obtained by applying a version of the expectation-maximization algorithm that incorporates hyperparameters for the components of the Gaussian mixtures. A routine is introduced providing the hyperparameters and the initial values of the parameters of the Gaussian mixture densities. A model selection criterion, based on the concept of a `minimum message length' is used to automatically select the optimal complexity of a mixture model and the most suitable orientation of a reference frame for a fragment in a coordinate system. The type of atom interacting with a molecular fragment is predicted by values of the posterior probability function and the accuracy of these predictions is evaluated by comparing the predicted atom type with the actual atom type seen in crystal structures. The fact that an atom will simultaneously interact with several molecular fragments forming a cohesive network of interactions is exploited by introducing two strategies that combine the predictions of atom types given by multiple fragments. The accuracy of these combined predictions is compared with those based on an individual fragment. Exhaustive validation analyses and qualitative examples (e.g., the ligand-binding domain of glutamate receptors) demonstrate that these improvements lead to effective modeling and prediction of molecular interactions.
Similar content being viewed by others
References
Goodford, P.J., J. Med. Chem., 28 (1985) 849.
Wade, R.C., Clark, K.J. and Goodford, P.J., J.Med. Chem., 36 (1993) 140.
Wade, R.C. and Goodford, P.J., J.Med. Chem., 36 (1993) 148.
Kellogg, G.E., Semus, S.F. and Abraham, D.J., J. Comput.-Aided Mol. Des., 5 (1991) 545.
Danziger, D.J. and Dean, P.M., P. Roy. Soc. Lond. B Biol., 236 (1989) 101.
Danziger, D.J. and Dean, P.M., P. Roy. Soc. Lond. B Biol., 236 (1989) 115.
Laskowski, R.A., Thornton, J.M., Humblet, C. and Singh, J., J. Mol. Biol., 259 (1996) 175.
Pitt, W.R. and Goodfellow, J.M., Protein Eng., 4 (1991) 531.
Böhm, H.J., J. Comput.-Aided Mol. Des., 6 (1992) 61. 461
Böhm, H.J., J. Comput.-Aided Mol. Des., 6 (1992) 593.
Böhm, H.J., J. Comput.-Aided Mol. Des., 8 (1994) 623.
Bruno, I.J., Cole, J.C., Lommerse, J.P., Rowland, R.S., Taylor, R. and Verdonk, M.L., J. Comput.-Aided Mol. Des., 11 (1997) 525.
Verdonk, M.L., Cole, J.C. and Taylor R., J. Mol. Biol., 289 (1999) 1093.
Nissink, J.W.M., Verdonk, M.L. and Klebe, G., J. Comput.-Aided Mol. Des., 14 (2000) 787.
Verdonk, M.L., Cole, J.C., Watson, P., Gillet, V. and Willett, P., J. Mol. Biol., 307 (2001) 841.
Boer, D.R., Kroon J., Cole, J.C., Smith, B. and Verdonk, M.L., J. Mol. Biol., 312 (2001) 275.
Klebe, G., J. Mol. Biol., 237 (1994) 212.
Verkhivker, G., Appelt, K., Freer, S.T. and Villafranca, J.E., Protein Eng., 8 (1995) 677.
Mitchell, J.B.O., Laskowki, R.A., Alex, A. and Thornton, J.M., J. Comput. Chem., 20 (1999) 1165.
Mitchell, J.B.O., Laskowki, R.A., Alex, A., Forster, M.J. and Thornton, J.M., J. Comput. Chem., 20 (1999) 1177.
Muegge, I. and Martin, Y.C., J. Med. Chem., 42 (1999) 791.
Gohlke, H., Hendlich, M. and Klebe, G., J. Mol. Biol., 295 (2000) 337.
Hendlich, M., Acta Crystallogr. D, 54 (1998) 1178.
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N. and Bourne, P.E., Nucleic Acids Res., 28 (2000) 235.
Rantanen, V.-V., Denessiouk, K.A., Gyllenberg, M., Koski, T. and Johnson, M.S., J. Mol. Biol., 313 (2001) 197.
Bernardo, J.M. and Smith, A.F.M., Bayesian Theory, John Wiley and Sons, Chichester, UK, 1994.
McLachlan, G.J. and Krishnan, T., The EM Algorithm and Extensions, John Wiley and Sons, New York, 1997.
McLachlan, G.J. and Peel, T., Finite Mixture Models, John Wiley and Sons, New York, 2000.
Durbin, R., Eddy, S.R., Krogh, A. and Mitchison, G.J., Biological Sequence Analysis: Probabilistic Models for Proteins and Nucleic Acids, Cambridge University Press, Cambridge, 1998.
Lanterman, A.D., Int. Stat. Rev., 69 (2001) 185.
Li, A.-J. and Nussinov, R., Proteins, 32 (1998) 111.
Rantanen, V.-V., Gyllenberg, M., Koski, T. and Johnson, M.S., Bioinformatics, 18 (2002) 1257.
Bondi, A., J. Phys. Chem., 68 (1964) 441.
Böhning, D., Schlattman, P. and Lindsay, B.G., Biometrics, 48 (1992) 283.
Ewens, W.J. and Grant, G.R., Statistical Methods of Bioinformatics, Springer Verlag, New York, 2001.
Geiger, D. and Heckerman, D., Ann. Stat., 25 (1997) 1344.
Gyllenberg, M. and Koski, T., Math. Biosci., 177&178 (2002) 161.
Geiger, D. and Heckerman, D., Ann. Stat., 30 (2002) 1412.
Gauvain, J.-L. and Lee, C.-H., IEEE T. Speech Audi. P., 2 (1994) 291.
Hastie, T. and Tibshirani, R., J. Roy. Stat. Soc. B Met., 58 (1996) 158.
Rissanen, J., IEEE T. Inform. Theory, 42 (1996) 40.
Rissanen, J., J. Comput. Syst. Sci., 55 (1997) 89.
Figueiredo, M. and Jain, A.K., IEEE T. Pattern Anal., 24 (2002) 381.
Wallace, C.S. and Freeman, P.R., J. Roy. Stat. Soc. B Met., 49 (1987) 241.
Wallace, C.S. and Freeman, P.R., J. Roy. Stat. Soc. B Met., 54 (1992) 195.
Samudrala, R. and Moult, J., J. Mol. Biol., 275 (1998) 895.
Chou, P.Y. and Fasman, G.D., Biochemistry, 13 (1974) 211.
Kittler, J., Hatef, M., Duin, R.P.W. and Matas, J., IEEE T. Pattern Anal., 20 (1998) 226.
Tax, D.M.J., van Breukelen, M., Duin, R.P.W. and Kittler, J., Pattern Recogn., 33 (2000) 1475.
Kuusinen, A., Arvola, M. and Keinänen, K., EMBO J., 14 (1995) 6327.
Armstrong, N., Sun, Y., Chen, G.Q. and Gouaux, E., Nature, 395 (1998) 913.
Armstrong, N. and Gouaux, E., Neuron, 28 (2000) 165.
Kraulis, P.J., J. Appl. Crystallogr., 24 (1991) 946.
Merritt, E.A. and Bacon, D.J., Methods Enzymol., 277 (1997) 505.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Rantanen, VV., Gyllenberg, M., Koski, T. et al. A Bayesian molecular interaction library. J Comput Aided Mol Des 17, 435–461 (2003). https://doi.org/10.1023/A:1027371810547
Issue Date:
DOI: https://doi.org/10.1023/A:1027371810547