Abstract
We present a novel approach to define molecular similarity and its application in virtual screening. The algorithm is based on molecular surface properties in combination with a geometric encoding scheme. The molecular surface is described by screening charges calculated via COSMO. COSMO, the COnductor-like Screening MOdel, is a quantum-chemical molecular description originally developed and widely validated for solubilities and partition coefficients of molecules in the liquid state. The screening charges it calculates also define properties relevant to ligand-target binding such as hydrogen-bond donors and acceptors, positive and negative charges and lipophilic moieties from first principles. Encoding of properties is performed by three-point pharmacophores which were found to outperform other approaches. The similarity measure was validated on a dataset derived from the MDL Drug Data Report (MDDR) which comprises five classes of active compounds that are 5HT3 ligands, ACE inhibitors, HMG-CoA reductase inhibitors, PAF antagonists and TXA2 inhibitors. Compared to other approaches, the method presented here compares favorably with respect to the number of active compounds retrieved, finds different active scaffolds and is based on a solid theoretical foundation. Further work will be undertaken in order to find better shape and pharmacophoric feature encoding schemes as well as to make quantitative predictions of bioactivity.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bender, A., Glen, R.C.: Molecular similarity: a key technique in molecular informatics. Org. Biomol. Chem. 2, 3204–3218 (2004)
Johnson, M.A., Maggiora, G.M.: Concepts and Applications of Molecular Similarity. John Wiley & Sons, New York (1990)
Willett, P., Barnard, J.M., Downs, G.M.: Chemical similarity searching. J. Chem. Inf. Comput. Sci. 38, 983–996 (1998)
Byvatov, E., Schneider, G.: SVM-based feature selection for characterization of focused compound collections. J. Chem. Inf. Comput. Sci. 44, 993–999 (2004)
Wegner, J.K., Frohlich, H., Zell, A.: Feature selection for Descriptor based classification models. 1. Theory and GA-SEC algorithm. J. Chem. Inf. Comput. Sci. 44, 921–930 (2004)
Bender, A., Mussa, H.Y., Glen, R.C., Reiling, S.: Molecular similarity searching using atom environments, information-based feature selection, and a naive bayesian classifier. J. Chem. Inf. Comput. Sci. 44, 170–178 (2004)
Bender, A., Mussa, H.Y., Glen, R.C., Reiling, S.: Similarity searching of chemical databases using atom environment descriptors: evaluation of performance. J. Chem. Inf. Comput. Sci. 44, 1708–1718 (2004)
Downs, G.M., Willett, P., Fisanick, W.: Similarity Searching and Clustering of Chemical-Structure Databases Using Molecular Property Data. J. Chem. Inf. Comput. Sci. 34, 1094–1102 (1994)
Artymiuk, P.J., Bath, P.A., Grindley, H.M., Pepperrell, C.A., Poirrette, A.R., Rice, D.W., Thorner, D.A., Wild, D.J., Willett, P., Allen, F.H., et al.: Similarity searching in databases of three-dimensional molecules and macromolecules. J. Chem. Inf. Comput. Sci. 32, 617–630 (1992)
Erlanson, D.A., McDowell, R.S., O’Brien, T.: Fragment-based drug discovery. J. Med. Chem. 47, 3463–3482 (2004)
Schneider, G., Neidhart, W., Giller, T., Schmid, G.: Scaffold-hopping by topological pharmacophore search: A contribution to virtual screening. Angew. Chem.-Int. Edit. 38, 2894–2896 (1999)
Cramer, R.D., Patterson, D.E., Bunce, J.D.: Comparative Molecular-Field Analysis (Comfa). 1. Effect of Shape on Binding of Steroids to Carrier Proteins. J. Am. Chem. Soc. 110, 5959–5967 (1988)
Pastor, M., Cruciani, G., McLay, I., Pickett, S., Clementi, S.: GRid-INdependent descriptors (GRIND): a novel class of alignment-independent three-dimensional molecular descriptors. J. Med. Chem. 43, 3233–3243 (2000)
Bender, A., Mussa, H.Y., Gill, G.S., Glen, R.C.: Molecular surface point environments for virtual screening and the elucidation of binding patterns (MOLPRINT). J. Med. Chem. 47, 6569–6583 (2004)
Gund, P.: Three-dimensional pharmacophoric pattern searching. Prog. Mol. Subcell. Biol. 5, 117–143 (1977)
Mason, J.S., Good, A.C., Martin, E.J.: 3-D pharmacophores in drug discovery. Curr. Pharm. Des. 7, 567–597 (2001)
Brown, R.D., Martin, Y.C.: The information content of 2D and 3D structural descriptors relevant to ligand-receptor binding. J. Chem. Inf. Comput. Sci. 37, 1–9 (1997)
Matter, H.: Selecting optimally diverse compounds from structure databases: a validation study of two-dimensional and three-dimensional molecular descriptors. J. Med. Chem. 40, 1219–1229 (1997)
Thimm, M., Goede, A., Hougardy, S., Preissner, R.: Comparison of 2D similarity and 3D superposition. Application to searching a conformational drug database. J. Chem. Inf. Comput. Sci. 44, 1816–1822 (2004)
Goodford, P.J.: A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. J. Med. Chem. 28, 849–857 (1985)
Klamt, A., Schuurmann, G.: Cosmo - a New Approach to Dielectric Screening in Solvents with Explicit Expressions for the Screening Energy and Its Gradient. J. Chem. Soc.-Perkin Trans. 2, 799–805 (1993)
Eckert, F., Klamt, A.: COSMOtherm (Version Version C1.2, Release 01.04). COSMOlogic GmbH & Co. KG., Leverkusen, Germany (2004)
Klamt, A., Eckert, F., Hornig, M.: COSMO-RS: A novel view to physiological solvation and partition questions. J. Comput.-Aided Mol. Des. 15, 355–365 (2001)
Klamt, A.: Conductor-Like Screening Model for Real Solvents - a New Approach to the Quantitative Calculation of Solvation Phenomena. J. Phys. Chem. 99, 2224–2235 (1995)
Haraki, K.S., Sheridan, R.P., Venkataraghavan, R., Dunn, D.A., McCulloch, R.: Looking fo Pharmacophores in 3D-Databases: Does Conformational Searching Improve the Yield of Actives? Tetrahedron Comput. Methodol. 3, 565–573 (1990)
Tomasi, J., Persico, M.: Molecular-Interactions in Solution - an Overview of Methods Based on Continuous Distributions of the Solvent. Chem. Rev. 94, 2027–2094 (1994)
Schafer, A., Klamt, A., Sattel, D., Lohrenz, J.C.W., Eckert, F.: COSMO Implementation in TURBOMOLE: Extension of an efficient quantum chemical code towards liquid systems. PCCP Phys. Chem. Chem. Phys. 2, 2187–2193 (2000)
Ahlrichs, R., Bar, M., Haser, M., Horn, H., Kolmel, C.: Electronic-Structure Calculations on Workstation Computers - the Program System Turbomole. Chem. Phys. Lett. 162, 165–169 (1989)
Klamt, A.: COSMO-RS, From Quantum Chemistry to Fluid Phase Thermodynamics and Drug Design. Elsevier, Amsterdam (2005)
Hornig, M., Klamt, A.: COSMOfrag: A Novel Tool for High Throughput ADME Property Prediction and Similarity Screening Based on Quantum Chemistry. J. Chem. Inf. Model (2005) (submitted)
MDL Drug Data Report; MDL ISIS/HOST software, MDL Information Systems, Inc.
Briem, H., Lessel, U.: In vitro and in silico affinity fingerprints: Finding similarities beyond structural classes. Perspect. Drug Discov. Des. 20, 231–244 (2000)
Bender, A., Mussa, H.Y., Gill, G.S., Glen, R.C.: Molecular surface point environments for virtual screening and the elucidation of binding patterns (MOLPRINT). IEEE Int. Conf. Syst. Man Cybern. 5, 4553–4558 (2004)
MOE (Molecular Operating Environment); Chemical Computing Group Inc.: Montreal, Quebec, Canada
Sadowski, J., Gasteiger, J., Klebe, G.: Comparison of Automatic 3-Dimensional Model Builders Using 639 X-Ray Structures. J. Chem. Inf. Comput. Sci. 34, 1000–1008 (1994)
Rarey, M., Dixon, J.S.: Feature trees: a new molecular similarity measure based on tree matching. J. Comput.-Aided Mol. Des. 12, 471–490 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bender, A., Klamt, A., Wichmann, K., Thormann, M., Glen, R.C. (2005). Molecular Similarity Searching Using COSMO Screening Charges (COSMO/3PP). In: R. Berthold, M., Glen, R.C., Diederichs, K., Kohlbacher, O., Fischer, I. (eds) Computational Life Sciences. CompLife 2005. Lecture Notes in Computer Science(), vol 3695. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11560500_16
Download citation
DOI: https://doi.org/10.1007/11560500_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29104-6
Online ISBN: 978-3-540-31726-5
eBook Packages: Computer ScienceComputer Science (R0)