Skip to main content
Log in

Similarity searching in large combinatorial chemistry spaces

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

We present a novel algorithm, called Ftrees-FS, for similarity searching in large chemistry spaces based on dynamic programming. Given a query compound, the algorithm generates sets of compounds from a given chemistry space that are similar to the query. The similarity search is based on the feature tree similarity measure representing molecules by tree structures. This descriptor allows handling combinatorial chemistry spaces as a whole instead of looking at subsets of enumerated compounds. Within few minutes of computing time, the algorithm is able to find the most similar compound in very large spaces as well as sets of compounds at an arbitrary similarity level. In addition, the diversity among the generated compounds can be controlled. A set of 17 000 fragments of known drugs, generated by the RECAP procedure from the World Drug Index, was used as the search chemistry space. These fragments can be combined to more than 1018 compounds of reasonable size. For validation, known antagonists/inhibitors of several targets including dopamine D4, histamine H1, and COX2 are used as queries. Comparison of the compounds created by Ftrees-FS to other known actives demonstrates the ability of the method to jump between structurally unrelated molecule classes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Johnson, M.A. and Maggiora, G.M., Concepts and Applications of Molecular Similarity. Wiley, New York, 1990.

    Google Scholar 

  2. Dean, P.M., Molecular Similarity in Drug Design. Chapman & Hall, London, 1995.

    Google Scholar 

  3. Downs, G.M. and Willett, P., in Lipkowitz, K.B. and Boyd, D.B. (eds), Reviews in Computational Chemistry, Vol. 7. VCH, New York, 1996.

    Google Scholar 

  4. Good, A.C. and Mason, J.S., in Lipkowitz, K.B. and Boyd, D.B. (eds), Reviews in Computational Chemistry, Vol. 7. VCH, New York, 1996.

    Google Scholar 

  5. Willett, P., J. Chem. Inf. Comput. Sci., 38 (1998) 983.

    Google Scholar 

  6. Kubinyi, H., in Kubinyi, H., Folkers, G. and Martin, Y.C. (eds), 3D QSAR in Drug Design: Ligand Protein Interactions and Molecular Similarity, Vol. 9-11. Kluwer/ESCOM, Dordrecht, 1998.

    Google Scholar 

  7. Daylight Software Manual, Daylight Inc., Mission Viejo, California, USA.

  8. MACCS II, MDL Information Systems Inc., San Leandro, California, USA.

  9. Leach, A.R. and Hann, M.M., Drug Discovery Today, 5 (2000) 326.

    Google Scholar 

  10. Barnard, J.M., J. Chem. Inf. Comput. Sci., 37 (1997) 59.

    Google Scholar 

  11. Lewell, X.Q., Judd, D.B., Watson, S.P. and Hann, M.M., J. Chem. Inf. Comput. Sci., 38 (1998) 511.

    Google Scholar 

  12. Schneider, G., Lee, M.-L., Stahl, M. and Schneider, P., J. Comput. Aid. Mol. Des., 14 (2000) 487.

    Google Scholar 

  13. Douguet, D., Thoreau, E. and Grassy, G., J. Comput. Aid. Mol. Des., 14 (2000) 449.

    Google Scholar 

  14. Weininger, D., J. Chem. Inf. Comput. Sci., 28 (1988) 31.

    Google Scholar 

  15. Andrews, K.M. and Cramer, R.D., J. Med. Chem., 43 (2000) 1723.

    Google Scholar 

  16. Rarey, M. and Dixon, J.S., J. Comput. Aid. Mol. Des., 12 (1998) 471.

    Google Scholar 

  17. Matter, H. and Rarey, M., in Jung, G. (Ed.), Combinatorial Organic Chemistry. Wiley-VCH, New York, NY, 1999.

    Google Scholar 

  18. Stahl, M., Rarey, M. and Klebe, G., in Lengauer, T. (Ed.), Bioinformatics: From Genomes to Drugs. VCH, Weinheim, 2000.

    Google Scholar 

  19. Rarey, M., Kramer, B., Lengauer, T. and Klebe, G., J. Mol. Biol., 261 (1996) 470.

    Google Scholar 

  20. Rarey, M., Wefing, S. and Lengauer, T., J. Comput.-Aid. Mol. Des., 10 (1996) 41.

    Google Scholar 

  21. WDI:World Drug Index.

  22. Sanner, M.A., Exp. Opin. Ther. Patents, 8 (1998) 383.

    Google Scholar 

  23. Aslanian, R. and Piwinski, J.J., Exp. Opin. Ther. Patents, 7 (1997) 201.

    Google Scholar 

  24. Carter, J.S., Exp. Opin. Ther. Patents, 8 (1997) 21.

    Google Scholar 

  25. Friesen, R.W., Brideau, C., Chan, C.C., Charleson, S., Deschenes, D., Dubé, D., Ethier, D., Fortin, R., Gauthier, J.Y., Girard, Y., Gordon, R., Greig, G.M., Riendau, D., Savoie, C., Wang, Z., Wong, E., Visco, D., Xu, L.J. and Young, R.N., Bioorg. Med. Chem. Lett., 8 (1998) 2777.

    Google Scholar 

  26. Kalgutkar, A.S., Exp. Opin. Ther. Patents, 9 (1999) 831.

    Google Scholar 

  27. García-Echeverría, C., Traxler, P. and Evans, D.B., Med. Res. Rev., 20 (2000) 28.

    Google Scholar 

  28. Boehm, J.C. and Adams, J.L., Exp. Opin. Ther. Patents, 10 (2000) 25.

    Google Scholar 

  29. Chakravarty, P.K., Exp. Opin. Ther. Patents, 5 (1995) 431.

    Google Scholar 

  30. Murray, C.W., Auton, T.R. and Elridge, M.D., J. Comput. Aid. Mol. Des., 12 (1999) 503.

    Google Scholar 

  31. Wiley, M.R. and Fisher, M.J., Exp. Opin. Ther. Patents, 7 (1997) 1265.

    Google Scholar 

  32. Bernstein, F.C., Koetzle, T.E., Williams, G.J.B., Meyer, J., E. F., Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T. and M., T., J. Mol. Biol., 112 (1977) 535.

    Google Scholar 

  33. Rarey, M., Kramer, B. and Lengauer, T., J. Comput. Aid. Mol. Des., 11 (1997) 369.

    Google Scholar 

  34. Rarey, M., Kramer, B. and Lengauer, T., Bioinformatics, 15 (1999) 243.

    Google Scholar 

  35. Al-Obeidi, F. and Ostrem, J.A., Exp. Opin. Ther. Patents, 9 (1999) 931.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rarey, M., Stahl, M. Similarity searching in large combinatorial chemistry spaces. J Comput Aided Mol Des 15, 497–520 (2001). https://doi.org/10.1023/A:1011144622059

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1011144622059

Navigation