Summary
The contribution that the Chemical Abstracts structural database (CAST-3D) and the Maybridge database (MAY) would make to diversifying the structural information and property space spanned by our corporate database (CBI) is assessed. A subset of the CAST-3D database has been selected to augment the structural diversity of various electronic databases used in computer-assisted drug design projects. The analysis of the MAY database directly offers the potential to expand the CBI compound library, but also provides a source for structural diversity in a format suitable for computer-assisted database searching and molecular design. The analysis performed is twofold. First, a nonhierarchical clustering technique available in the Daylight clustering package is applied to evaluate the structural differences between databases. The comparison is then extended to analyze various structure-derived property spaces calculated from molecular descriptors such as the logarithm of the octanol-water partition coefficient (CLOGP), the molar refractivity (CMR) and the electronic dipole moment (CDM). The diversity contribution of each database to these property spaces is quantified in relation to our corporate database.
Similar content being viewed by others
References
CAST-3D database, Chemical Abstracts Services, Columbus, OH.
MayBridge 94 Database, Daylight Chemical Information Systems, Irvine, CA, 1994.
Barnard, J.M. and Downs, G.M., J. Chem. Inf. Comput. Sci., 32 (1992) 644.
Brown, R.D., Bures, M.G. and Martin, Y.C., manuscript in preparation.
Jarvis, R.A. and Patrick, E.A., IEEE Trans. Comput., C 22 (1973) 1025.
Weininger, D., clustering package, v. 4.3, Daylight Chemical Information Systems Inc., Irvine, CA, 1993.
Willett, P., Winterman, V. and Bawden, D., J. Chem. Inf. Comput. Sci., 26 (1986) 109.
Downs, G.M., Willett, P. and Fisanick, W., J. Chem. Inf. Comput, Sci., 34 (1994) 1094.
Dubes, R. and Jain, A.K., Pattern Recogn., 11 (1979) 235.
Leo, A., CLOGP and CMR, v. 4.54E, BioByte Corporation, Claremont, CA, 1994.
Tripos Associates Inc., St. Louis, MO, 1994.
Hall, L.H., Molconn-X, v. 2, Hall Associates Consulting, Quincy, MA, 1993.
Martin, E.J., Blaney, J.M., Siani, M.A., Spellmeyer, D.C., Wong, A.K. and Moos, W.H., J. Med. Chem., 38 (1995) 1431.
Pearlman, R.S., Balducci, R., Rusinko, A., Skell, J.M. and Smith, K.M., CONCORD, Tripos Associates, St. Louis, MO, 1994.
Weininger, D., J. Chem. Inf. Comput. Sci., 28 (1988) 31.
Molecular Design Inc., San Leandro, CA.
James, C.A. and Weininger, D., Daylight Theory Manual, Daylight Chemical Information Systems Inc., Irvine, CA, 1995, pp. 43–50.
Willett, P., Winterman, V. and Bawden, D., J. Chem. Inf. Comput. Sci., 26 (1986) 36.
Weininger, D. and Delany, J., clustering package, v. 4.4, Daylight Chemical Information Systems Inc., Irvine, CA, 1995.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Shemetulskis, N.E., Dunbar, J.B., Dunbar, B.W. et al. Enhancing the diversity of a corporate database using chemical database clustering and analysis. J Computer-Aided Mol Des 9, 407–416 (1995). https://doi.org/10.1007/BF00123998
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF00123998