Abstract
FTrees (FT) is a known chemoinformatic tool able to condense molecular descriptions into a graph object and to search for actives in large databases using graph similarity. The query graph is classically derived from a known active molecule, or a set of actives, for which a similar compound has to be found. Recently, FT similarity has been extended to fragment space, widening its capabilities. If a user were able to build a knowledge-based FT query from information other than a known active structure, the similarity search could be combined with other, normally separate, fields like de-novo design or pharmacophore searches. With this aim in mind, we performed a comprehensive analysis of several databases in terms of FT description and provide a basic statistical analysis of the FT spaces so far at hand. Vendors’ catalogue collections and MDDR as a source of potential or known “actives”, respectively, have been used. With the results reported herein, a set of ranges, mean values and standard deviations for several query parameters are presented in order to set a reference guide for the users. Applications on how to use this information in FT query building are also provided, using a newly built 3D-pharmacophore from 57 5HT-1F agonists and a published one which was used for virtual screening for tRNA-guanine transglycosylase (TGT) inhibitors.


Similar content being viewed by others
References
Willett P (2006) Drug Discov Today 11:1046
Stiefl N, Zaliani A (2006) J Chem Inf Model 46:87
Stiefl N, Watson IA, Baumann K, Zaliani A (2006) J Chem Inf Model 46:208
Fechner U, Paetz J, Schneider G (2005) QSAR Comb Sci 24(8):961
Gillet VJ, Willett P, Bradshaw J (2003) J Chem Inf Comput Sci 43:338
Bajorath J (2002) Nat Rev Drug Discov 1:882
Bajorath J (2002) Drug Discov Today 7:1035
Tanimoto TT (1957) IBM Internal Report
Rarey M, Stahl M (2001) J Comput Aided Mol Des 15(6):497–520
Rarey M, Dixon JS (1998) J Comput Aided Mol Des 12:471
Bender A, Mussa HY, Gill GS, Glen RC (2004) J Med Chem 47:6569
Evers A, Hessler G, Matter H, Klabunde T (2005) J Med Chem 48:5448
Bissantz C, Schalon C, Guba W, Stahl M (2005) Proteins 61:938
Schneider G, Schneider P, Renner S (2006) QSAR Comb Sci 25:1162
Renner S, Schneider G (2006) ChemMedChem 1:181
Klebe G (2006) Drug Discov Today 11:580
Hartshorn MJ, Murray CA, Cleasby A, Frederickson M, Tickle IJ, Jhoti H (2005) J Med Chem 48:403
Hajduk PJ, Huth JR, Fesik SW (2005) J Med Chem 48:2518
Hessler G, Zimmermann M, Matter H, Evers A, Naumann T, Lengauer T, Rarey M (2005) J Med Chem 48:6575
The MDL Drug Data Report is available from MDL Information Systems Inc., San Leandro CA
Irwin JJ, Shoichet BK (2005) J Chem Inf Model 45:177
http://blaster.docking.org/zinc/subset1 (accessed Dec 2007)
Gasteiger J, Rudolph C, Sadowski J (1992) Tetrahedron Comput Method 3:537
http://www.chemcomp.com (accessed Dec 2007)
FTrees 1.5.1, http://www.biosolveit.de/FTrees (accessed Dec 2007) available from BioSolveIT, St. Augustin, Germany
http://www.python.org (accessed Dec 2007)
http://www.scipy.org (accessed Dec 2007)
Filla SA, Mathes BM, Johnson KW, Phebus LA, Cohen ML, Nelson DL, Zgombick JM, Erickson JA, Schenck KW, Wainscott DB, Branchek TA, Schaus JM (2003) J Med Chem 46:3060
Dixon SL, Smondyrev AM, Rao SN (2006) Chem Biol Drug Des 67:370
Dixon SL, Smondyrev AM, Knoll EH, Rao SN, Shaw DE, Friesner RA (2006) J Comput Aided Mol Des 20:647
http://www.schroedinger.com (accessed Dec 2007)
http://www.biosolveit.de/FTrees/download/ftrees_ug.pdf (accessed Dec 2007)
Brenk R, Naerum L, Gradler U, Gerber HD, Garcia GA, Reuter K, Stubbs MT, Klebe G (2003) J Med Chem 46:1133
Vieth M, Siegel MG, Higgs RE, Watson IA, Robertson DH, Savin KA, Durst GL, Hipskind PA (2004) J Med Chem 47:224
Teague SJ, Davis AM, Leeson PD, Oprea T (1999) Angew Chem Int Ed 38:3743
Laurie AT, Jackson RM (2005) Bioinformatics 21:1908
Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Adv Drug Deliv Rev 46:3
Carr RA, Congreve M, Murray CW, Rees DC (2005) Drug Discov Today 10:987
Rarey M, Stahl M (2001) J Comput Aided Mol Des 15:497
Acknowledgements
We are in debt to Joerg Degen and Prof. Matthias Rarey (ZBH, Hamburg) for their suggestions and to the reviewers for their constructing criticism.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Gerlach, C., Broughton, H. & Zaliani, A. FTree query construction for virtual screening: a statistical analysis. J Comput Aided Mol Des 22, 111–118 (2008). https://doi.org/10.1007/s10822-008-9178-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-008-9178-7