Abstract
The bacterial reverse mutation assay (Ames test) is a biological assay used to assess the mutagenic potential of chemical compounds. In this paper approaches for the development of an in silico mutagenicity screening tool are described. Three individual in silico models, which cover both structure activity relationship methods (SARs) and quantitative structure activity relationship methods (QSARs), were built using three different modelling techniques: (1) an in-house alert model: which uses SAR approach where alerts are generated based on experts judgements; (2) a kNN approach (k-Nearest Neighbours), which is a QSAR model where a prediction is given based on outcomes of its k chemical neighbours; (3) a naive Bayesian model (NB), which is another QSAR model, where a prediction is derived using a Bayesian formula through preselected identified informative chemical features (e.g., physico-chemical, structural descriptors). These in silico models, were compared against two well-known alert models (DEREK and ToxTree) and also against three different consensus approaches (Categorical Bayesian Integration Approach (CBI), Partial Least Squares Discriminate Analysis (PLS-DA) and simple majority vote approach). By applying these integration methods on the validation sets it was shown that both integration models (PLS-DA and CBI) achieved better performance than any of the individual models or consensus obtained by simple majority rule. In conclusion, the recommendation of this paper is that when obtaining consensus predictions for Ames mutagenicity, approaches like PLS-DA or CBI should be the first choice for the integration as compared to a simple majority vote approach.
Similar content being viewed by others
References
Ames BN, Lee FD, Durston WE (1973) An improved bacterial test system for the detection and classification of mutagens and carcinogens. PNAS 70:782–786
Mortelmans K, Zeiger E (2000) The Ames Salmonella/microsome mutagenicity assay. Mutat Res 455:29–60
Yan Z, Caldwell G (eds) (2004) Improvement of the Ames test using human liver S9 preparation. Optimization in drug discovery: in vitro methods. Methods in pharmacology and toxicology. Humana Press
Ames B, McCann J, Yamasaki E (1975) Methods for detecting carcinogens and mutagens with the Salmonella/mammalian-microsome mutagenicity test. Mutat Res 31:347–360
Ames BN (1984) The detection of environmental mutagens and potential carcinogens. Cancer 53:2030–2040
Miller JA, Miller EC (1977) Ultimate chemical carcinogen as reactive mutagenic electrophiles. In: Hiatt HH, Watson JD, Winsten JA (eds) Origin of human cancers. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, pp 605–627
Ashby J, Tennant RW (1988) Chemical-structure, salmonella mutagenicity and extent of carcinogenicity as indicators of genotoxic carcinogenesis among 222 chemicals tested in rodents by the United-States NCI/NTP. Mutat Res 204:17–115
Ashby J, Tennant RW (1991) Definitive relationships among chemical structure, carcinogenicity and mutagenicity for 301 chemicals tested by the US NTP. Mutat Res 257:229–306
Ashby J (1985) Fundamental structural alerts to potential carcinogenicity or non-carcinogenicity. Environ Mutagen 7:919–921
Cariello NF, Wilson JD, Britt BH, Wedd DJ, Burlinson B, Gombar V (2002) Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity. Mutagenesis 17:321–329
Benfenati E, Benigni R, Demarini DM, Helma C, Kirkland D, Martin TM, Mazzatorta G, Ouedraogo-Arras G, Richard AM, Schilter B, Schoonen WGEJ, Snyder RD, Yang C (2009) Predictive models for carcinogenicity and mutagenicity: frameworks, state-of-the-art, and perspectives. J Environ Sci Health C 27:57–90
Brinn M, Walsh P, Payne M, Bott B (1992) Neural network classification of mutagens using structural fragment data. SAR QSAR Environ Res 1:169–211
Basak SC, Mills D, Gute BD, Hawkins DM (2003) Predicting mutagenicity of congeneric and diverse sets of chemicals using computed molecular descriptors: a hierarchical approach. In: Benigni R (ed) Quantitative structure-activity relationship (QSAR) models of chemical mutagens and carcinogens. CRC Press, Boca Raton, pp 207–234
Klopman G, Chakravarti SK, Harris H, Ivanov J, Saiakhov RD (2003) In silico screening of high production volume chemicals for mutagenicity using the MCASE QSAR expert system. SAR QSAR Environ Res 14(2):165–180
Klopman G, Chakravarti SK, Harris H, Ivanov J, Saiakhov RD (2003) In silico screening of high production volume chemicals for mutagenicity using the MCASE QSAR expert system. SAR QSAR Environ Res 14:165–180
Serafimova R, Todorov M, Pavlov T, Kotov S, Jacob E, Aptula A, Mekenyan O (2007) Identification of the structural requirements for mutagenicity, by incorporating molecular flexibility and metabolic activation of chemicals. II. General Ames mutagenicity model. Chem Res Toxicol 20:662–676
Carlsson L, Helgee EA, Boyer S (2009) Interpretation of nonlinear QSAR models applied to Ames mutagenicity data. J Chem Inform Model 49:2551–2558
Didziapetris R, Lanevskij K, Japertas P (2008) Trainable QSAR model of Ames genotoxicity. Abstracts of papers, 236th ACS national meeting, Philadelphia, United States, August 17–21, TOXI-088
Langham JJ, Jain AN (2008) Accurate and interpretable computational modelling of chemical mutagenicity. J Chem Inform Model 48:1833–1839
Maran U, Sild S (2003) QSAR modeling of genotoxicity on non-congeneric sets of organic compounds. Artific Intell Rev 20:13–38
Helma C, Cramer T, Kramer S, De Raedt L (2004) Data mining and machine learning techniques for the identification of mutagenicity inducing substructures and structure activity relationships of noncongeneric compounds. J Chem Inf Comp Sci 44:1402–1411
Contrera JF, Matthews EJ, Kruhlak NL, Benz RD (2005) In silico screening of chemicals for bacterial mutagenicity using electropological E-state indices and MDL QSAR software. Regulat Pharmacol Toxicol 43:313–323
Mekenyan O, Dimitrov S, Serafimova R, Thompson ED, Kotov S, Dimitrova N, Walker JD (2004) Identification of the structural requirements for mutagenicity by incorporating molecular flexibility and metabolic activation of chemicals I: TA100 model. Chem Res Toxicol 17:753–766
Sanderson DM, Earnshaw CG (1991) Computer prediction of possible toxic action from chemical structure; the DEREK system. Hum Exp Toxicol 10:261–273
Greene N, Judson PN, Langowski JJ, Marchant CA (1999) Knowledge-based expert systems for toxicity and metabolism prediction: DEREK, StAR and METEOR. SAR QSAR Environ Res 10:299–314
Judson PN (2006) Using computer reasoning about qualitative and quantitative information to predict metabolism and toxicity. In: Testa B, Kramer SD, Wunderli-Allespach H, Volkers G (eds) Pharmacokinetic profiling in drug research: biological, physicochemical, and computational strategies. Wiley, New York, pp 183–215
Ridings JE, Barratt MD, Cary R, Earnshaw CG, Eggington CE, Ellis MK, Judson PN, Langowski JJ, Marchant CA, Payne MP, Watson WP, Yih TD (1996) Computer prediction of possible toxic action from chemical structure: an update on the DEREK system. Toxicology 106:267–279
Benigni R, Bossa C, Jeliazkova N, Netzeva T, Worth A (2008) The Benigni/Bossa rulebase for mutagenicity and carcinogenicity: a module of ToxTree. Assessed on 25th Aug 11 http://toxtree.sourceforge.net/carc.html
Serafimova R, Gatnik MF, Worth A (2010) Review of QSAR models and software tools for predicting genotoxicity and carcinogenicity, JRC Scientific and technical report (2010) Assessed on 5th Aug 10. http://ecb.jrc.ec.europa.eu/DOCUMENTS/QSAR/EUR_24427_EN.pdf
Kazius J, McGuire R, Bursi R (2005) Derivation and validation of toxicophores for mutagenicity prediction. J Med Chem 48:312–320
FDA CDER FOI site: http://www.accessdata.fda.gov/scripts/cder/drugsatfda. Accessed June 5, 2010
CCRIS: http://toxnet.nlm.nih.gov/cgi-bin/sis/htmlgen?CCRIS Accessed June 5, 2010
Tennant RW (1991) The genetic toxicity database of the National Toxicology Program: evaluation of the relationships between genetic toxicity and carcinogenicity. Environ Health Perspect 96:47–51
Tokyo-Eiken (2007) Tokyo Metropolitan Institute of Public Health, Mutagenicity of food additives. http://www.tokyoeiken.go.jp/henigen/index.htm. Accessed June 5, 2010
Bercu JP, Stuart MM, Deahl JT, Gombar VK, Callis CM, van Lier R (2010) In silico approaches to predicting cancer potency for risk assessment of genotoxic impurities in drug substances. Regul Toxicol Pharmacol 57:300–306
Cohen J (1968) Weighed kappa: nominal scale agreement with provision for scaled disagreement or partial credit. Psychol Bull 70:213–220
Strijbos J, Martens R, Prins F, Jochems W (2006) Content analysis: what are they talking about? Comput Educ 46:29–48
Matthews BW (1975) Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta 405:442–451
Enoch SJ, Cronin MTD (2010) A review of the electrophilic reaction chemistry involved in covalent DNA binding. Crit Rev Toxicol 40:728–748
Enoch SJ, Madden JC, Cronin MTD (2008) Identification of mechanism of toxic action for skin sensitisation using a SMARTS pattern based approach. SAR QSAR Environ Res 19:555–578
Hillebrecht A, Muster W, Brigo A, Kansy M, Weiser T, Singer T (2011) Comparative evaluation of in silico systems for Ames test mutagenicity prediction: scope and limitations. Chem Res Toxicol 24:843–854
Snyder RD, Smith MD (2005) Computational prediction of genotoxicity: room for improvement. Drug Discov Today Biosilico 10:1119–1124
Marchant CA, Briggs KA, Long A (2008) In silico tools for sharing data and knowledge on toxicity and metabolism: Derek for windows Meteor, and Vitic. Toxicol Mech Methods 18:177–187
White AC, Mueller RA, Gallavan R, Aaron S, Wilson AGE (2003) A multiple in silico program approach for the prediction of mutagenicity from chemical structure. Mutat Res 539:77–89
Benigni R, Bossa C (2006) Structural alerts of mutagens and carcinogens. Curr Comput Aid Drug Des 2:169–176
Benigni R, Bossa C (2006) Structure-activity models of chemical carcinogens: state of the art, and new directions. Ann Ist Super Sanità 42:118–126
Zheng W, Tropsha A (2000) Novel variable selection quantitative structure property relationship approach based on k-nearest neighbor principle. J Chem Inf Comput Sci 40:185–194
Hassan M, Brown RD, Varma-O’Brian S, Rogers D (2006) Cheminformatics analysis and learning in a data pipelining environment. Mol Divers 10:283–299
Glick M, Jenkins JL, Nettles JH, Hitchings H, Davies JW (2006) Enrichment of high-throughput screening data with increasing levels of noise using support-vector machines, recursive partitioning, and laplacian-modified Naïve Bayesian classifiers. J Chem Inf Model 46:193–200
Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50:742–754
Hansen K, Mika S, Schroeter T, Sutter A, ter Laak A, Steger-Hartmann T, Heinrich N, Müller KR (2009) Benchmark data set for in silico prediction of Ames mutagenicity. J Chem Inf Model 49:2077–2081
Sun HA (2005) Naive bayes classifier for prediction of multidrug resistance reversal activity on the basis of atom typing. J Med Chem 48:4031–4039
http://www.epa.gov/nrmrl/std/cppb/qsar/testuserguide.pdf. Assessed 25th Aug 2011)
Mazzatorta P, Tran L, Schilter B, Grigorov M (2007) Integration of structure-activity relationship and artificial intelligence systems to improve in silico prediction of Ames test mutagenicity. J Chem Inform Model 47:34–38
Ferrari T, Gini G (2010) An open source multistep model to predict mutagenicity from statistical analysis and relevant structural alerts. Chem Central J 2010, 4(Suppl 1):S2 (29 July 2010)–http://www.journal.chemistrycentral.com/content/4/S1/S2)
Rosenkranz HS, Klopman G, Chankong V, Pet-Edwards J, Haimes YY (1984) Prediction of environmental carcinogens: a strategy for the mid-1980s. Environ Mutagen 6:231–258
Chankong V, Haimes YY, Rosenkranz HS, Pet-Edwards J (1985) The carcinogenicity prediction and battery selection (CPBS) method: a bayesian approach. Mutat Res 153:135–166
Modi S, Hughes M, Garrow A, White A (2012) The value of in silico chemistry in the safety assessment of chemicals in the consumer goods and pharmaceutical industries. Drug Discov Today 17:135–142
Gleeson PM, Modi S, Bender A, Marchese-Robinson RL, Kirchmair J, Promkatkaew M, Hannongbua S, Glen RC (2012) Curr Pharm Des 18:1266–1291
Piegorsch WW, Zeiger E (1991) Measuring intra-assay agreement for the Ames Salmonella assay. In: Rienhoff O, Lindberg DAB (eds) Statistical methods in toxicology. Springer, Heidelberg, pp 35–41
Naven RT, Louise-May S, Greene N (2010) Expert Opin Drug Metab Toxicol 6:797–807
Sushko I, Novotarskyi S, Körner R, Pandey AK, Cherkasov A et al (2010) Applicability domains for classification problems: benchmarking of distance to models for ames mutagenicity set. J Chem Inf Model 50:2094–2111
Acknowledgments
Development of the dataset for this work was performed with help from Leadscope. The authors wish to acknowledge Dr Glenn Myatt (Leadscope) for his constant and generous help during the course of this work.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Modi, S., Li, J., Malcomber, S. et al. Integrated in silico approaches for the prediction of Ames test mutagenicity. J Comput Aided Mol Des 26, 1017–1033 (2012). https://doi.org/10.1007/s10822-012-9595-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10822-012-9595-5