ABSTRACT
Active sites are regions in the enzyme surface designed to interact with other molecules. Given their importance to enzyme function, active site amino acids are more conserved during evolution than the whole sequence, and can be a useful source of information for function prediction. For this reason, great effort has been put into identifying active sites in proteins. The majority of methods for this purpose uses an active site template of a protein of known function to search for similar structures into proteins of unknown function. In this direction, we recently proposed GASS (Genetic Active Site Search), a method based on an evolutionary algorithm to search for active sites in proteins. Although the method obtained very accurate results, its main strength and weakness are related to using only the spatial distance from the template to the protein to evaluate candidate sites. In this direction, this paper proposes MeGASS, a multi-objective version of GASS that also considers the depth of the residues when looking for active sites. This is important, as active sites are known for being closer to the protein surface to allow interactions with ligands. Results showed the depth attribute improves over the results of GASS, and its role into the method is worth further investigation.
- H. Berman, J. Westbrook, Z. Feng, G. Gilliland, T. Bhat, H. Weissig, I. Shindyalov, and P. Bourne. The protein data bank. Nucleic acids research, 28(1):235, 2000.Google Scholar
- M. Brylinski and J. Skolnick. A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation. Proc. of the National Academy of Sciences of the USA, 105(1):129--134, 2008.Google ScholarCross Ref
- T. G. Cassarino, L. Bordoli, and T. Schwede. Assessment of ligand binding site predictions in CASP 10. Proteins, 82((Suppl 2)):154--163, 2014.Google Scholar
- S. Chakravarty and R. Varadarajan. Residue depth: a novel parameter for the analysis of protein structure and stability. Structure with Folding & Design, 7(7):723--732, Jul 15 1999.Google ScholarCross Ref
- F. L. Custodio, H. J. Barbosa, and L. E. Dardenne. A multiple minima genetic algorithm for protein structure prediction. Applied Soft Computing, 15(0):88 -- 99, 2014. Google ScholarDigital Library
- K. Deb and D. Kalyanmoy. Multi-Objective Optimization Using Evolutionary Algorithms. John Wiley & Sons, Inc., New York, NY, USA, 2001. Google ScholarDigital Library
- T. Fober, M. Mernberger, G. Klebe, and E. Hüllermeier. Evolutionary construction of multiple graph alignments for the structural analysis of biomolecules. Bioinformatics, 25(16):i2110--i2117, 2009. Google ScholarDigital Library
- G. B. Fogel and D. W. Corne, editors. Evolutionary Computation in Bioinformatics. Morgan Kaufmann, 2002.Google Scholar
- S. Henikoff and J. G. Henikoff. Amino-acid Substitution Matrices from Protein Blocks. Proc. of The National Academy of Sciences of The USA, 89(22):10915--10919, Nov 15 1992.Google ScholarCross Ref
- S. C. Izidoro, R. C. de Melo-Minardi, and G. L. Pappa. GASS: identifying enzyme active sites with genetic algorithms. Bioinformatics, 2014.Google Scholar
- T. Kato and N. Nagano. Metric learning for enzyme active-site search. Bioinformatics, 26(21):2698--2704, 2010. Google ScholarDigital Library
- R. A. Laskowski, J. D. Watson, and J. M. Thornton. Protein function prediction using local 3D templates. Journal of Molecular Biology, 351:614--626, 2005.Google ScholarCross Ref
- B. Lee and F. M. Richards. Interpretation of Protein Structures - Estimation of Static Accessibility. Journal of Molecular Biology, 55(3):379--&, 1971.Google ScholarCross Ref
- F. C. Lightstone, S. E. Wong, D. A. Kirshner, and J. P. Nilmeier. Rapid Catalytic Template Searching as an Enzyme Function Prediction Procedure. PLoS ONE, 8(5):1--17, may 2013.Google Scholar
- G. Lopez, P. Maietta, J. M. Rodriguez, A. Valencia, and M. L. Tress. firestar-advances in the prediction of functionally important residues. Nucleic Acids Research, 39(2):W235--W241, Jul 2011.Google ScholarCross Ref
- A. Marhaman and J. M. Thornton. Methods to Characterize the Structure of Enzyme Binding Sites. In T. Schwede and M. Peitsch, editors, Computational Structural Biology - Methods and Applications, pages 189--221. World Scientific Publishing, 2008.Google Scholar
- N. Nadzirin, E. J. Gardiner, P. Willett, P. J. Artymiuk, and M. Firdaus-Raih. SPRITE and ASSAM: web servers for side chain 3D-motif searching in protein structures. Nucleic Acids Res., 40:W380--W386, May 2012.Google ScholarCross Ref
- S. Pal, S. Bandyopadhyay, and S. Ray. Evolutionary computation in bioinformatics: a review. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, 36(5):601--615, 2006. Google ScholarDigital Library
- C. T. Porter, G. J. Bartlett, and J. M. Thornton. The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Nucleic Acids Res., 32:D129--D133, 2004.Google ScholarCross Ref
- A. Stark and R. B. Russell. Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures. Nucleic Acids Res., 31(13):3341--3344, 2003.Google ScholarCross Ref
- J. W. Torrance and J. M. Thornton. Structure-based Prediction of Enzymes and Their Active Sites. Wiley, 2009.Google Scholar
- A. C. Wallace, N. Borkakoti, and J. M. Thornton. Tess: A geometric hashing algorithm for deriving 3d coordinate templates for searching structural databases application to enzyme active sites. Protein Sci., 6:2308--2323, 1997.Google ScholarCross Ref
- J. C. Whisstock and A. M. Lesk. Prediction of protein function from protein sequence and structure. Q. Rev. Biophys, 36:307--340, 2003.Google ScholarCross Ref
Index Terms
- MeGASS: Multi-Objective Genetic Active Site Search
Recommendations
Research article: Heuristic-based tabu search algorithm for folding two-dimensional AB off-lattice model proteins
The protein structure prediction problem is a classical NP hard problem in bioinformatics. The lack of an effective global optimization method is the key obstacle in solving this problem. As one of the global optimization algorithms, tabu search (TS) ...
AffinityVAE: A multi-objective model for protein-ligand affinity prediction and drug design
AbstractIn the prediction of protein-ligand affinity, the traditional methods require a large amount of computing resources, and have certain limitations in predicting and simulating the structural changes. Although employing data-driven approaches can ...
Graphical abstractDisplay Omitted
Highlights- An affinity prediction and drug generation multi-objective deep learning model was developed.
- An innovative protein-ligand feature interaction module has been proposed to capture the interaction information.
- Adaptive autoencoders ...
Genetic Threading
The biological function of proteins is dependent, to a large extent, on their native three dimensional conformation. Thus, it is important to know the structure of as many proteins as possible. Since experimental methods for structure determination are ...
Comments