Abstract
This paper investigates the applicability of Particle Swarm Optimization (PSO) to motif matching in protein structures, which can help in protein structure classification and function annotation. A 3D motif is a spatial, local pattern in a protein structure important for its function. In this study, the problem of 3D motif matching is formulated as an optimization task with an objective function of minimizing the least Root Mean Square Deviation (lRMSD) between the query motif and target structures. Evaluation results on two protein datasets demonstrate the ability of the proposed approach on locating the true query motif of all 66 target proteins almost always (9 and 8 times, respectively, on average, out of 10 trials per target). A large-scale application of motif matching is protein classification, where the proposed approach distinguished between the positive and negative examples by consistently ranking all positive examples at the very top of the search results.
Chapter PDF
References
Illergård, K., Ardell, D.H., Elofsson, A.: Structure is three to ten times more conserved than sequence—A study of structural response in protein cores. Proteins: Structure, Function, and Bioinformatics 77, 499–508 (2009)
Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology 247, 536–540 (1995)
Brejova, B., Vinar, T., Li, M.: Pattern discovery: Methods and software. Introduction to Bioinformatics, 491–522 (2003)
Kleywegt, G.J.: Recognition of spatial motifs in protein structures. Journal of Molecular Biology 285, 1887–1897 (1999)
Watson, J.D., Laskowski, R.A., Thornton, J.M.: Predicting protein function from sequence and structural data. Current Opinion in Structural Biology 15, 275–284 (2005)
Shapiro, J., Brutlag, D.: FoldMiner: Structural motif discovery using an improved superposition algorithm. Protein Science 13, 278–294 (2004)
Shi, S., Zhong, Y., Majumdar, I., Krishna, S.S., Grishin, N.V.: Searching for three-dimensional secondary structural patterns in proteins with ProSMoS. Bioinformatics 23, 1331–1338 (2007)
Andreeva, A., Howorth, D., Brenner, S.E., Hubbard, T.J., Chothia, C., Murzin, A.G.: SCOP database in 2004: Refinements integrate structure and sequence family data. Nucleic Acids Research 32, D226-D229 (2004)
Lesk, A.M.: Systematic representation of protein folding patterns. Journal of Molecular Graphics 13, 159–164 (1995)
Pearl, F.M.G., Bennett, C., Bray, J.E., Harrison, A.P., Martin, N., Shepherd, A., Sillitoe, I., Thornton, J., Orengo, C.A.: The CATH database: An extended protein family resource for structural and functional genomics. Nucleic Acids Research 31, 452–455 (2003)
Hasegawa, H., Holm, L.: Advances and pitfalls of protein structural alignment. Current Opinion in Structural Biology 9, 341–348 (2009)
Odegard, J.E.: Molecular Distance Measures (2007), http://cnx.org/content/m11608/1.23/
Hendrickson, W.A.: Transformations to optimize the superposition of similar structures. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crystallography 35, 158–163 (1979)
Sali, A., Blundell, T.L.: Definition of general topological equivalence in protein structures: A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming. Journal of Molecular Biology 212, 403–428 (1990)
Szustakowski, J.D., Weng, Z.: Protein structure alignment using a genetic algorithm. Proteins: Structure, Function, and Bioinformatics 38, 428–440 (2000)
Pelta, D., González, J., Vega, M.M.: A simple and fast heuristic for protein structure comparison. BMC Bioinformatics 9, 161 (2008)
Parsopoulos, K.E., Vrahatis, M.N.: Particle swarm optimization and intelligence: Advances and applications. Information Science Reference Hershey (2010)
Clerc, M., Kennedy, J.: The particle swarm-explosion, stability, and convergence in a multidimensional complex space. IEEE Transactions on Evolutionary Computation 6, 58–73 (2002)
del Valle, Y., Venayagamoorthy, G.K., Mohagheghi, S., Hernandez, J.-C., Harley, R.G.: Particle swarm optimization: Basic concepts, variants and applications in power systems. IEEE Transactions on Evolutionary Computation 12, 171–195 (2008)
Kabsch, W.: A solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crystallography 32, 922–923 (1976)
Kabsch, W.: A discussion of the solution for the best rotation to relate two sets of vectors. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crystallography 34, 827–828 (1978)
Laity, J.H., Lee, B.M., Wright, P.E.: Zinc finger proteins: New insights into structural and functional diversity. Current opinion in Structural Biology 11, 39–46 (2001)
Ebert, J.C., Altman, R.B.: Robust recognition of zinc binding sites in proteins. Protein Science 17, 54–65 (2008)
Simpson, R.J., Lee, S.H.Y., Bartle, N., Sum, E.Y., Visvader, J.E., Matthews, J.M., Mackay, J.P., Crossley, M.: A classic zinc finger from friend of GATA mediates an interaction with the coiled-coil of transforming acidic coiled-coil 3. Journal of Biological Chemistry 279, 39789–39797 (2004)
Xu, S., Rahmat-Samii, Y.: Boundary conditions in particle swarm optimization revisited. IEEE Transactions on Antennas and Propagation 55, 760–765 (2007)
Täubig, H., Buchner, A., Griebsch, J.: PAST: Fast structure-based searching in the PDB. Nucleic Acids Research 34, W20–W23 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ahmed, H.R., Glasgow, J. (2014). A Novel Particle Swarm-Based Approach for 3D Motif Matching and Protein Structure Classification. In: Sokolova, M., van Beek, P. (eds) Advances in Artificial Intelligence. Canadian AI 2014. Lecture Notes in Computer Science(), vol 8436. Springer, Cham. https://doi.org/10.1007/978-3-319-06483-3_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-06483-3_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06482-6
Online ISBN: 978-3-319-06483-3
eBook Packages: Computer ScienceComputer Science (R0)