Abstract
This paper aims to develop methods for protein structural representation and to implement structural blocks retrieval into a macromolecule or even into the entire protein data-base (PDB). A first problem to deal with is that of representing structural blocks. A proposal is to exploit the Extended Gaussian Image (EGI) which maps on the unitary sphere the histogram of the orientations of the object surface. In fact, we propose to adopt a particular ’abstract’ data-structure named Protein Gaussian Image (PGI) for representing the orientation of the protein secondary structures (helices and sheets). The ’concrete’ data structure is the same as for the EGI, however, in this case the points of the Gaussian sphere surface do not contain the area of patches having that orientation but features of the secondary structures (SSs) having that direction. Among the features we may include the versus (e.g. + origin versus surface or - vice versa), the length of the structure (number of amino), biochemical properties, and even the sequence of the amino (such as in a list). We consider this representation very effective for a preliminary screening when searching on the PDB. We propose to employ the PGI in a fold recognition problem in which the learning task is performed by means an unsupervised method for structured data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Protein Data Bank, http://www.pdb.org/
Kabsch, W., Sander, C.: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12), 2577–2637 (1983)
David, E.: The discovery of the alfa-helix and beta-sheet, the principal structural features of proteins. Proceedings of the National Academy of Sciences USA, 100:11207–100:11210 (2003)
Cantoni, V., Ferone, A., Petrosino, A.: Protein Gaussian Image (PGI) - A Protein Structural Representation Based on the Spatial Attitude of Secondary Structure. New Tools and Methods for Pattern Recognition in Complex Biological Systems (in press)
Shulman-Peleg, A., Nussinov, R., Wolfson, H.: Recognition of Functional Sites in Protein Structures. J. Mol. Biol. 339, 607–633 (2004)
Bock, M.E., Garutti, C., Guerra, C.: Spin image profile: a geometric descriptor for identifying and matching protein cavities. In: Proc. of CSB, San Diego (2007)
Frome, A., Huber, D., Kolluri, R., Bülow, T., Malik, J.: Recognizing Objects in Range Data Using Regional Point Descriptors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 224–237. Springer, Heidelberg (2004)
Glaser, F., Morris, R.J., Najmanovich, R.J., Laskowski, R.A., Thornton, J.M.: A Method for Localizing Ligand Binding Pockets in Protein Structures. PROTEINS: Structure, Function, and Bioinformatics 62, 479–488 (2006)
Horn, B.K.P.: Extended Gaussian images. Proc. IEEE 72(12), 1671–1686 (1984)
Kang, S.B., Ikeuchi, K.: The complex EGI: a new representation for 3-D pose determination. In: IEEE-T-PAMI, pp. 707–721 (1993)
Shum, H., Hebert, M., Ikeuchi, K.: On 3D shape similarity. In: Proceedings of the IEEE-CVPR 1996, pp. 526–531 (1996)
Kang, S., Ikeuchi, K.: The complex EGI, a new representation for 3D pose determination. IEEE Trans. Pattern Anal. Mach. Intell. 15(7), 707–721 (1993)
Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995)
Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., Thornton, J.M.: CATH–a hierarchic classification of protein domain structures. Structure 5(8), 1093–1108 (1997)
Hagenbuchner, M., Sperduti, A., Tsoi, A.H.: A self-organizing map for adaptive processing of structured data. IEEE Transactions on Neural Networks 14(3), 491–505 (2003)
Sperduti, A.: Knowledge-Base Neurocomputing. In: Cloete, I., Zurada, J.M. (eds.) pp. 117–152. MIT Press, Cambridge (2000)
Kohonen, T.: Self–Organization andAssociative Memory, 3rd edn. Springer, New York (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cantoni, V., Ferone, A., Ozbudak, O., Petrosino, A. (2012). Protein Structural Blocks Representation and Search through Unsupervised NN. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_64
Download citation
DOI: https://doi.org/10.1007/978-3-642-33266-1_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33265-4
Online ISBN: 978-3-642-33266-1
eBook Packages: Computer ScienceComputer Science (R0)