Skip to main content

Protein Structural Blocks Representation and Search through Unsupervised NN

  • Conference paper
Artificial Neural Networks and Machine Learning – ICANN 2012 (ICANN 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7553))

Included in the following conference series:

  • 3151 Accesses

Abstract

This paper aims to develop methods for protein structural representation and to implement structural blocks retrieval into a macromolecule or even into the entire protein data-base (PDB). A first problem to deal with is that of representing structural blocks. A proposal is to exploit the Extended Gaussian Image (EGI) which maps on the unitary sphere the histogram of the orientations of the object surface. In fact, we propose to adopt a particular ’abstract’ data-structure named Protein Gaussian Image (PGI) for representing the orientation of the protein secondary structures (helices and sheets). The ’concrete’ data structure is the same as for the EGI, however, in this case the points of the Gaussian sphere surface do not contain the area of patches having that orientation but features of the secondary structures (SSs) having that direction. Among the features we may include the versus (e.g. + origin versus surface or - vice versa), the length of the structure (number of amino), biochemical properties, and even the sequence of the amino (such as in a list). We consider this representation very effective for a preliminary screening when searching on the PDB. We propose to employ the PGI in a fold recognition problem in which the learning task is performed by means an unsupervised method for structured data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Protein Data Bank, http://www.pdb.org/

  2. Kabsch, W., Sander, C.: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12), 2577–2637 (1983)

    Article  Google Scholar 

  3. David, E.: The discovery of the alfa-helix and beta-sheet, the principal structural features of proteins. Proceedings of the National Academy of Sciences USA, 100:11207–100:11210 (2003)

    Google Scholar 

  4. Cantoni, V., Ferone, A., Petrosino, A.: Protein Gaussian Image (PGI) - A Protein Structural Representation Based on the Spatial Attitude of Secondary Structure. New Tools and Methods for Pattern Recognition in Complex Biological Systems (in press)

    Google Scholar 

  5. Shulman-Peleg, A., Nussinov, R., Wolfson, H.: Recognition of Functional Sites in Protein Structures. J. Mol. Biol. 339, 607–633 (2004)

    Article  Google Scholar 

  6. Bock, M.E., Garutti, C., Guerra, C.: Spin image profile: a geometric descriptor for identifying and matching protein cavities. In: Proc. of CSB, San Diego (2007)

    Google Scholar 

  7. Frome, A., Huber, D., Kolluri, R., Bülow, T., Malik, J.: Recognizing Objects in Range Data Using Regional Point Descriptors. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 224–237. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  8. Glaser, F., Morris, R.J., Najmanovich, R.J., Laskowski, R.A., Thornton, J.M.: A Method for Localizing Ligand Binding Pockets in Protein Structures. PROTEINS: Structure, Function, and Bioinformatics 62, 479–488 (2006)

    Article  Google Scholar 

  9. Horn, B.K.P.: Extended Gaussian images. Proc. IEEE 72(12), 1671–1686 (1984)

    Article  Google Scholar 

  10. Kang, S.B., Ikeuchi, K.: The complex EGI: a new representation for 3-D pose determination. In: IEEE-T-PAMI, pp. 707–721 (1993)

    Google Scholar 

  11. Shum, H., Hebert, M., Ikeuchi, K.: On 3D shape similarity. In: Proceedings of the IEEE-CVPR 1996, pp. 526–531 (1996)

    Google Scholar 

  12. Kang, S., Ikeuchi, K.: The complex EGI, a new representation for 3D pose determination. IEEE Trans. Pattern Anal. Mach. Intell. 15(7), 707–721 (1993)

    Article  Google Scholar 

  13. Murzin, A.G., Brenner, S.E., Hubbard, T., Chothia, C.: SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995)

    Google Scholar 

  14. Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., Thornton, J.M.: CATH–a hierarchic classification of protein domain structures. Structure 5(8), 1093–1108 (1997)

    Article  Google Scholar 

  15. Hagenbuchner, M., Sperduti, A., Tsoi, A.H.: A self-organizing map for adaptive processing of structured data. IEEE Transactions on Neural Networks 14(3), 491–505 (2003)

    Article  Google Scholar 

  16. Sperduti, A.: Knowledge-Base Neurocomputing. In: Cloete, I., Zurada, J.M. (eds.) pp. 117–152. MIT Press, Cambridge (2000)

    Google Scholar 

  17. Kohonen, T.: Self–Organization andAssociative Memory, 3rd edn. Springer, New York (1990)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cantoni, V., Ferone, A., Ozbudak, O., Petrosino, A. (2012). Protein Structural Blocks Representation and Search through Unsupervised NN. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_64

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33266-1_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33265-4

  • Online ISBN: 978-3-642-33266-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics