Fast protein fold recognition and accurate sequence-structure alignment

  • Molecular Modeling
  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1278))

Abstract

We present two approaches to the sequence-structure alignment, or threading, problem: given an amino acid sequence and a protein structure, find the best mapping of sequence residues to structure positions with respect to some scoring system. Methods that solve this problem have two main applications: first, recognizing a plausible fold for a protein sequence of unknown structure in a database of representative protein structures and, second, computing accurate alignments by using structural information to improve on sequence alignments, which gives a better starting point for homology-based modeling.
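To make the problem statement concrete, here is a minimal sketch of sequence-to-structure alignment by ordinary dynamic programming. It is not the method of the paper: the two-state burial environments ("B" buried, "E" exposed), the hydrophobic residue set, the score values, the gap penalty, and the function names `position_score`, `thread`, and `rank_folds` are all assumptions chosen for illustration.

```python
# Toy threading: align a sequence to structure positions by dynamic programming.
# All scores below are hypothetical and chosen only for illustration.

HYDROPHOBIC = set("AVLIMFWC")  # assumed hydrophobic residue set


def position_score(residue, environment):
    """Reward hydrophobic residues at buried ("B") positions and
    non-hydrophobic residues at exposed ("E") positions."""
    fits = (environment == "B") == (residue in HYDROPHOBIC)
    return 2 if fits else -1


def thread(sequence, environments, gap=-2):
    """Global alignment of a sequence to a string of structure environments.

    Returns (best score, alignment). The alignment is a list of
    (sequence index, structure index) pairs; None marks a gap.
    """
    n, m = len(sequence), len(environments)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            match = position_score(sequence[i - 1], environments[j - 1])
            score[i][j] = max(
                score[i - 1][j - 1] + match,  # residue i placed at position j
                score[i - 1][j] + gap,        # residue i left unplaced
                score[i][j - 1] + gap,        # position j left unoccupied
            )

    # Trace back from the bottom-right corner to recover one optimal mapping.
    alignment = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + position_score(
            sequence[i - 1], environments[j - 1]
        ):
            alignment.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            alignment.append((i - 1, None))
            i -= 1
        else:
            alignment.append((None, j - 1))
            j -= 1
    alignment.reverse()
    return score[n][m], alignment


def rank_folds(sequence, library):
    """Fold recognition sketch: thread the sequence through every template
    in `library` (name -> environment string) and sort by raw score."""
    results = [(name, thread(sequence, env)[0]) for name, env in library.items()]
    return sorted(results, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    print(thread("LKL", "BEB"))
    print(rank_folds("LKL", {"template_a": "BEB", "template_b": "EBE"}))
```

The two functions mirror the two applications named above: `rank_folds` picks a plausible template from a library, and `thread` returns the residue-to-position mapping for one template. Raw scores from templates of different lengths are not directly comparable, so a realistic ranking would need some form of normalization. Scoring schemes with pairwise residue interactions also cannot be handled by this simple recurrence.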

We describe the application of these threading methods to a blind prediction of the structure of thymidine kinase (TK) of herpes simplex virus type I. In combination with standard alignment and alignment evaluation methods implemented in our software package ToPLign, we were able to identify a model structure and to build a quite accurate partial model of essential parts of the structure, including the active site.

Editor information

Ralf Hofestädt Thomas Lengauer Markus Löffler Dietmar Schomburg

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zimmer, R., Thiele, R. (1997). Fast protein fold recognition and accurate sequence-structure alignment. In: Hofestädt, R., Lengauer, T., Löffler, M., Schomburg, D. (eds) Bioinformatics. GCB 1996. Lecture Notes in Computer Science, vol 1278. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033212

  • DOI: https://doi.org/10.1007/BFb0033212

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63370-9

  • Online ISBN: 978-3-540-69524-0

  • eBook Packages: Springer Book Archive
