Skip to main content

A Parameterized Algorithm for Protein Structure Alignment

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 3909))

Abstract

This paper proposes a parameterized algorithm for aligning two protein structures, in the case where one protein structure is represented by a contact map graph and the other by a contact map graph or a distance matrix. If the sequential order of alignment is not required, the time complexity is polynomial in the protein size and exponential with respect to two parameters \(\frac{D_u}{D_l}\) and \(\frac{D_c}{D_l}\), which usually can be treated as constants. In particular, D u is the distance threshold determining if two residues are in contact or not, D c is the maximally allowed distance between two matched residues after two proteins are superimposed, and D l is the minimum inter-residue distance in a typical protein. This result indicates that if both \(\frac{D_u}{D_l}\) and \(\frac{D_c}{D_l}\) are small enough, then there is a polynomial-time approximation scheme for the non-sequential protein structure alignment problem. Empirically, both \(\frac{D_u}{D_l}\) and \(\frac{D_c}{D_l}\) are very small and can be treated as constants. This result clearly demonstrates that the hardness of the contact-map based protein structure alignment problem is related not to protein size but to several parameters, which depend on how the protein structure alignment problem is modeled. The result is achieved by decomposing the protein structure using tree decomposition and discretizing the rigid-body transformation space. We have implemented our algorithm and preliminary experimental results indicate that on a Linux PC, it takes from ten minutes to one hour to align two proteins with approximately 100 residues.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Comin, M., Guerra, C., Zanotti, G.: PROuST: a comparison method of three-dimensional structures of proteins using indexing techniques. Journal of Computational Biology 11(6), 1061–1072 (2004)

    Article  Google Scholar 

  2. Holm, L., Sander, C.: Protein structure comparison by alignment of distance matrices. Journal of Molecular Biology 233, 123–138 (1993)

    Article  Google Scholar 

  3. Alexandrov, N.N.: SARFing the PDB. Protein Engineering 9, 727–732 (1996)

    Article  Google Scholar 

  4. Gerstein, M., Levitt, M.: Using iterative dynamic programming to obtain accurate pairwise and multiple alignments of protein structures. In: Proceedings of International Conference on Intelligent Systems in Molecular Biology, pp. 59–67 (1996)

    Google Scholar 

  5. Singh, A.P., Brutlag, D.L.: Hierarchical protein structure superposition using both secondary structure and atomic representations. In: Proceedings of International Conference on Intelligent Systems in Molecular Biology, pp. 284–293 (1997)

    Google Scholar 

  6. Gibrat, J.F., Madej, T., Bryant, S.H.: Surprising similarities in structure comparison. Current Opinion in Structural Biology (6), 377–385 (1996)

    Google Scholar 

  7. Akutsu, T., Tashimo, H.: Protein structure comparison using representation by line segment sequences. In: Proceedings of Pacific Symposium on Biocomputing 1996 (PSB 1996), pp. 25–40 (1996)

    Google Scholar 

  8. Lancia, G., Carr, R., Walenz, B., Istrail, S.: 101 optimal PDB structure alignments: a branch-and-cut algorithm for the maximum contact map overlap problem. In: RECOMB 2001, pp. 193–202. ACM Press, New York (2001)

    Chapter  Google Scholar 

  9. Caprara, A., Lancia, G.: Structural alignment of large–size proteins via Lagrangian relaxation. In: RECOMB 2002, pp. 100–108. ACM Press, New York (2002)

    Chapter  Google Scholar 

  10. Lancia, G., Istrail, S.: Protein structure comparison: Algorithms and applications. In: Guerra, C., Istrail, S. (eds.) Mathematical Methods for Protein Structure Analysis and Design. LNCS (LNBI), vol. 2666, pp. 1–33. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  11. Lemmen, C., Lengauer, T.: Computational methods for the structural alignment of molecules. Journal of Computer-Aided Molecular Design 14, 215–232 (2000)

    Article  Google Scholar 

  12. Kolodny, R., Linial, N.: Approximate protein structural alignment in polynomial time. PNAS 101(33), 12201–12206 (2004)

    Article  Google Scholar 

  13. Goldman, D., Papadimitriou, C.H., Istrail, S.: Algorithmic aspects of protein structure similarity. In: FOCS 1999: Proceedings of the 40th Annual Symposium on Foundations of Computer Science, pp. 512–522. IEEE Computer Society, Los Alamitos (1999)

    Google Scholar 

  14. Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Engineering 11(9), 739–747 (1998)

    Article  Google Scholar 

  15. Verbitsky, O.: On the largest common subgraph problem (1994) (unpublished manuscript)

    Google Scholar 

  16. Jokisch, S., Müller, H.: Inter-point-distance-dependent approximate point set matching. Technical Report Research Report No. 653 (1997)

    Google Scholar 

  17. Yuan, X., Bystroff, C.: Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins. Bioinformatics 27, 1010–1019 (2005)

    Google Scholar 

  18. Dror, O., Benyamini, H., Nussinov, R., Wolfson, H.: MASS: Multiple structural alignment by secondary structures. Bioinformatics 19(Suppl. 1), 95–104 (2003)

    Article  Google Scholar 

  19. Zhu, J., Weng, Z.: FAST: A novel protein structure alignment algorithm. Proteins: Structure Function, and Bioinformatics (in Press, 2004)

    Google Scholar 

  20. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The protein data bank. Nucleic Acids Research 28, 235–242 (2000)

    Article  Google Scholar 

  21. Arun, K.S., Huang, T.S., Blostein, S.D.: Least-square fitting of two 3-d point sets. IEEE Trans. on Pattern Analysis and Machine Intelligence 9(5), 698–700 (1987)

    Article  Google Scholar 

  22. Robertson, N., Seymour, P.D.: Graph minors. II. algorithmic aspects of tree-width. Journal of Algorithms 7, 309–322 (1986)

    Article  MATH  MathSciNet  Google Scholar 

  23. Xu, J.: Rapid side-chain packing via tree decomposition. In: Miyano, S., Mesirov, J., Kasif, S., Istrail, S., Pevzner, P.A., Waterman, M. (eds.) RECOMB 2005. LNCS (LNBI), vol. 3500, pp. 423–439. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  24. Hao, M.H., Rackovsky, S., Liwo, A., Pincus, M.R., Scheraga, H.A.: Proc. Natl. Acad. Sci. USA 89, 6614–6618 (1992)

    Google Scholar 

  25. Caprara, A., Carr, R., Istrail, S., Lancia, G., Walenz, B.: 101 optimal PDB structure alignments: a branch-and-cut algorithm for the maximum contact map overlap problem. Journal of Computational Biology 11(1), 27–52 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, J., Jiao, F., Berger, B. (2006). A Parameterized Algorithm for Protein Structure Alignment. In: Apostolico, A., Guerra, C., Istrail, S., Pevzner, P.A., Waterman, M. (eds) Research in Computational Molecular Biology. RECOMB 2006. Lecture Notes in Computer Science(), vol 3909. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11732990_41

Download citation

  • DOI: https://doi.org/10.1007/11732990_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33295-4

  • Online ISBN: 978-3-540-33296-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics