Skip to main content

A Novel Approach for Fast Protein Structure Comparison and Heuristic Structure Database Searching Based on Residue EigenRank Scores

  • Conference paper
  • First Online:
  • 723 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1018))

Abstract

With the rapid growth of public protein structure databases, computational techniques for storing as well as comparing proteins in an efficient manner are still in demand. Proteins play a major role in virtually all processes in life, and comparing their three-dimensional structures is essential to understanding the functional and evolutionary relationships between them.

In this study, a novel approach to compute three-dimensional protein structure alignments by means of so-called EigenRank score profiles is proposed. These scores are obtained by utilizing the LeaderRank algorithm—a vertex centrality indexing scheme originally introduced to infer the opinion leading role of individual actors in social networks. The obtained EigenRank representation of a given structure is not just highly-specific, but can also be used to compute profile alignments from which three-dimensional structure alignments can be rapidly deduced. This technique thus could provide a tool to rapidly scan entire databases containing thousands of structures.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    The set of adjacency matrices can actually be used to derive one single matrix containing binned residue-residue distances. The underlying structure of the protein can be reconstructed from that matrix.

References

  1. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990). https://doi.org/10.1016/S0022-2836(05)80360-2

    Article  Google Scholar 

  2. Altschul, S.F., et al.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)

    Article  Google Scholar 

  3. Bamakan, S.M.H., Nurgaliev, I., Qu, Q.: Opinion leader detection: a methodological review. Expert Syst. Appl. 115, 200–222 (2019). https://doi.org/10.1016/j.eswa.2018.07.069. http://www.sciencedirect.com/science/article/pii/S0957417418304950

    Article  Google Scholar 

  4. wwPDB consortium: Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Research, October 2018. https://doi.org/10.1093/nar/gky949

  5. Frank, K., Gruber, M., Sippl, M.J.: COPS benchmark: interactive analysis of database search methods. Bioinformatics 26, 574–575 (2010). https://doi.org/10.1093/bioinformatics/btp712. (Oxford, England)

    Article  Google Scholar 

  6. Kabsch, W.: A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. Sect. A 32(5), 922–923 (1976). https://doi.org/10.1107/S0567739476001873

    Article  Google Scholar 

  7. Li, Q., Zhou, T., Lü, L., Chen, D.: Identifying influential spreaders by weighted LeaderRank. Phys. A Stat. Mech. Appl. 404(Supplement C), 47–55 (2014). https://doi.org/10.1016/j.physa.2014.02.041

    Article  MathSciNet  MATH  Google Scholar 

  8. Lü, L., Zhang, Y.C., Yeung, C.H., Zhou, T.: Leaders in social networks, the delicious case. PloS One 6(6), e21202 (2011). https://doi.org/10.1371/journal.pone.0021202

    Article  Google Scholar 

  9. Mrozek, D.: Scalable Big Data Analytics for Protein Bioinformatics. CB, vol. 28. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98839-9

    Book  MATH  Google Scholar 

  10. Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48, 443–453 (1970)

    Article  Google Scholar 

  11. Pearson, W.R., Lipman, D.J.: Improved tools for biological sequence comparison. Proc. Nat. Acad. Sci. U.S.A. 85, 2444–2448 (1988)

    Article  Google Scholar 

  12. Prlic, A., et al.: BioJava: an open-source framework for bioinformatics in 2012. Bioinformatics 28, 2693–2695 (2012). https://doi.org/10.1093/bioinformatics/bts494. (Oxford, England)

    Article  Google Scholar 

  13. Sadreyev, R.I., Grishin, N.V.: Accurate statistical model of comparison between multiple sequence alignments. Nucleic Acids Res. 36, 2240–2248 (2008). https://doi.org/10.1093/nar/gkn065

    Article  Google Scholar 

  14. Schulz, G.E., Schirmer, R.H.: Principles of Protein Structure, 5th edn. Springer, New York (1984)

    Google Scholar 

  15. Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11, 739–747 (1998)

    Article  Google Scholar 

  16. Smith, T., Waterman, M.: Identification of common molecular subsequences. J. Mol. Biol. 147(1), 195–197 (1981). https://doi.org/10.1016/0022-2836(81)90087-5. http://www.sciencedirect.com/science/article/pii/0022283681900875

    Article  Google Scholar 

  17. Soeding, J.: Protein homology detection by HMM-HMM comparison. Bioinformatics 21(7), 951–960 (2005). https://doi.org/10.1093/bioinformatics/bti125

    Article  Google Scholar 

  18. Spranger, M., Becker, S., Heinke, F., Siewerts, H., Labudde, D.: The infiltration game: artificial immune system for the exploitation of crime relevant information in social networks. In: Proceedings of Seventh International Conference on Advances in Information Management and Mining (IMMM), pp. 24–27. IARIA. ThinkMind Library (2017)

    Google Scholar 

  19. Suhrer, S.J., Wiederstein, M., Gruber, M., Sippl, M.J.: COPS - A novel workbench for explorations in fold space. Nucleic Acids Res. 37, W539–W544 (2009). https://doi.org/10.1093/nar/gkp411

    Article  Google Scholar 

  20. Surade, S., Blundell, T.L.: Structural biology and drug discovery of difficult targets: the limits of ligandability. Chem. Biol. 19(1), 42–50 (2012). https://doi.org/10.1016/j.chembiol.2011.12.013

    Article  Google Scholar 

  21. Ye, Y., Godzik, A.: Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19(Suppl 2), ii246–ii255 (2003). (Oxford, England)

    Article  Google Scholar 

  22. Zemla, A.: LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 31(13), 3370–3374 (2003). https://doi.org/10.1093/nar/gkg571

    Article  Google Scholar 

  23. Zhang, Y., Skolnick, J.: TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res, 33, 2302–2309 (2005). https://doi.org/10.1093/nar/gki524

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Florian Heinke .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Heinke, F., Hempel, L., Labudde, D. (2019). A Novel Approach for Fast Protein Structure Comparison and Heuristic Structure Database Searching Based on Residue EigenRank Scores. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Paving the Road to Smart Data Processing and Analysis. BDAS 2019. Communications in Computer and Information Science, vol 1018. Springer, Cham. https://doi.org/10.1007/978-3-030-19093-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-19093-4_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-19092-7

  • Online ISBN: 978-3-030-19093-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics