Abstract
With the rapid growth of public protein structure databases, computational techniques for storing as well as comparing proteins in an efficient manner are still in demand. Proteins play a major role in virtually all processes in life, and comparing their three-dimensional structures is essential to understanding the functional and evolutionary relationships between them.
In this study, a novel approach to compute three-dimensional protein structure alignments by means of so-called EigenRank score profiles is proposed. These scores are obtained by utilizing the LeaderRank algorithm—a vertex centrality indexing scheme originally introduced to infer the opinion leading role of individual actors in social networks. The obtained EigenRank representation of a given structure is not just highly-specific, but can also be used to compute profile alignments from which three-dimensional structure alignments can be rapidly deduced. This technique thus could provide a tool to rapidly scan entire databases containing thousands of structures.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The set of adjacency matrices can actually be used to derive one single matrix containing binned residue-residue distances. The underlying structure of the protein can be reconstructed from that matrix.
References
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990). https://doi.org/10.1016/S0022-2836(05)80360-2
Altschul, S.F., et al.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Bamakan, S.M.H., Nurgaliev, I., Qu, Q.: Opinion leader detection: a methodological review. Expert Syst. Appl. 115, 200–222 (2019). https://doi.org/10.1016/j.eswa.2018.07.069. http://www.sciencedirect.com/science/article/pii/S0957417418304950
wwPDB consortium: Protein Data Bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Research, October 2018. https://doi.org/10.1093/nar/gky949
Frank, K., Gruber, M., Sippl, M.J.: COPS benchmark: interactive analysis of database search methods. Bioinformatics 26, 574–575 (2010). https://doi.org/10.1093/bioinformatics/btp712. (Oxford, England)
Kabsch, W.: A solution for the best rotation to relate two sets of vectors. Acta Crystallogr. Sect. A 32(5), 922–923 (1976). https://doi.org/10.1107/S0567739476001873
Li, Q., Zhou, T., Lü, L., Chen, D.: Identifying influential spreaders by weighted LeaderRank. Phys. A Stat. Mech. Appl. 404(Supplement C), 47–55 (2014). https://doi.org/10.1016/j.physa.2014.02.041
Lü, L., Zhang, Y.C., Yeung, C.H., Zhou, T.: Leaders in social networks, the delicious case. PloS One 6(6), e21202 (2011). https://doi.org/10.1371/journal.pone.0021202
Mrozek, D.: Scalable Big Data Analytics for Protein Bioinformatics. CB, vol. 28. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98839-9
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48, 443–453 (1970)
Pearson, W.R., Lipman, D.J.: Improved tools for biological sequence comparison. Proc. Nat. Acad. Sci. U.S.A. 85, 2444–2448 (1988)
Prlic, A., et al.: BioJava: an open-source framework for bioinformatics in 2012. Bioinformatics 28, 2693–2695 (2012). https://doi.org/10.1093/bioinformatics/bts494. (Oxford, England)
Sadreyev, R.I., Grishin, N.V.: Accurate statistical model of comparison between multiple sequence alignments. Nucleic Acids Res. 36, 2240–2248 (2008). https://doi.org/10.1093/nar/gkn065
Schulz, G.E., Schirmer, R.H.: Principles of Protein Structure, 5th edn. Springer, New York (1984)
Shindyalov, I.N., Bourne, P.E.: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11, 739–747 (1998)
Smith, T., Waterman, M.: Identification of common molecular subsequences. J. Mol. Biol. 147(1), 195–197 (1981). https://doi.org/10.1016/0022-2836(81)90087-5. http://www.sciencedirect.com/science/article/pii/0022283681900875
Soeding, J.: Protein homology detection by HMM-HMM comparison. Bioinformatics 21(7), 951–960 (2005). https://doi.org/10.1093/bioinformatics/bti125
Spranger, M., Becker, S., Heinke, F., Siewerts, H., Labudde, D.: The infiltration game: artificial immune system for the exploitation of crime relevant information in social networks. In: Proceedings of Seventh International Conference on Advances in Information Management and Mining (IMMM), pp. 24–27. IARIA. ThinkMind Library (2017)
Suhrer, S.J., Wiederstein, M., Gruber, M., Sippl, M.J.: COPS - A novel workbench for explorations in fold space. Nucleic Acids Res. 37, W539–W544 (2009). https://doi.org/10.1093/nar/gkp411
Surade, S., Blundell, T.L.: Structural biology and drug discovery of difficult targets: the limits of ligandability. Chem. Biol. 19(1), 42–50 (2012). https://doi.org/10.1016/j.chembiol.2011.12.013
Ye, Y., Godzik, A.: Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19(Suppl 2), ii246–ii255 (2003). (Oxford, England)
Zemla, A.: LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 31(13), 3370–3374 (2003). https://doi.org/10.1093/nar/gkg571
Zhang, Y., Skolnick, J.: TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res, 33, 2302–2309 (2005). https://doi.org/10.1093/nar/gki524
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Heinke, F., Hempel, L., Labudde, D. (2019). A Novel Approach for Fast Protein Structure Comparison and Heuristic Structure Database Searching Based on Residue EigenRank Scores. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds) Beyond Databases, Architectures and Structures. Paving the Road to Smart Data Processing and Analysis. BDAS 2019. Communications in Computer and Information Science, vol 1018. Springer, Cham. https://doi.org/10.1007/978-3-030-19093-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-19093-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19092-7
Online ISBN: 978-3-030-19093-4
eBook Packages: Computer ScienceComputer Science (R0)