Abstract
Predicting the functions of the proteins from their structure is an active area of interest. The current trends of the secondary structure representation use direct letter representation of the specific secondary structure element of every amino acid in the linear sequence. Using graph representation to represent the protein sequence provides additional information about the structural relationships within the amino acid sequence. This study outlines the protein secondary structure with a novel approach of representing the proteins using protein secondary structure graph where nodes are amino acids from the protein sequence, and the edges denote the peptide and hydrogen bonds that construct the secondary structure. The developed model for protein function prediction Structure2Function operates on these graphs with a defined variant of the present idea from deep learning on non-Euclidian graph-structure data, the Graph Convolutional Networks (GCNs).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Al-Lazikani, B., Jung, J., Xiang, Z., Honig, B.: Protein structure prediction. Curr. Opin. Chem. Biol. 5(1), 51–56 (2001)
Altschul, S.F., et al.: Gapped blast and psi-blast: a new generation of protein database search programs. Nucleic Acids Res. 25(17), 3389–3402 (1997)
Apweiler, R., et al.: Uniprot: the universal protein knowledgebase. Nucleic Acids Res. 32, D115–D119 (2004)
Ashburner, M., et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25 (2000)
Berman, H.M., et al.: The protein data bank. Nucleic Acids Res. 28(1), 235–242 (2000)
Bronstein, M.M., Bruna, J., LeCun, Y., Szlam, A., Vandergheynst, P.: Geometric deep learning: going beyond euclidean data. IEEE Signal Process. Mag. 34(4), 18–42 (2017)
Chen, C., Huang, H., Wu, C.H.: Protein bioinformatics databases and resources. In: Wu, C.H., Arighi, C.N., Ross, K.E. (eds.) Protein Bioinformatics. MMB, vol. 1558, pp. 3–39. Springer, New York (2017). https://doi.org/10.1007/978-1-4939-6783-4_1
Clark, W.T., Radivojac, P.: Analysis of protein function and its prediction from amino acid sequence. Proteins Struct. Funct. Bioinf. 79(7), 2086–2096 (2011)
Gene Ontology Consortium: The gene ontology (GO) database and informatics resource. Nucleic Acids Res. 32(Suppl. 1), D258–D261 (2004)
Crooks, G.E., Brenner, S.E.: Protein secondary structure: entropy, correlations and prediction. Bioinformatics 20(10), 1603–1611 (2004)
Dai, H., Dai, B., Song, L.: Discriminative embeddings of latent variable models for structured data. In: International Conference on Machine Learning, pp. 2702–2711 (2016)
Duvenaud, D.K., et al.: Convolutional networks on graphs for learning molecular fingerprints. In: Advances in Neural Information Processing Systems, pp. 2224–2232 (2015)
Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. In: Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 1263–1272. JMLR. org (2017)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Jiang, Q., Jin, X., Lee, S.J., Yao, S.: Protein secondary structure prediction: a survey of the state of the art. J. Mol. Graph. Model. 76, 379–402 (2017)
Jiang, Y., et al.: An expanded evaluation of protein function prediction methods shows an improvement in accuracy. Genome Biol. 17(1), 184 (2016)
Kabsch, W., Sander, C.: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12), 2577–2637 (1983)
Kihara, D.: Protein Function Prediction: Methods and Protocols. Humana Press, Totowa (2017)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. CoRR abs/1412.6980 (2014)
Pearson, W.R.: Protein function prediction: problems and pitfalls. Curr. Protoc. Bioinform. 51(1), 4–12 (2015)
Radivojac, P., et al.: A large-scale evaluation of computational protein function prediction. Nat. Methods 10(3), 221 (2013)
Reddi, S.J., Kale, S., Kumar, S.: On the convergence of adam and beyond. In: Proceedings of the International Conference on Learning Representations (2018)
Rost, B.: Protein secondary structure prediction continues to rise. J. Struct. Biol. 134(2–3), 204–218 (2001)
Yang, Y., et al.: Sixty-five years of the long march in protein secondary structure prediction: the final stretch? Briefings Bioinform. 19(3), 482–494 (2016)
Zhang, M., Cui, Z., Neumann, M., Chen, Y.: An end-to-end deep learning architecture for graph classification. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Acknowledgements
This work was partially financed by the Faculty of Computer Science and Engineering at the Ss. Cyril and Methodius University, Skopje, North Macedonia. The computational resources used for this research were kindly provided by MAGIX.AI and the NVIDIA Corporation (a donation of a Titan V GPU to Eftim Zdravevski).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Stojanovska, F., Ackovska, N. (2019). Protein Secondary Structure Graphs as Predictors for Protein Function. In: Gievska, S., Madjarov, G. (eds) ICT Innovations 2019. Big Data Processing and Mining. ICT Innovations 2019. Communications in Computer and Information Science, vol 1110. Springer, Cham. https://doi.org/10.1007/978-3-030-33110-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-33110-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33109-2
Online ISBN: 978-3-030-33110-8
eBook Packages: Computer ScienceComputer Science (R0)