Skip to main content

GCNSP: A Novel Prediction Method of Self-Interacting Proteins Based on Graph Convolutional Networks

  • Conference paper
  • First Online:
Intelligent Computing Theories and Application (ICIC 2020)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12464))

Included in the following conference series:

Abstract

As an essential protein interaction, self-interacting proteins (SIPs) plays a vital role in biological processes. Identifying and confirming SIPs is of great significance for the exploration of new gene functions, protein function research and proteomics research. Although a large number of SIPs have been confirmed with the rapid development of high-throughput technology, the biological experimental method is still limited by blindness and high cost, and has a high false-positive rate. Therefore, the use of computational techniques to accurately and efficiently predict SIPs has become an urgent need. In this study, a novel SIPs prediction method GCNSP based on Graph Convolutional Networks (GCN) is proposed. Firstly, the evolution information of protein is described by Position-Specific Scoring Matrix (PSSM). Then the feature information is extracted by GCN, and finally fed into Random Forest (RF) classifier for accurate classification. In the five-fold cross-validation on Human and Yeast data sets, GCNSP achieved 93.65% and 90.69% prediction accuracy with 99.64% and 99.08% specificity, respectively. In comparison with different classifier models and other existing methods, GCNSP shows strong competitiveness. The excellent results show that the proposed method is very suitable for SIPs prediction and can provide highly reliable candidates for biological experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Giot, L., et al.: A protein interaction map of Drosophila melanogaster. Science 302, 1727–1736 (2003)

    Article  Google Scholar 

  2. Li, S., et al.: A map of the interactome network of the metazoan C. elegans. Science 303, 540–543 (2004)

    Article  Google Scholar 

  3. Gavin, A.C., et al.: Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141–147 (2002)

    Article  Google Scholar 

  4. Pereira-Leal, J.B., et al.: Evolution of protein complexes by duplication of homometric interactions. Genome Biol. 8, 51 (2007). https://doi.org/10.1186/gb-2007-8-4-r51

    Article  Google Scholar 

  5. Ispolatov, I., et al.: Binding properties and evolution of homodimers in protein-protein interaction networks. Nucleic Acids Res. 33, 3629–3635 (2005)

    Article  Google Scholar 

  6. Wang, Y.-B., et al.: Predicting protein–protein interactions from protein sequences by a stacked sparse autoencoder deep neural network. Mol. BioSyst. 13, 1336–1344 (2017)

    Article  Google Scholar 

  7. Wang, L., et al.: Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier. J. Theor. Biol. 418, 105–110 (2017)

    Article  MathSciNet  Google Scholar 

  8. Wang, L., et al.: An ensemble approach for large-scale identification of protein-protein interactions using the alignments of multiple sequences. Oncotarget 8, 5149 (2017)

    Article  Google Scholar 

  9. Zhu, L., You, Z.H., Huang, D.S.: Increasing the reliability of protein–protein interaction networks via non-convex semantic embedding. Neurocomputing 121, 99–107 (2013)

    Article  Google Scholar 

  10. You, Z.H., et al.: Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data. Bioinformatics 26, 2744–2751 (2010)

    Article  Google Scholar 

  11. Wang, L., et al.: Predicting Protein-Protein Interactions from Matrix-Based Protein Sequence Using Convolution Neural Network and Feature-Selective Rotation Forest. Sci. Rep. 9, 9848 (2019)

    Article  Google Scholar 

  12. You, Z.-H., et al.: Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis. BMC Bioinf. 14, S10 (2013). https://doi.org/10.1186/1471-2105-14-S8-S10

    Article  Google Scholar 

  13. Jia, J., et al.: iPPI-PseAAC (CGR): Identify protein-protein interactions by incorporating chaos game representation into PseAAC. J. Theor. Biol. 460, 195–203 (2019)

    Article  Google Scholar 

  14. Wang, Y., et al.: A high efficient biological language model for predicting protein–protein interactions. Cells 8, 122 (2019)

    Article  Google Scholar 

  15. Bateman, A., et al.: UniProt: a hub for protein information. Nucleic Acids Res. 43, D204–D212 (2015)

    Article  Google Scholar 

  16. Breuer, K., et al.: InnateDB: systems biology of innate immunity and beyond-recent updates and continuing curation. Nucleic Acids Res. 41, D1228–D1233 (2013)

    Article  Google Scholar 

  17. Chatr-aryamontri, A., et al.: The BioGRID interaction database: 2015 update. Nucleic Acids Res. 43, D470–D478 (2015)

    Article  Google Scholar 

  18. Salwinski, L., Miller, C.S., Smith, A.J., et al.: The database of interacting proteins: 2004 update. Nucleic Acids Res. 32, D449–D451 (2004)

    Article  Google Scholar 

  19. Launay, G., et al.: MatrixDB, the extracellular matrix interaction database: updated content, a new navigator and expanded functionalities. Nucleic Acids Res. 43, D321–D327 (2015)

    Article  Google Scholar 

  20. Gribskov, M., McLachlan, A.D., Eisenberg, D.: Profile analysis: detection of distantly related proteins. Proc. Natl. Acad. Sci. U.S.A. 84, 4355–4358 (1987)

    Article  Google Scholar 

  21. Zheng, K., et al.: Dbmda: A unified embedding for sequence-based mirna similarity measure with applications to predict and validate mirna-disease associations. Molecular Therapy-Nucleic Acids 19, 602–611 (2020)

    Article  Google Scholar 

  22. Deng, S.-P., Zhu, L., Huang, D.-S.: Predicting hub genes associated with cervical cancer through gene co-expression networks. IEEE/ACM Trans. Comput. Biol. Bioinf. 13, 27–35 (2015)

    Article  Google Scholar 

  23. Wang, L., et al.: Computational methods for the prediction of drug-target interactions from drug fingerprints and protein sequences by stacked auto-encoder deep neural network. In: Cai, Z., Daescu, O., Li, M. (eds.) ISBRA 2017. LNCS, vol. 10330, pp. 46–58. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59575-7_5

    Chapter  Google Scholar 

  24. Wang, L., et al.: Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method. J. Theor. Biol. 461, 230–238 (2019)

    Article  Google Scholar 

  25. Chen, Z.-H., et al.: Identification of self-interacting proteins by integrating random projection classifier and finite impulse response filter. Bmc Genomics 20, 1 (2019)

    Article  Google Scholar 

  26. Jie, C., Ma, T., Cao, X.: FastGCN: Fast learning with graph convolutional networks via importance sampling. In: International Conference on Learning Representations (2018)

    Google Scholar 

  27. Wang, L., et al.: An efficient approach based on multi-sources information to predict circrna-disease associations using deep convolutional neural network. Bioinformatics 36, 4038–4046 (2019)

    Article  Google Scholar 

  28. You, Z-H., et al.: Large-scale protein-protein interactions detection by integrating big biosensing data with computational model. Biomed. Res. Int (2014

    Google Scholar 

  29. Zheng, K., Wang, L., You, Z.-H.: CGMDA: an approach to predict and validate microRNA-disease associations by utilizing chaos game representation and LightGBM. IEEE Access 7, 133314–133323 (2019)

    Article  Google Scholar 

  30. Wang, L., et al.: GCNCDA: A new method for predicting circRNA-disease associations based on Graph Convolutional Network Algorithm. PLoS Comput. Biol. 16, e1007568 (2020)

    Article  Google Scholar 

  31. Zheng, K., et al.: iCDA-CGR: identification of circrna-disease associations based on chaos game representation. PLoS Comput. Biol. 16, e1007872 (2020)

    Article  Google Scholar 

  32. Chen, Z.-H., et al.: An improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation. Front. Genet. 10, 90 (2019)

    Article  Google Scholar 

  33. Chen, Z.-H., et al.: Prediction of self-interacting proteins from protein sequence information based on random projection model and fast fourier transform. Int. J. Mol. Sci. 20, 930 (2019)

    Article  Google Scholar 

  34. Wang, L., et al.: Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions. Sci. Rep. 10, 1–11 (2020)

    Article  Google Scholar 

  35. Wang, M-N., et al.: LDGRNMF: LncRNA-Disease Associations Prediction based on Graph Regularized Non-Negative Matrix Factorization. Neurocomputing (2020)

    Google Scholar 

  36. Xia, J.-F., Han, K., Huang, D.-S.: Sequence-based prediction of protein-protein interactions by means of rotation forest and autocorrelation descriptor. Protein Pept. Lett. 17, 137–145 (2010)

    Article  Google Scholar 

  37. You, Z.-H., et al.: A MapReduce based parallel SVM for large-scale predicting protein–protein interactions. Neurocomputing 145, 37–43 (2014)

    Article  Google Scholar 

  38. Wang, L., et al.: Combining high speed elm learning with a deep convolutional neural network feature encoding for predicting protein-RNA interactions. IEEE/ACM Trans. Comput. Biol. Bioinf. 1, 1 (2018)

    Article  Google Scholar 

  39. Zheng, K., et al.: MLMDA: a machine learning approach to predict and validate MicroRNA–disease associations by integrating of heterogenous information sources. J. Translational Med. 17, 260 (2019)

    Article  Google Scholar 

  40. Li, J.Q., et al.: PSPEL: In silico prediction of self-interacting proteins from amino acids sequences using ensemble learning. IEEE/ACM Trans. Comput. Biol. Bioinf. pp. 1 (2016)

    Google Scholar 

  41. Liu, X., Yang, S., Li, C., Zhang, Z., Song, J.: SPAR: a random forest-based predictor for self-interacting proteins with fine-grained domain information. Amino Acids 48(7), 1655–1665 (2016). https://doi.org/10.1007/s00726-016-2226-z

    Article  Google Scholar 

  42. Liu, Z., et al.: Proteome-wide prediction of self-interacting proteins based on multiple properties. Mol. Cell. Proteomics 12, 1689–1700 (2013)

    Article  Google Scholar 

  43. Zahiri, J., et al.: PPIevo: Protein-protein interaction prediction from PSSM based evolutionary information. Genomics 102, 237–242 (2013)

    Article  Google Scholar 

  44. Zahiri, J., et al.: LocFuse: human protein–protein interaction prediction via classifier fusion using protein localization information. Genomics 104, 496–503 (2014)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported is supported in part by the National Natural Science Foundation of China, under Grants 61702444, in part by the West Light Foundation of The Chinese Academy of Sciences, under Grant 2018-XBQNXZ-B-008, in part by the Chinese Postdoctoral Science Foundation, under Grant 2019M653804, in part by the Tianshan youth - Excellent Youth, under Grant 2019Q029, in part by the Qingtan scholar talent project of Zaozhuang University. The authors would like to thank all anonymous reviewers for their constructive advices.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Zhu-Hong You or Xin Yan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, L., You, ZH., Yan, X., Zheng, K., Li, ZW. (2020). GCNSP: A Novel Prediction Method of Self-Interacting Proteins Based on Graph Convolutional Networks. In: Huang, DS., Jo, KH. (eds) Intelligent Computing Theories and Application. ICIC 2020. Lecture Notes in Computer Science(), vol 12464. Springer, Cham. https://doi.org/10.1007/978-3-030-60802-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-60802-6_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60801-9

  • Online ISBN: 978-3-030-60802-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics