Skip to main content

Advertisement

Log in

MFC-PPI: protein–protein interaction prediction with multimodal feature fusion and contrastive learning

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Protein–protein interactions (PPIs) are of critical importance in numerous biological processes and disease mechanisms, and the accurate prediction of PPIs is helpful in the comprehension of complex biological systems. In this paper, MFC-PPI, a PPI prediction model based on multimodal feature fusion and contrastive learning, is proposed. The sequential features, structural features, and PPI network features of proteins are extracted and combined for prediction. The contrastive learning is used to compare the subtle difference between the sequential features and structural features. In addition, the feature enhancement module is designed for feature fusion. The comparative experiments on SHS27k and SHS148k datasets demonstrates the excellent performance of MFC-PPI over other state-of-art methods under three partitioning strategies, Random, BFS, and DFS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

The implementation of MFC-PPI and the preprocessed data is available at: https://github.com/Zhangzxin24/MFC-PPI.

References

  1. Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y (2001) A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci 98(8):4569–4574

    Article  Google Scholar 

  2. Lin J-S, Lai E-M (2017) Protein–protein interactions: co-immunoprecipitation. Bacterial protein secretion systems: Methods and protocols, pp 211–219. https://doi.org/10.1007/978-1-4939-7033-9_17

  3. Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams S-L, Millar A, Taylor P, Bennett K, Boutilier K et al (2002) Systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry. Nature 415(6868):180–183

    Article  Google Scholar 

  4. Wang H, Chen B, Sun H, Zhang Y (2024) Carbon-based molecular properties efficiently predicted by deep learning-based quantum chemical simulation with large language models. Comput Biol Med 176:108531

    Article  MATH  Google Scholar 

  5. Qiu X, Wang H, Tan X, Fang Z (2024) Gk bertdta: a graph representation learning and semantic embedding-based framework for drug-target affinity prediction. Comput Biol Med 173:108376

    Article  Google Scholar 

  6. Tang T, Zhang X, Liu Y, Peng H, Zheng B, Yin Y, Zeng X (2023) Machine learning on protein–protein interaction prediction: models, challenges and trends. Brief Bioinform 24(2):076

    Article  Google Scholar 

  7. Catacutan DB, Alexander J, Arnold A, Stokes JM (2024) Machine learning in preclinical drug discovery. Nat Chem Biol 20(8):960–973

    Article  MATH  Google Scholar 

  8. Ahmed Z, Mohamed K, Zeeshan S, Dong X (2020) Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database 2020:010

    Article  Google Scholar 

  9. MacEachern SJ, Forkert ND (2021) Machine learning for precision medicine. Genome 64(4):416–425

    Article  MATH  Google Scholar 

  10. Soleymani F, Paquet E, Viktor H, Michalowski W, Spinello D (2022) protein–protein interaction prediction with deep learning: a comprehensive review. Comput Struct Biotechnol J 20:5316–5341

    Article  MATH  Google Scholar 

  11. Hu L, Wang X, Huang Y-A, Hu P, You Z-H (2021) A survey on computational models for predicting protein–protein interactions. Brief Bioinform 22(5):036

    Article  MATH  Google Scholar 

  12. Guo Y, Yu L, Wen Z, Li M (2008) Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences. Nucleic Acids Res 36(9):3025–3030

    Article  MATH  Google Scholar 

  13. Hashemifar S, Neyshabur B, Khan AA, Xu J (2018) Predicting protein–protein interactions through sequence-based deep learning. Bioinformatics 34(17):802–810

    Article  Google Scholar 

  14. Chen M, Ju CJ-T, Zhou G, Chen X, Zhang T, Chang K-W, Zaniolo C, Wang W (2019) Multifaceted protein–protein interaction prediction based on siamese residual rcnn. Bioinformatics 35(14):305–314

    Article  Google Scholar 

  15. Gao Z, Jiang C, Zhang J, Jiang X, Li L, Zhao P, Yang H, Huang Y, Li J (2023) Hierarchical graph learning for protein–protein interaction. Nat Commun 14(1):1093

    Article  MATH  Google Scholar 

  16. Zhang F, Chang S, Wang B, Zhang X (2024) Dssgnn-ppi: a protein–protein interactions prediction model based on double structure and sequence graph neural networks. Comput Biol Med 177:108669

    Article  MATH  Google Scholar 

  17. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A et al (2021) Highly accurate protein structure prediction with alphafold. Nature 596(7873):583–589

    Article  Google Scholar 

  18. Li S, Huang J, Zhang Z, Liu J, Huang T, Chen H (2018) Similarity-based future common neighbors model for link prediction in complex networks. Sci Rep 8(1):17014

    Article  MATH  Google Scholar 

  19. Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907

  20. Lv G, Hu Z, Bi Y, Zhang S (2021) Learning unknown from correlations: Graph neural network for inter-novel-protein interaction prediction. arXiv preprint arXiv:2105.06709

  21. Kang Y, Elofsson A, Jiang Y, Huang W, Yu M, Li Z (2023) Aftgan: prediction of multi-type ppi based on attention free transformer and graph attention network. Bioinformatics 39(2):052

    Article  Google Scholar 

  22. Kang Y, Wang X, Xie C, Zhang H, Xie W (2023) Bbln: a bilateral-branch learning network for unknown protein–protein interaction prediction. Comput Biol Med 167:107588

    Article  MATH  Google Scholar 

  23. Albu A-I, Bocicor M-I, Czibula G (2023) Mm-stackens: a new deep multimodal stacked generalization approach for protein–protein interaction prediction. Comput Biol Med 153:106526

    Article  Google Scholar 

  24. Zhang D, Kabuka MR (2018) Multimodal deep representation learning for protein–protein interaction networks. In: 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 595–602. IEEE

  25. Liu L, Zhu X, Ma Y, Piao H, Yang Y, Hao X, Fu Y, Wang L, Peng J (2020) Combining sequence and network information to enhance protein–protein interaction prediction. BMC Bioinform 21:1–13

    Article  Google Scholar 

  26. Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR

  27. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P et al (2019) String v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res 47(D1):607–613

    Article  Google Scholar 

  28. Chen J, Gan Z, Li X, Guo Q, Chen L, Gao S, Chung T, Xu Y, Zeng B, Lu W, et al. (2021) Simpler, faster, stronger: Breaking the log-k curse on contrastive learners with flatnce. arXiv preprint arXiv:2107.01152

  29. Xu K, Hu W, Leskovec J, Jegelka S (2018) How powerful are graph neural networks? arXiv preprint arXiv:1810.00826

  30. Varadi M, Bertoni D, Magana P, Paramval U, Pidruchna I, Radhakrishnan M, Tsenkov M, Nair S, Mirdita M, Yeo J et al (2024) Alphafold protein structure database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res 52(D1):368–375

    Article  Google Scholar 

  31. Khan H, Usman MT, Rida I, Koo J (2024) Attention enhanced machine instinctive vision with human-inspired saliency detection. Image Vis Comput 152:105308

    Article  Google Scholar 

  32. Usman MT, Khan H, Singh SK, Lee MY, Koo J (2024) Efficient deepfake detection via layer-frozen assisted dual attention network for consumer imaging devices. IEEE Transactions on Consumer Electronics. https://doi.org/10.1109/tce.2024.3476477

  33. Wong L, You Z-H, Li S, Huang Y-A, Liu G (2015) Detection of protein–protein interactions from amino acid sequences using a rotation forest model with a novel pr-lpq descriptor. In: Advanced Intelligent Computing Theories and Applications: 11th International Conference, ICIC 2015, Fuzhou, China, August 20-23, 2015. Proceedings, Part III 11, pp. 713–720. Springer

  34. Silberberg Y, Kupiec M, Sharan R (2014) A method for predicting protein–protein interaction types. PLoS ONE 9(3):90904

    Article  MATH  Google Scholar 

  35. Khan H, Ullah I, Shabaz M, Omer MF, Usman MT, Guellil MS, Koo J (2024) Visionary vigilance: optimized yolov8 for fallen person detection with large-scale benchmark dataset. Image Vis Comput 149:105195

    Article  Google Scholar 

  36. Linderman GC, Rachh M, Hoskins JG, Steinerberger S, Kluger Y (2019) Fast interpolation-based t-sne for improved visualization of single-cell rna-seq data. Nat Methods 16(3):243–245

    Article  MATH  Google Scholar 

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (12371491).

Author information

Authors and Affiliations

Authors

Contributions

Z.Z.wrote the main manuscript text. J.X., Q.Z., and S.D. contributed to data analysis and figure preparation. Z.L. supervised the research and reviewed the manuscript. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to Zhen Li.

Ethics declarations

Conflict of interest

No conflict of interest is declared.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Z., Zhang, Q., Xiao, J. et al. MFC-PPI: protein–protein interaction prediction with multimodal feature fusion and contrastive learning. J Supercomput 81, 579 (2025). https://doi.org/10.1007/s11227-025-07076-2

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11227-025-07076-2

Keywords