Skip to main content

Advertisement

Log in

LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening

  • Research
  • Published:
Health Information Science and Systems Aims and scope Submit manuscript

Abstract

Background

Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.

Method

Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.

Result

On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.

Conclusion

In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

All datasets used in this study are listed in the article. The code and data of the method are freely available at: https://github.com/ZuxianZhang/LDS-CNN.

References

  1. Ye Y, et al. Drug-target interaction prediction based on adversarial Bayesian personalized ranking. Biomed Res Int. 2021;2021:6690154.

    Google Scholar 

  2. Yang Z, et al. FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding. Front Comp Sci. 2022;17(5): 175903.

    MathSciNet  Google Scholar 

  3. Huang K, et al. DeepPurpose: a deep learning library for drug–target interaction prediction. Bioinformatics. 2020;36(22–23):5545–7.

    Google Scholar 

  4. Cowen L, et al. Network propagation: a universal amplifier of genetic associations. Nat Rev Genet. 2017;18(9):551–62.

    Google Scholar 

  5. Cheng FX, et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat Commun. 2018;9:2691.

    Google Scholar 

  6. Wei BM, Zhang Y, Gong X. DeepLPI: a novel deep learning-based model for protein-ligand interaction prediction for drug repurposing. Sci Rep. 2022;12(1):18200.

    Google Scholar 

  7. Perez-Nueno VI, et al. Detecting drug promiscuity using Gaussian ensemble screening. J Chem Inf Model. 2012;52(8):1948–61.

    Google Scholar 

  8. Rao L, et al. Nonfitting protein-ligand interaction scoring function based on first-principles theoretical chemistry methods: development and application on kinase inhibitors. J Comput Chem. 2013;34(19):1636–46.

    Google Scholar 

  9. Sajadi SZ, et al. AutoDTI plus plus: deep unsupervised learning for DTI prediction by autoencoders. BMC Bioinform. 2021;22(1):1–19.

    Google Scholar 

  10. Huang L, et al. CoaDTI: multi-modal co-attention based framework for drug-target interaction annotation. Brief Bioinform. 2022;23(6):bbac446.

    Google Scholar 

  11. Chavan G, Das D. Design and characterizations of pH-responsive drug delivery vehicles using molecular docking. Mater Technol. 2023;38(1):2196490.

    Google Scholar 

  12. Wang YB, et al. A deep learning-based method for drug-target interaction prediction based on long short-term memory neural network. BMC Med Inform Decis Mak. 2020;20(1):1–9.

    Google Scholar 

  13. Varela D, Karlin V, Andre I. A memetic algorithm enables efficient local and global all-atom protein-protein docking with backbone and side-chain flexibility. Structure. 2022;30(11):1550+.

    Google Scholar 

  14. Li KQ, et al. Identification of a potential structure-based GPCR drug for interstitial cystitis/bladder pain syndrome: in silico protein structure analysis and molecular docking. Int Urogynecol J. 2023;34:1559–65.

    Google Scholar 

  15. Zeng M, et al. A deep learning framework for identifying essential proteins based on protein-protein interaction network and gene expression data. In: Proceedings 2018 IEEE international conference on bioinformatics and biomedicine. 2018. pp. 583–8.

  16. Yu Z, et al. Predicting drug-disease associations through layer attention graph convolutional network. Brief Bioinform. 2021;22(4):bba243.

    Google Scholar 

  17. Vamathevan J, et al. Applications of machine learning in drug discovery and development. Nat Rev Drug Discovery. 2019;18(6):463–77.

    Google Scholar 

  18. Schneider G. Automating drug discovery. Nat Rev Drug Discovery. 2018;17(2):97–113.

    Google Scholar 

  19. Chen RL, et al. Machine learning for drug-target interaction prediction. Molecules. 2018;23(9):2208.

    Google Scholar 

  20. Ozturk H, Ozgur A, Ozkirimli E. DeepDTA: deep drug-target binding affinity prediction. Bioinformatics. 2018;34(17):821–9.

    Google Scholar 

  21. Bleakley K, Yamanishi Y. Supervised prediction of drug-target interactions using bipartite local models. Bioinformatics. 2009;25(18):2397–403.

    Google Scholar 

  22. Mei JP, et al. Drug-target interaction prediction by learning from local information and neighbors. Bioinformatics. 2013;29(2):238–45.

    Google Scholar 

  23. Buza K, Peska L. Drug-target interaction prediction with bipartite local models and hubness-aware regression. Neurocomputing. 2017;260:284–93.

    Google Scholar 

  24. Cheng FX, Zhao ZM. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(E2):E278–86.

    Google Scholar 

  25. Bull SC, Doig AJ. Properties of protein drug target classes. PLoS ONE. 2015;10(3):e0117955.

    Google Scholar 

  26. Zhou LQ, et al. Revealing drug-target interactions with computational models and algorithms. Molecules. 2019;24(9):1714.

    Google Scholar 

  27. Kwon S, Yoon S. DeepCCI: end-to-end deep learning for chemical-chemical interaction prediction. In: ACM-BCB' 2017: proceedings of the 8th ACM international conference on bioinformatics, computational biology, and health informatics. 2017. pp. 203–12.

  28. Tran HN, Xuan QNP, Nguyen TT. DeepCF-PPI: improved prediction of protein-protein interactions by combining learned and handcrafted features based on attention mechanisms. Appl Intell. 2023;53:17887–902.

    Google Scholar 

  29. Wan FP, et al. DeepCPI: a deep learning-based framework for large-scale in silico drug screening. Genomics Proteomics Bioinform. 2019;17(5):478–95.

    Google Scholar 

  30. Dorahy G, Chen JZ, Balle T. Computer-aided drug design towards new psychotropic and neurological drugs. Molecules. 2023;28(3):1324.

    Google Scholar 

  31. Zhao QC, et al. AttentionDTA: prediction of drug-target binding affinity using attention model. In: IEEE international conference on bioinformatics and biomedicine (BIBM). 2019. pp. 64–9.

    Google Scholar 

  32. Lin X, et al. DeepGS: deep representation learning of graphs and sequences for drug-target binding affinity prediction. In: ECAI 2020: 24th European conference on artificial intelligence, vol. 325. 2020. pp. 1301–8.

  33. Wen M, et al. Deep-learning-based drug-target interaction prediction. J Proteome Res. 2017;16(4):1401–9.

    Google Scholar 

  34. Wong A, et al. Amino acid motifs for the identification of novel protein interactants. Comput Struct Biotechnol J. 2023;21:326–34.

    Google Scholar 

  35. Khiar-Fernandez N, et al. Chemistry for the identification of therapeutic targets: recent advances and future directions. Eur J Org Chem. 2021;2021(9):1307–20.

    Google Scholar 

  36. Ye JH, et al. Machine learning advances in predicting peptide/protein-protein interactions based on sequence information for lead peptides discovery. Adv Biol. 2023. https://doi.org/10.1002/adbi.202200232.

    Article  Google Scholar 

  37. Anusuya S, et al. Drug-target interactions: prediction methods and applications. Curr Protein Pept Sci. 2018;19(6):537–61.

    Google Scholar 

  38. Huang KX, et al. MolTrans: molecular interaction transformer for drug-target interaction prediction. Bioinformatics. 2021;37(6):830–6.

    Google Scholar 

  39. Chu Y, et al. DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features. Brief Bioinform. 2021;22(1):451–62.

    MathSciNet  Google Scholar 

  40. Lee I, et al. DeepConv-DTI: prediction of drug-target interactions via deep learning with convolution on protein sequences. PLoS Comput Biol. 2019;15(6):e1007129.

    Google Scholar 

  41. Bagherian, et al. Machine learning approaches and databases for prediction of drug–target interaction: a survey paper. Brief Bioinform. 2020;22:247–69.

    Google Scholar 

  42. Otovic E, et al. Sequential properties representation scheme for recurrent neural network-based prediction of therapeutic peptides. J Chem Inf Model. 2022;62(12):2961–72.

    Google Scholar 

  43. Kim S, et al. PubChem 2023 update. Nucleic Acids Res. 2023;51(D1):D1373–80.

    Google Scholar 

  44. Gaulton A, et al. ChEMBL: a large-scale bioactivity database for chemical biology and drug discovery. Nucleic Acids Res Database Issue. 2012;40:D1100–7.

    Google Scholar 

  45. Wishart DS, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2017;46(D1):D1074–82.

    Google Scholar 

  46. Szklarczyk D, et al. STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res. 2016;44(D1):D380–4.

    Google Scholar 

  47. Szklarczyk D, et al. The STRING database in 2023: protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023;51(D1):D638–46.

    Google Scholar 

  48. Zeng HY, et al. Convolutional neural network architectures for predicting DNA-protein binding. Bioinformatics. 2016;32(12):121–7.

    Google Scholar 

  49. Campana PA, Nikoloski Z. Self- and cross-attention accurately predicts metabolite-protein interactions. NAR Genom Bioinform. 2023;5(1):lqad008.

    Google Scholar 

  50. Zhao HC, Li YH, Wang JX. A convolutional neural network and graph convolutional network-based method for predicting the classification of anatomical therapeutic chemicals. Bioinformatics. 2021;37(18):2841–7.

    Google Scholar 

  51. Sun C, et al. Autoencoder-based drug–target interaction prediction by preserving the consistency of chemical properties and functions of drugs. Bioinformatics. 2021;37(20):3618–25.

    Google Scholar 

  52. Xuan P, et al. Prediction of drug-target interactions based on network representation learning and ensemble learning. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(6):2671–81.

    Google Scholar 

  53. Watanabe N, Ohnuki Y, Sakakibara Y. Deep learning integration of molecular and interactome data for protein-compound interaction prediction. J Cheminform. 2021;13(1):36.

    Google Scholar 

Download references

Funding

This work was supported in part by the National Key R&D Program of China under Grant No. 2022YFB3304400, in part by the China University Industry, University and Research Innovation Fund under Grant No. 2021FNA03002, and in part by the Shanghai Pujiang Programme under Grant No. 22PJD104.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yu-Jing Lu or Dongning Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests or potential conflict.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Zhang, Z., Piao, C. et al. LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening. Health Inf Sci Syst 11, 42 (2023). https://doi.org/10.1007/s13755-023-00243-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13755-023-00243-w

Keywords

Navigation