Abstract
Background
Drug-target interaction (DTI) is a vital drug design strategy that plays a significant role in many processes of complex diseases and cellular events. In the face of challenges such as extensive protein data and experimental costs, it is suggested to apply bioinformatics approaches to exploit potential interactions to design new targeted medications. Different data and interaction types bring difficulties to study involving incompatible and heterology formats. The analysis of drug-target interactions in a comprehensive and unified model is a significant challenge.
Method
Here, we propose a general method for predicting interactions between small-molecule drugs and protein targets, Large-scale Drug target Screening Convolutional Neural Network (LDS-CNN), which used unified encoding to achieve the calculation of the different data formats in an integrated model to realize feature abstraction and potential object prediction.
Result
On 898,412 interaction data involving 1683 small-molecule compounds and 14,350 human proteins from 8.8 billion records, the proposed method achieved an area under the curve (AUC) of 0.96, an area under the precision-recall curve (AUPRC) of 0.95, and an accuracy of 90.13%. The experimental results illustrated that the proposed method attained high accuracy on the test set, indicating its high predictive ability in drug-target interaction prediction. LDS-CNN is effective for the prediction of large-scale datasets and datasets composed of data with different formats.
Conclusion
In this study, we propose a DTI prediction method to solve the problems of unified encoding of large-scale data in multiple formats. It provides a feasible way to efficiently abstract the features among different types of drug-related data, thus reducing experimental costs and time consumption. The proposed method can be used to identify potential drug targets and candidates for the treatment of complex diseases. This work provides a reference for DTI to process large-scale data and different formats with deep learning methods and provides certain suggestions for future research.
Similar content being viewed by others
Data availability
All datasets used in this study are listed in the article. The code and data of the method are freely available at: https://github.com/ZuxianZhang/LDS-CNN.
References
Ye Y, et al. Drug-target interaction prediction based on adversarial Bayesian personalized ranking. Biomed Res Int. 2021;2021:6690154.
Yang Z, et al. FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding. Front Comp Sci. 2022;17(5): 175903.
Huang K, et al. DeepPurpose: a deep learning library for drug–target interaction prediction. Bioinformatics. 2020;36(22–23):5545–7.
Cowen L, et al. Network propagation: a universal amplifier of genetic associations. Nat Rev Genet. 2017;18(9):551–62.
Cheng FX, et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat Commun. 2018;9:2691.
Wei BM, Zhang Y, Gong X. DeepLPI: a novel deep learning-based model for protein-ligand interaction prediction for drug repurposing. Sci Rep. 2022;12(1):18200.
Perez-Nueno VI, et al. Detecting drug promiscuity using Gaussian ensemble screening. J Chem Inf Model. 2012;52(8):1948–61.
Rao L, et al. Nonfitting protein-ligand interaction scoring function based on first-principles theoretical chemistry methods: development and application on kinase inhibitors. J Comput Chem. 2013;34(19):1636–46.
Sajadi SZ, et al. AutoDTI plus plus: deep unsupervised learning for DTI prediction by autoencoders. BMC Bioinform. 2021;22(1):1–19.
Huang L, et al. CoaDTI: multi-modal co-attention based framework for drug-target interaction annotation. Brief Bioinform. 2022;23(6):bbac446.
Chavan G, Das D. Design and characterizations of pH-responsive drug delivery vehicles using molecular docking. Mater Technol. 2023;38(1):2196490.
Wang YB, et al. A deep learning-based method for drug-target interaction prediction based on long short-term memory neural network. BMC Med Inform Decis Mak. 2020;20(1):1–9.
Varela D, Karlin V, Andre I. A memetic algorithm enables efficient local and global all-atom protein-protein docking with backbone and side-chain flexibility. Structure. 2022;30(11):1550+.
Li KQ, et al. Identification of a potential structure-based GPCR drug for interstitial cystitis/bladder pain syndrome: in silico protein structure analysis and molecular docking. Int Urogynecol J. 2023;34:1559–65.
Zeng M, et al. A deep learning framework for identifying essential proteins based on protein-protein interaction network and gene expression data. In: Proceedings 2018 IEEE international conference on bioinformatics and biomedicine. 2018. pp. 583–8.
Yu Z, et al. Predicting drug-disease associations through layer attention graph convolutional network. Brief Bioinform. 2021;22(4):bba243.
Vamathevan J, et al. Applications of machine learning in drug discovery and development. Nat Rev Drug Discovery. 2019;18(6):463–77.
Schneider G. Automating drug discovery. Nat Rev Drug Discovery. 2018;17(2):97–113.
Chen RL, et al. Machine learning for drug-target interaction prediction. Molecules. 2018;23(9):2208.
Ozturk H, Ozgur A, Ozkirimli E. DeepDTA: deep drug-target binding affinity prediction. Bioinformatics. 2018;34(17):821–9.
Bleakley K, Yamanishi Y. Supervised prediction of drug-target interactions using bipartite local models. Bioinformatics. 2009;25(18):2397–403.
Mei JP, et al. Drug-target interaction prediction by learning from local information and neighbors. Bioinformatics. 2013;29(2):238–45.
Buza K, Peska L. Drug-target interaction prediction with bipartite local models and hubness-aware regression. Neurocomputing. 2017;260:284–93.
Cheng FX, Zhao ZM. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(E2):E278–86.
Bull SC, Doig AJ. Properties of protein drug target classes. PLoS ONE. 2015;10(3):e0117955.
Zhou LQ, et al. Revealing drug-target interactions with computational models and algorithms. Molecules. 2019;24(9):1714.
Kwon S, Yoon S. DeepCCI: end-to-end deep learning for chemical-chemical interaction prediction. In: ACM-BCB' 2017: proceedings of the 8th ACM international conference on bioinformatics, computational biology, and health informatics. 2017. pp. 203–12.
Tran HN, Xuan QNP, Nguyen TT. DeepCF-PPI: improved prediction of protein-protein interactions by combining learned and handcrafted features based on attention mechanisms. Appl Intell. 2023;53:17887–902.
Wan FP, et al. DeepCPI: a deep learning-based framework for large-scale in silico drug screening. Genomics Proteomics Bioinform. 2019;17(5):478–95.
Dorahy G, Chen JZ, Balle T. Computer-aided drug design towards new psychotropic and neurological drugs. Molecules. 2023;28(3):1324.
Zhao QC, et al. AttentionDTA: prediction of drug-target binding affinity using attention model. In: IEEE international conference on bioinformatics and biomedicine (BIBM). 2019. pp. 64–9.
Lin X, et al. DeepGS: deep representation learning of graphs and sequences for drug-target binding affinity prediction. In: ECAI 2020: 24th European conference on artificial intelligence, vol. 325. 2020. pp. 1301–8.
Wen M, et al. Deep-learning-based drug-target interaction prediction. J Proteome Res. 2017;16(4):1401–9.
Wong A, et al. Amino acid motifs for the identification of novel protein interactants. Comput Struct Biotechnol J. 2023;21:326–34.
Khiar-Fernandez N, et al. Chemistry for the identification of therapeutic targets: recent advances and future directions. Eur J Org Chem. 2021;2021(9):1307–20.
Ye JH, et al. Machine learning advances in predicting peptide/protein-protein interactions based on sequence information for lead peptides discovery. Adv Biol. 2023. https://doi.org/10.1002/adbi.202200232.
Anusuya S, et al. Drug-target interactions: prediction methods and applications. Curr Protein Pept Sci. 2018;19(6):537–61.
Huang KX, et al. MolTrans: molecular interaction transformer for drug-target interaction prediction. Bioinformatics. 2021;37(6):830–6.
Chu Y, et al. DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features. Brief Bioinform. 2021;22(1):451–62.
Lee I, et al. DeepConv-DTI: prediction of drug-target interactions via deep learning with convolution on protein sequences. PLoS Comput Biol. 2019;15(6):e1007129.
Bagherian, et al. Machine learning approaches and databases for prediction of drug–target interaction: a survey paper. Brief Bioinform. 2020;22:247–69.
Otovic E, et al. Sequential properties representation scheme for recurrent neural network-based prediction of therapeutic peptides. J Chem Inf Model. 2022;62(12):2961–72.
Kim S, et al. PubChem 2023 update. Nucleic Acids Res. 2023;51(D1):D1373–80.
Gaulton A, et al. ChEMBL: a large-scale bioactivity database for chemical biology and drug discovery. Nucleic Acids Res Database Issue. 2012;40:D1100–7.
Wishart DS, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2017;46(D1):D1074–82.
Szklarczyk D, et al. STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res. 2016;44(D1):D380–4.
Szklarczyk D, et al. The STRING database in 2023: protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res. 2023;51(D1):D638–46.
Zeng HY, et al. Convolutional neural network architectures for predicting DNA-protein binding. Bioinformatics. 2016;32(12):121–7.
Campana PA, Nikoloski Z. Self- and cross-attention accurately predicts metabolite-protein interactions. NAR Genom Bioinform. 2023;5(1):lqad008.
Zhao HC, Li YH, Wang JX. A convolutional neural network and graph convolutional network-based method for predicting the classification of anatomical therapeutic chemicals. Bioinformatics. 2021;37(18):2841–7.
Sun C, et al. Autoencoder-based drug–target interaction prediction by preserving the consistency of chemical properties and functions of drugs. Bioinformatics. 2021;37(20):3618–25.
Xuan P, et al. Prediction of drug-target interactions based on network representation learning and ensemble learning. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(6):2671–81.
Watanabe N, Ohnuki Y, Sakakibara Y. Deep learning integration of molecular and interactome data for protein-compound interaction prediction. J Cheminform. 2021;13(1):36.
Funding
This work was supported in part by the National Key R&D Program of China under Grant No. 2022YFB3304400, in part by the China University Industry, University and Research Innovation Fund under Grant No. 2021FNA03002, and in part by the Shanghai Pujiang Programme under Grant No. 22PJD104.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests or potential conflict.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, Y., Zhang, Z., Piao, C. et al. LDS-CNN: a deep learning framework for drug-target interactions prediction based on large-scale drug screening. Health Inf Sci Syst 11, 42 (2023). https://doi.org/10.1007/s13755-023-00243-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13755-023-00243-w