Abstract
Identification and prediction of Drug-Target Interactions (DTIs) is the basis for screening drug candidates, which plays a vital role in the development of innovative drugs. However, due to the time-consuming and high cost constraints of biological experimental methods, traditional drug target identification technologies are often difficult to develop on a large scale. Therefore, in silico methods are urgently needed to predict drug-target interactions in a genome-wide manner. In this article, we design a novel in silico approach, named DTIFS to predict the DTIs by combining Feature weighted Rotation Forest (FwRF) classifier with protein amino acids information. This model has two outstanding advantages: a) using the fusion data of protein sequence and drug molecular fingerprint, which can fully carry information; b) using the classifier with feature selection ability, which can effectively remove noise information and improve prediction performance. More specifically, we first use Position-Specific Score Matrix (PSSM) to numerically convert protein sequences and utilize Pseudo Position-Specific Score Matrix (PsePSSM) to extract their features. Then a unified digital descriptor is formed by combining molecular fingerprints representing drug information. Finally, the FwRF is applied to implement on Enzyme, Ion Channel, GPCR, and Nuclear Receptor datasets. The results of the 5-fold CV experiment show that the prediction accuracy of this approach reaches 91.68%, 88.11%, 84.72% and 78.33% on four benchmark datasets, respectively. To further validate the performance of the DTIFS, we compare it with other excellent methods and Support Vector Machine (SVM) model. The experimental results of cross-validation indicated that DTIFS is feasible in predicting the relationship among drugs and target, and can provide help for the discovery of new candidate drugs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Xia, Z., Wu, L.-Y., Zhou, X., et al.: Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces. BMC Syst. Biol. 4 (2010)
Wang, L., You, Z.-H., Chen, X., et al.: A computational-based method for predicting drug–target interactions by using stacked autoencoder deep neural network. J. Comput. Biol. 25, 361–373 (2018)
Landry, Y., Gies, J.-P.: Drugs and their molecular targets: an updated overview. Fundam. Clin. Pharmacol. 22, 1–18 (2008)
Wang, L., et al.: Computational methods for the prediction of drug-target interactions from drug fingerprints and protein sequences by stacked auto-encoder deep neural network. In: Cai, Z., Daescu, O., Li, M. (eds.) ISBRA 2017. LNCS, vol. 10330, pp. 46–58. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59575-7_5
Wang, L., Yan, X., Liu, M.-L., et al.: Prediction of RNA-protein interactions by combining deep convolutional neural network with feature selection ensemble method. J. Theor. Biol. 461, 230–238 (2019)
Wang, L., You, Z.H., Chen, X., et al.: An ensemble approach for large-scale identification of protein-protein interactions using the alignments of multiple sequences. Oncotarget 8, 5149 (2017)
Chen, Z.-H., Yi, H.-C., Guo, Z.-H., et al.: Prediction of drug-target interactions from multi-molecular network based on deep walk embedding model. Front. Bioeng. Biotechnol. 8, 338 (2020)
Gao, Z.G., Wang, L., Xia, S.X., et al.: Ens-PPI: a novel ensemble classifier for predicting the interactions of proteins using autocovariance transformation from PSSM. Biomed. Res. Int. 8 (2016)
Wang, L., You, Z.-H., Li, L.-P., et al.: incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions. Sci. Rep. 10, 1–11 (2020)
Wu, Z., Cheng, F., Li, J., et al.: SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug–target interactions and drug repositioning. Brief. Bioinform. 18, 333–347 (2017)
Zong, N., Kim, H., Ngo, V., et al.: Deep mining heterogeneous networks of biomedical linked data to predict novel drug-target associations. Bioinformatics 33, 2337–2344 (2017)
Peng, L., Liao, B., Zhu, W., et al.: Predicting drug-target interactions with multi-information fusion. IEEE J. Biomed. Health Inf. 21, 561–572 (2017)
Ezzat, A., Wu, M., Li, X.L., et al.: Drug-target interaction prediction using ensemble learning and dimensionality reduction. Methods 129, 81 (2017)
Wen, M., Zhang, Z., Niu, S., et al.: Deep-learning-based drug-target interaction prediction. J. Proteome Res. 16, 1401 (2017)
Yamanishi, Y., Araki, M., Gutteridge, A., et al.: Prediction of drug-target interaction networks from the integration of chemical and genomic spaces. Bioinformatics 24, I232–I240 (2008)
Schomburg, I., Chang, A., Ebeling, C., et al.: BRENDA, the enzyme database: updates and major new developments. Nucleic Acids Res. 32, D431–D433 (2004)
Wishart, D.S., Knox, C., Guo, A.C., et al.: DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res. 36, D901–D906 (2008)
Gunther, S., Kuhn, M., Dunkel, M., et al.: SuperTarget and matador: resources for exploring drug-target relationships. Nucleic Acids Res. 36, D919–D922 (2008)
Kanehisa, M., Goto, S., Hattori, M., et al.: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 34, D354–D357 (2006)
Wang, L., You, Z.H., Chen, X., et al.: RFDT: a rotation forest-based predictor for predicting drug-target interactions using drug structure and protein sequence information. Curr. Protein Pept. Sci. 19, 445–454 (2018)
Jiang, H.-J., Huang, Y.-A., You, Z.-H.: SAEROF: an ensemble approach for large-scale drug-disease association prediction by incorporating rotation forest and sparse autoencoder deep neural network. Sci. Rep. 10, 4972 (2020)
Wang, L., Wang, H.-F., Liu, S.-R., et al.: Predicting protein-protein interactions from matrix-based protein sequence using convolution neural network and feature-selective rotation forest. Sci. Rep. 9, 9848 (2019)
Gribskov, M., McLachlan, A.D., Eisenberg, D.: Profile analysis: detection of distantly related proteins. Proc. Natl. Acad. Sci. U.S.A. 84, 4355–4358 (1987)
Wang, L., You, Z.-H., Yan, X., et al.: Using two-dimensional principal component analysis and rotation forest for prediction of protein-protein interactions. Sci. Rep. 8, 12874 (2018)
Zheng, K., You, Z.-H., Wang, L., et al.: MLMDA: a machine learning approach to predict and validate MicroRNA–disease associations by integrating of heterogenous information sources. J. Transl. Med. 17, 260 (2019)
Jones, D.T.: Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195–202 (1999)
Jones, D.T., Ward, J.J.: Prediction of disordered regions in proteins from position specific score matrices. Proteins-Struct. Funct. Bioinf. 53, 573–578 (2003)
Altschul, S.F., Madden, T.L., Schaffer, A.A., et al.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Wang, L., You, Z.-H., Xia, S.-X., et al.: Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier. J. Theor. Biol. 418, 105–110 (2017)
Wang, L., You, Z.-H., Huang, D.-S., et al.: Combining high speed ELM learning with a deep convolutional neural network feature encoding for predicting protein-RNA interactions. IEEE/ACM Trans. Comput. Biol. Bioinf. 1, 1 (2018)
Wang, L., You, Z.-H., Chen, X., et al.: LMTRDA: using logistic model tree to predict MiRNA-disease associations by fusing multi-source information of sequences and similarities. PLoS Comput. Biol. 15, e1006865 (2019)
Chou, K.C.: Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins-Struct. Funct. Genet. 43, 246–255 (2001)
Rodriguez, J.J., Kuncheva, L.I.: Rotation forest: A new classifier ensemble method. IEEE Trans. Pattern Anal. Mach. Intell. 28, 1619–1630 (2006)
Wang, L., et al.: An improved efficient rotation forest algorithm to predict the interactions among proteins. Soft. Comput. 22(10), 3373–3381 (2017). https://doi.org/10.1007/s00500-017-2582-y
Zheng, K., You, Z.-H., Wang, L., et al.: Dbmda: A unified embedding for sequence-based mirna similarity measure with applications to predict and validate mirna-disease associations. Mol. Ther.-Nucleic Acids 19, 602–611 (2020)
Zweig, M.H., Campbell, G.: Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. Clin. Chem. 39, 561–577 (1993)
Zheng, K., Wang, L., You, Z.-H.: CGMDA: an approach to predict and validate MicroRNA-disease associations by utilizing chaos game representation and LightGBM. IEEE Access 7, 133314–133323 (2019)
Chen, Z.-H., You, Z.-H., Li, L.-P., et al.: Identification of self-interacting proteins by integrating random projection classifier and finite impulse response filter. BMC Genom. 20, 1–10 (2019)
Chen, Z.-H., Li, L.-P., He, Z., et al.: An improved deep forest model for predicting self-interacting proteins from protein sequence using wavelet transformation. Front. Genet. 10, 90 (2019)
Pliakos K., Vens, C., Tsoumakas, G.: Predicting drug-target interactions with multi-label classification and label partitioning. IEEE/ACM Trans. Comput. Biol. Bioinf. (2019)
Chen, H., Zhang, Z.: A semi-supervised method for drug-target interaction prediction with consistency in networks. PLoS One 8, e62975 (2013)
Öztürk, H., Ozkirimli, E., Özgür, A.: A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction. BMC Bioinf. 17, 1–11 (2016)
Mousavian, Z., Khakabimamaghani, S., Kavousi, K., et al.: Drug-target interaction prediction from pssm based evolutionary information. J. Pharmacol. Toxicol. Methods 78, 42–51 (2015)
Acknowledgements
This work is supported in part by the National Natural Science Foundation of China, under Grants 61702444, in part by the West Light Foundation of The Chinese Academy of Sciences, under Grant 2018-XBQNXZ-B-008, in part by the Chinese Postdoctoral Science Foundation, under Grant 2019M653804, in part by the Tianshan youth - Excellent Youth, under Grant 2019Q029, in part by the Qingtan scholar talent project of Zaozhuang University. The authors would like to thank all anonymous reviewers for their constructive advices.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Yan, X., You, ZH., Wang, L., Li, LP., Zheng, K., Wang, MN. (2020). DTIFS: A Novel Computational Approach for Predicting Drug-Target Interactions from Drug Structure and Protein Sequence. In: Huang, DS., Jo, KH. (eds) Intelligent Computing Theories and Application. ICIC 2020. Lecture Notes in Computer Science(), vol 12464. Springer, Cham. https://doi.org/10.1007/978-3-030-60802-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-60802-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60801-9
Online ISBN: 978-3-030-60802-6
eBook Packages: Computer ScienceComputer Science (R0)