Loading [a11y]/accessibility-menu.js
NegStacking: Drug−Target Interaction Prediction Based on Ensemble Learning and Logistic Regression | IEEE Journals & Magazine | IEEE Xplore

NegStacking: Drug−Target Interaction Prediction Based on Ensemble Learning and Logistic Regression


Abstract:

Drug−target interactions (DTIs) identification is an important issue of drug research, and many methods proposed to predict potential DTIs based on machine learning treat...Show More

Abstract:

Drug−target interactions (DTIs) identification is an important issue of drug research, and many methods proposed to predict potential DTIs based on machine learning treat it as a binary classification problem. However, the number of known interacting drug−target pairs (positive samples) is far less than that of non-interacting pairs (negative samples). Most methods do not utilize these large numbers of negative samples sufficiently, which limits their prediction performance. To address this problem, we proposed a stacking framework named NegStacking. First, it uses sampling to obtain multiple completely different negative sample sets. Then, each weak learner is trained with a different negative sample set and the same positive sample set, and the logistic regression (LR) is used as a meta-learner to adaptively combine these weak learners. Moreover, in the training process, feature subspacing and hyperparameter perturbation are applied to increase ensemble diversity. Finally, the trained model could be used to predict new samples. We compared NegStacking with other methods, and the experimental results show that our model is superior. NegStacking can improve the performance of predictive DTIs, and it has broad application prospects for improving the drug discovery process. The source code and datasets are available at https://github.com/Open-ss/NegStacking.
Published in: IEEE/ACM Transactions on Computational Biology and Bioinformatics ( Volume: 18, Issue: 6, 01 Nov.-Dec. 2021)
Page(s): 2624 - 2634
Date of Publication: 22 January 2020

ISSN Information:

PubMed ID: 31985434

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.