Abstract
Drug-drug interactions (DDIs) lead to Adverse Drug Reactions (ADRs) in most cases, which increase medical costs tremendously, and may cause medical negligence or even fatal accidents. Many researchers introduce binary classification methods into DDIs discovery task to avoid expensive and inefficient experimental trials. However, we find that drug-drug pairs without DDI labels cannot be simply viewed as negative samples in practice, since unobserved DDIs may be discovered from unlabeled pairs. Therefore, different from traditional positive-negative classification, in this paper, we treat DDIs discovery as a positive-unlabeled learning (PU learning) problem, in which each drug-drug pair is either a positive sample (labeled as DDI) or an unlabeled one (labeled as non-DDI). We propose a PU learning framework based on Extreme Learning Machine (ELM) named PU-ELM, which consists of two components, namely reliable negative extraction and classifier learning. To improve the quality of reliable negative set, we propose a negative extraction method named OC-ELM-RN (One-Class ELM for Reliable Negative extraction). As to the learning module, we first apply the semi-supervised learning strategy in PU-ELM (denoted as PU-ELM-SS) to achieve extremely fast learning speed, and then we propose an entropy based weighted learning method named PU-ELM-EW (PU-ELM with Entropy Weighted learning) to improve the PU learning performance. Extensive experiments are conducted on a synthetic dataset with varied settings. We also apply PU-ELM to a real-world drug dataset. The results indicate that PU-ELM achieves better DDIs discovery ability compared with state-of-the-art methods.
Similar content being viewed by others
References
Chang S, Zhang Y, Tang J, Yin D, Chang Y, Hasegawa-Johnson MA, Huang TS (2016) Positive-unlabeled learning in streaming networks. In: Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 755–764
Cheng F, Zhao Z (2014) Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc Jamia 21(2):278–86
Cheng F, Li W, Wang X, Zhou Y, Wu Z, Shen J, Tang Y (2013) Adverse drug events: Database construction and in silico prediction. J Chem Inform Model 53(4):744–752
Claesen M, De Smet F, Suykens JAK, De Moor B (2015) A robust ensemble approach to learn from positive and unlabeled data using svm base models. Neurocomputing 160:73–84
Elkan C, Noto K (2008) Learning classifiers from only positive and unlabeled data. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 213–220
Fung GPC, Yu JX, Lu H, Yu PS (2006) Text classification without negative examples revisit. IEEE Trans Knowl Data Eng 18(1):6–20
Hoerl AE, Kennard RW (2000) Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 42(1):80–86
Huang G, Song S, Gupta JNDG, Wu C (2014) Semi-supervised and unsupervised extreme learning machines. IEEE Trans Cybern 44(12):2405–2417
Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. Int Symp Neural Netw 2:985–990
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
Ienco D, Pensa RG (2016) Positive and unlabeled learning in categorical data. Neurocomputing 196:113–124
Lan W, Wang J, Li M, Liu J, Li Y, Wu FX, Pan Y (2016) Predicting drugtarget interaction using positive-unlabeled learning. Neurocomputing 206:50–57
Lee WS, Liu B (2003) Learning with positive and unlabeled examples using weighted logistic regression. In: Proceedings of the Twentieth International Conference on International Conference on Machine Learning, pp 448–455
Leng Q, Qi H, Miao J, Zhu W, Su G (2015) One-class classification with extreme learning machine. Math Probl Eng 2015:1–11
Li H, Chen Z, Liu B, Wei X, Shao J (2014) Spotting fake reviews via collective positive-unlabeled learning. In: 2014 IEEE International Conference on Data Mining, pp 899–904
Li X, Liu B (2003) Learning to classify texts using positive and unlabeled data. In: Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp 587–592
Liu B, Lee WS, Yu PS, Li X (2002) Partially supervised classification of text documents. In: Proceedings of the Nineteenth International Conference on Machine Learning, pp 387–394
Liu B, Dai Y, Li X, Lee WS, Yu PS (2013) Building text classifiers using positive and unlabeled examples. In: Third IEEE International Conference on Data Mining, pp 179–186
Nguyen MN, Li XL, Ng SK (2012) Ensemble based positive unlabeled learning for time series classification. In: Database Systems for Advanced Applications: 17th International Conference, Proceedings, Part I, pp 243–257
Pan S, Zhang Y, Li X (2012) Dynamic classifier ensemble for positive unlabeled text stream classification. Knowl Inf Syst 33(2):267–287
Sun Y, Yuan Y, Wang G (2011) An os-elm based distributed ensemble classification framework in p2p networks. Neurocomputing 74(16):2438–2443
Sun Y, Yuan Y, Wang G (2014) Extreme learning machine for classification over uncertain data. Neurocomputing 128:500–506
Tsai RTH, Hung HC, Dai HJ, Lin YW, Hsu WL (2008) Exploiting likely-positive and unlabeled data to improve the identification of protein-protein interaction articles. BMC Bioinform 9(1):S3
Wang G, Zhao Y, Wang D (2008) A protein secondary structure prediction framework based on the Extreme Learning Machine. Neurocomputing 72:262–268
Xiao Y, Segal MR (2008) Biological sequence classification utilizing positive and unlabeled data. Bioinformatics 24(9):1198–1205
Yang P, Li XL, Mei JP, Kwoh CK, Ng SK (2012) Positive-unlabeled learning for disease gene identification. Bioinformatics 28(20):2640–2647
Yang P, Humphrey SJ, James DE, Yang YH, Jothi R (2016) Positive-unlabeled ensemble learning for kinase substrate prediction from dynamic phosphoproteomics data. Bioinformatics 32(2):252–259
Yu H, Han J, Chang KCC (2002) Pebl: Positive example based learning for web page classification using svm. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, pp 239–248
Zhao X, Wang G, Bi X, Gong P, Zhao Y (2011) XML document classification based on ELM. Neurocomputing 74:2444–2451
Zhao X, Bi X, Qiao B (2014) Probability based voting extreme learning machine for multiclass xml documents classification. World Wide Web 17(5):1217–1231
Zhao X, Bi X, Wang G, Zhang Z, Yang H (2016) Uncertain xml documents classification using extreme learning machine. Neurocomputing 174:375–382
Zong W, Huang GB, Chen Y (2013) Weighted extreme learning machine for imbalance learning. Neurocomputing 101:229–242
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This research is partially supported by the National Natural Science Foundation of China (Grant nos. 61702086, 61702087, 61672145, 61572121 and U1401256), the Natural Science Foundation of Liaoning Province (Grant no. 20170520164), the Fundamental Research Funds for the Central Universities of China (Grant no. N171904007), and the China Postdoctoral Science Foundation (Grant no. 2018M631806).
Rights and permissions
About this article
Cite this article
Bi, X., Ma, H., Li, J. et al. A positive and unlabeled learning framework based on extreme learning machine for drug-drug interactions discovery. J Ambient Intell Human Comput 14, 1–12 (2023). https://doi.org/10.1007/s12652-018-0960-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-018-0960-7