Abstract
A self-adaptive weighted extreme learning machine (SawELM) is proposed in this paper to deal with the imbalanced binary-class classification problems. SawELM calculates the output-layer weights based on a newly-designed self-adaptive mechanism which includes the following two modules: one is to gradually reduce the weights of wrongly-classified training instances and the other is to dynamically update the outputs of these wrongly-classified instances. On 50 imbalanced binary-class data sets selected from KEEL repository, we compare the accuracy, G-mean, and F-measure of SawELM with unweighted ELM (UnWELM) and weighted ELM (WELM). The experimental results show that the newly-designed self-adaptive mechanism is effective and SawELM obviously improves the imbalanced classification performance of WELM. SawWLM obtains the significantly higher G-mean and F-measure than UnWELM and WELM. Meanwhile, the accuracy of SawELM is better than WELM and comparable to UnWELM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alcalá-Fdez, J., Sánchez, L., García, S., del Jesus, M.J., Ventura, S., Garrell, J.M., Otero, J., Romero, C., Bacardit, J., Rivas, V.M., Fernández, J.C., Herrera, F.: KEEL: a software tool to assess evolutionary algorithms to data mining problems. Soft. Comput. 13(3), 307–318 (2009)
Alcalá-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., García, S., Sánchez, L., Herrera, F.: KEEL data-mining software tool: data set repository. Integration of algorithms and experimental analysis framework. J. Multiple-Valued Logic Soft Comput. 17(2–3), 255–287 (2011)
Ertekin, S., Huang, J., Bottou, L., Giles, C.L.: Learning on the border: active learning in imbalanced data classification. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, pp. 127–136 (2007)
Ertekin, S., Huang, J., Giles, C.L.: Active learning for class imbalance problem. In: Proceedings of The 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 823–824 (2007)
Guo, H., Viktor, H.L.: Learning from imbalanced data sets with boosting and data generation: the databoost-im approach. ACM SIGKDD Explor. Newslett. 6(1), 30–39 (2004)
Han, H., Wang, W.-Y., Mao, B.-H.: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: Huang, D.-S., Zhang, X.-P., Huang, G.-B. (eds.) ICIC 2005. LNCS, vol. 3644, pp. 878–887. Springer, Heidelberg (2005). doi:10.1007/11538059_91
He, H.B., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009)
He, H.B., Ma, Y.Q.: Imbalanced Learning: Foundations, Algorithms, and Applications. Wiley, Hoboken (2013)
Huang, G.B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. B Cybern. 42(2), 513–529 (2012)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
Khoshgoftaar, T.M., Van Hulse, J., Napolitano, A.: Comparing boosting and bagging techniques with noisy and imbalanced data. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 41(3), 552–568 (2011)
Li, K., Kong, X.F., Lu, Z., Liu, W.Y., Yin, J.P.: Boosting weighted ELM for imbalanced learning. Neurocomputing 128, 15–21 (2014)
Liu, X.Y., Wu, J.X., Zhou, Z.H.: Exploratory undersampling for class-imbalance learning. IEEE Trans. Syst. Man Cybern. B Cybern. 39(2), 539–550 (2009)
Thai-Nghe, N., Gantner, Z., Schmidt-Thieme, L.: Cost-sensitive learning methods for imbalanced data. In: Proceedings of the 2010 International Joint Conference on Neural Networks, pp. 1–8 (2010)
Xiao, W.D., Zhang, J., Li, Y.J., Yang, W.D.: Imbalanced extreme learning machine for classification with imbalanced data distributions. Proc. Adapt. Learn. Optimization 7, 503–514 (2016)
Yang, J., Yu, H., Yang, X., Zuo, X.: Imbalanced extreme learning machine based on probability density estimation. In: Bikakis, A., Zheng, X. (eds.) MIWAI 2015. LNCS, vol. 9426, pp. 160–167. Springer, Cham (2015). doi:10.1007/978-3-319-26181-2_15
You, Z.H., Lei, Y.K., Zhu, L., Xia, J.F., Wang, B.: Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis. BMC Bioinform. 14(Suppl 8), S10 (2013)
Zadrozny, B., Langford, J., Abe, N.: Cost-sensitive learning by cost-proportionate example weighting. In: Proceedings of the Third IEEE International Conference on Data Mining, pp. 435–442 (2003)
Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: “Missing is useful”: missing values in cost-sensitive decision trees. IEEE Trans. Knowl. Data Eng. 17(12), 1689–1693 (2005)
Zhou, Z.H., Liu, X.Y.: Training cost-sensitive neural networks with methods addressing the class imbalance problem. IEEE Trans. Knowl. Data Eng. 18(1), 63–77 (2006)
Zong, W.W., Huang, G.B., Chen, Y.Q.: Weighted extreme learning machine for imbalance learning. Neurocomputing 101, 229–242 (2013)
Acknowledgments
The first author and second author contributed equally the same to this article which is supported by National Natural Science Foundations of China (61503252 and 61473194) and China Postdoctoral Science Foundation (2016T90799).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Long, H., He, Y., Huang, J.Z., Wang, Q. (2017). Self-adaptive Weighted Extreme Learning Machine for Imbalanced Classification Problems. In: Kang, U., Lim, EP., Yu, J., Moon, YS. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10526. Springer, Cham. https://doi.org/10.1007/978-3-319-67274-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-67274-8_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67273-1
Online ISBN: 978-3-319-67274-8
eBook Packages: Computer ScienceComputer Science (R0)