Abstract
Support vector machine (SVM) is a powerful tool for pattern classification and regression estimation. However, for the class imbalanced problem, conventional SVMs are not suitable to the imbalanced learning tasks since they tend to misclassify the minority class, which is always the more important class. In this paper, we propose an improved biased SVM with weighted within-class structure for imbalanced classification. The new algorithm makes the minority class more clustered by assigning a small weight for the within-class scatter matrix of minority class, which can improve the classification performance. The experimental results on several benchmark datasets demonstrate the effectiveness of the proposed algorithm for imbalanced data classification problems.
Similar content being viewed by others
References
Maglogiannis I, Zafiropoulos E, Anagnostopoulos I (2009) An intelligent system for automated breast cancer diagnosis and prognosis using SVM based classifiers. Appl Intell 30:24–36
Wang KN, Zhong P (2014) Robust non-convex least squares loss function for regression with outliers. Knowl Based Syst 71:290–302
Engen V, Vincent J, Phalp K (2008) Enhancing network based intrusion detection for imbalanced data. Int J Knowl Based Intell Eng Syst 12:357–367
Kubat M, Matwin S (1997) Addressing the curse of imbalanced training sets: one-sided selection. In: Proceedings of the 14th international conference on machine learning, pp 179–186
Jo T, Japkowicz N (2004) Class imbalances versus small disjuncts. SIGKDD Explor 6:40–49
Chawla N, Bowyer K, Hall L, Kegelmeyer W (2002) SMOTE: synthetic minority oversampling technique. J Artif Intell Res 16:321–357
Sun Y, Kamela M, Wongb A, Wang Y (2007) Cost-sensitive boosting for classification of imbalanced data. Pattern Recognit 40:3358–3378
Shao YH, Chen WJ, Zhang JJ, Wang Z, Deng NY (2014) An efficient weighted Lagrangian twin support vector machine for imbalanced data classification. Pattern Recognit 47:3158–3167
Zhao Z, Zhong P, Zhao Y (2011) Learning SVM with weighted maximum margin criterion for classification of imbalanced data. Math Comput Model 54:1093–1099
Zhao Z, Zhong P, Zhao Y (2010) Reduced least squaers one-class SVM in empirical feature space for imbalnced data. ICIC Express Lett 5(11):4115–4121
Zhu WX, Zhong P (2014) A new one-class SVM based on hidden information. Knowl Based Syst 60:35–43
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Veropoulos K, Campbell C, Cristimanini N (1999) Controlling the sensitivity of support vector machines. In: Proceedings of the international joint conferences on artificial intelligence, vol 4, pp 55–60
Akbani R, Kwek S, Japkowicz N (2004) Applying support vector machines to imbalanced datasets. In: Machine learning: ECML 2004. Springer, Berlin Heidelberg, pp 39–50
Davenport MA (2005) The 2\(\nu \)-SVM: a cost-sensitive extension of the \(\nu \)-SVM. Technical Report TREE 0504, Rice University, Department of Electrical and Computer Engineering, October, 2005. http://www.ece.rice.edu/~md
Karagiannopoulos MG, Anyfantis DS, Kotsiantis SB, et al. (2007) Local cost sensitive learning for handling imbalanced data sets. In: Mediterranean conference on control automation, 2007. MED’07. IEEE, pp 1–6
Shao YH, Deng NY, Yang ZM (2012) Least squares recursive projection twin support vector machine for classification. Pattern Recognit 45:2299–2307
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7:179–188
Chen C, Yang J (2007) Fisher large margin linear classifier. J Image Graph 12(12):2143–2147
An W, Jiao M (2013) Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises. Neurocomputing 110:101–110
Gu X, Ni T, Wang H (2014) New fuzzy support vector machine for the class imbalance problem in medical datasets classification. Sci World J 2014:536434. https://doi.org/10.1155/2014/536434
Blake CL, Merz CJ (1998) UCI repository for machine learning databases. Department of Information and Computer Sciences, University of California, Irvine. http://www.ics.uci.edu/mlearn/MLRepository.html. Accessed 1 June 2015
Acknowledgements
The work is supported by National Natural Science Foundation of China Grant No. 11171346 and Chinese Universities Scientific Fund No. 2013YJ010. The author also gratefully acknowledges the helpful comments and suggestions of the reviewers, which have improved the presentation.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, JJ., Zhong, P. Learning Biased SVM with Weighted Within-Class Scatter for Imbalanced Classification. Neural Process Lett 51, 797–817 (2020). https://doi.org/10.1007/s11063-019-10096-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-019-10096-8