Skip to main content
Log in

Learning Biased SVM with Weighted Within-Class Scatter for Imbalanced Classification

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

Support vector machine (SVM) is a powerful tool for pattern classification and regression estimation. However, for the class imbalanced problem, conventional SVMs are not suitable to the imbalanced learning tasks since they tend to misclassify the minority class, which is always the more important class. In this paper, we propose an improved biased SVM with weighted within-class structure for imbalanced classification. The new algorithm makes the minority class more clustered by assigning a small weight for the within-class scatter matrix of minority class, which can improve the classification performance. The experimental results on several benchmark datasets demonstrate the effectiveness of the proposed algorithm for imbalanced data classification problems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  1. Maglogiannis I, Zafiropoulos E, Anagnostopoulos I (2009) An intelligent system for automated breast cancer diagnosis and prognosis using SVM based classifiers. Appl Intell 30:24–36

    Google Scholar 

  2. Wang KN, Zhong P (2014) Robust non-convex least squares loss function for regression with outliers. Knowl Based Syst 71:290–302

    Google Scholar 

  3. Engen V, Vincent J, Phalp K (2008) Enhancing network based intrusion detection for imbalanced data. Int J Knowl Based Intell Eng Syst 12:357–367

    Google Scholar 

  4. Kubat M, Matwin S (1997) Addressing the curse of imbalanced training sets: one-sided selection. In: Proceedings of the 14th international conference on machine learning, pp 179–186

  5. Jo T, Japkowicz N (2004) Class imbalances versus small disjuncts. SIGKDD Explor 6:40–49

    Google Scholar 

  6. Chawla N, Bowyer K, Hall L, Kegelmeyer W (2002) SMOTE: synthetic minority oversampling technique. J Artif Intell Res 16:321–357

    Google Scholar 

  7. Sun Y, Kamela M, Wongb A, Wang Y (2007) Cost-sensitive boosting for classification of imbalanced data. Pattern Recognit 40:3358–3378

    Google Scholar 

  8. Shao YH, Chen WJ, Zhang JJ, Wang Z, Deng NY (2014) An efficient weighted Lagrangian twin support vector machine for imbalanced data classification. Pattern Recognit 47:3158–3167

    Google Scholar 

  9. Zhao Z, Zhong P, Zhao Y (2011) Learning SVM with weighted maximum margin criterion for classification of imbalanced data. Math Comput Model 54:1093–1099

    Google Scholar 

  10. Zhao Z, Zhong P, Zhao Y (2010) Reduced least squaers one-class SVM in empirical feature space for imbalnced data. ICIC Express Lett 5(11):4115–4121

    Google Scholar 

  11. Zhu WX, Zhong P (2014) A new one-class SVM based on hidden information. Knowl Based Syst 60:35–43

    Google Scholar 

  12. Vapnik VN (1995) The nature of statistical learning theory. Springer, New York

    Google Scholar 

  13. Veropoulos K, Campbell C, Cristimanini N (1999) Controlling the sensitivity of support vector machines. In: Proceedings of the international joint conferences on artificial intelligence, vol 4, pp 55–60

  14. Akbani R, Kwek S, Japkowicz N (2004) Applying support vector machines to imbalanced datasets. In: Machine learning: ECML 2004. Springer, Berlin Heidelberg, pp 39–50

  15. Davenport MA (2005) The 2\(\nu \)-SVM: a cost-sensitive extension of the \(\nu \)-SVM. Technical Report TREE 0504, Rice University, Department of Electrical and Computer Engineering, October, 2005. http://www.ece.rice.edu/~md

  16. Karagiannopoulos MG, Anyfantis DS, Kotsiantis SB, et al. (2007) Local cost sensitive learning for handling imbalanced data sets. In: Mediterranean conference on control automation, 2007. MED’07. IEEE, pp 1–6

  17. Shao YH, Deng NY, Yang ZM (2012) Least squares recursive projection twin support vector machine for classification. Pattern Recognit 45:2299–2307

    Google Scholar 

  18. Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7:179–188

    Google Scholar 

  19. Chen C, Yang J (2007) Fisher large margin linear classifier. J Image Graph 12(12):2143–2147

    Google Scholar 

  20. An W, Jiao M (2013) Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises. Neurocomputing 110:101–110

    Google Scholar 

  21. Gu X, Ni T, Wang H (2014) New fuzzy support vector machine for the class imbalance problem in medical datasets classification. Sci World J 2014:536434. https://doi.org/10.1155/2014/536434

  22. Blake CL, Merz CJ (1998) UCI repository for machine learning databases. Department of Information and Computer Sciences, University of California, Irvine. http://www.ics.uci.edu/mlearn/MLRepository.html. Accessed 1 June 2015

Download references

Acknowledgements

The work is supported by National Natural Science Foundation of China Grant No. 11171346 and Chinese Universities Scientific Fund No. 2013YJ010. The author also gratefully acknowledges the helpful comments and suggestions of the reviewers, which have improved the presentation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ping Zhong.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, JJ., Zhong, P. Learning Biased SVM with Weighted Within-Class Scatter for Imbalanced Classification. Neural Process Lett 51, 797–817 (2020). https://doi.org/10.1007/s11063-019-10096-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-019-10096-8

Keywords

Navigation