A study of visual behavior of multidimensional scaling for kernel perceptron algorithm

Hsu, Che-Chang; Wang, Kuo-Shong; Chung, Hung-Yuan; Chang, Shih-Hsing

doi:10.1007/s00521-014-1746-2

A study of visual behavior of multidimensional scaling for kernel perceptron algorithm

Original Article
Published: 18 October 2014

Volume 26, pages 679–691, (2015)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Che-Chang Hsu¹,
Kuo-Shong Wang¹,
Hung-Yuan Chung² &
…
Shih-Hsing Chang³

182 Accesses
1 Citation
Explore all metrics

Abstract

The class imbalance problem occurs when the classifier is to detect a rare but important class. The purpose of this paper is to study whether possible sources of error are not only the imbalance but also other factors in combination, which lead to these misclassifications. The theoretical difficulties in purely predictive settings arise from the lack of visualization. Therefore, for kernel classifiers we propose the link with a kernel version of multidimensional scaling in high-dimensional feature space. The transformed version of the features specifically discloses the intrinsic structure of Hilbert space and is then used as inputs into a learning system: in the example, this prediction method is based on the SVMs-rebalance methodology. The graphical representations indicate the effects of masking, skewed, and multimodal distribution, which are also responsible for the poor performance. By studying the properties of the misclassifications, we can further develop ways to improve them.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Kernel-based linear classification on categorical data

Article 05 November 2015

Parameter investigation of support vector machine classifier with kernel functions

Article 01 February 2019

Kernel-Based SVM

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Provost F, Fawcett T (2001) Robust classification for imprecise environments. Mach Learn 42(3):203–231
Article MATH Google Scholar
Wu G, Chang EY (2003) Class-boundary alignment for imbalanced dataset learning. In: Proceedings of the ICML’03 workshop on learning from imbalanced datasets, pp 49–56
Chawla NV, Japcowicz N, Kolcz A (2004) Editorial: special issue on learning from imbalanced datasets. SIGKDD Explor 6(1):1–6
Article Google Scholar
Visa S, Ralescu A (2005) Issues in mining imbalanced data sets—a review paper. In: Proceeding of the sixteen Midwest artificial intelligence and cognitive science conference, Dayton, Ohio, USA, pp 67–73
Kubat M, Matwin S (1997) Addressing the curse of imbalanced training sets: one-sided selection. In: Proceedings of the 14th international conference on machine learning, pp 179–186
Japkowicz N (ed) (2000) Proceeding of the AAAI’2000 workshop on learning from imbalanced data sets. Technical report WS-00-05, AAAI Press, Menlo Park
Chawla NV, Japkowicz N, Kolcz A (eds) (2003) Proceedings of the ICML’2003 workshop on learning from imbalanced data sets (II). http://www.site.uottawa.ca/~nat/Workshop2003/workshop2003.html
Weiss G (2004) Mining with rarity: a unifying framework. SIGKDD Explor 6(1):7–19
Article Google Scholar
Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5):429–449
MATH Google Scholar
Vapnik VN (1995) The nature of statistical learning theory. Springer, Berlin
Book MATH Google Scholar
Akbani R, Kwek S, Japkowicz N (2004) Applying support vector machines to imbalanced datasets. In: Proceedings 15th ECML, pp 39–50
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
MATH Google Scholar
Veropoulos K, Campbell C, Cristianini N (1999) Controlling the sensitivity of support vector machines. In: Proceedings of the international joint conference on artificial intelligence, pp 55–60
Han H, Wang W-Y, Mao B-H (2005) Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. Lect Notes Comput Sci 3644:878–887
Article Google Scholar
Tang Y, Zhang Y-Q, Chawla NV, Krasser S (2009) SVMs modeling for highly imbalanced classification. IEEE Trans Syst Man Cybern Part B 39(1):281–288
Article Google Scholar
Wu G, Chang EY (2005) KBA: kernel boundary alignment considering imbalanced data distribution. IEEE Trans Knowl Data Eng 17(6):786–795
Article Google Scholar
Maloof M (2003) Learning when data sets are imbalanced and when costs are unequal and unknown. In: Proceedings of the ICML’2003 workshop on learning from imbalanced data sets II, pp 73–80
Tao Q, Wu G-W, Wang F-Y, Wang J (2005) Posterior probability support vector machines for unbalanced data. IEEE Trans Neural Netw 16(6):1561–1573
Article Google Scholar
Green PF, Carmone FJ Jr, Smith SM (1989) Multidimensional scaling: concepts and applications. Allyn and Bacon, Boston, pp 139–204
Google Scholar
Hsu CC, Wang KS, Chung HY, Chang SH (2013) An algorithmic SVMs-rebalancing approach for class imbalance problem. Neural Computing and Applications (submitted)
Chung HY, Ho CH (2009) Design of Bayesian-based knowledge extraction for SVMs in unbalanced classifications. Department of Electrical Engineering, National Central University, Jhongli, Taiwan, ROC
Hsu CC, Wang KS, Chang SH (2011) Bayesian decision theory for support vector machines: imbalance measurement and feature optimization. Expert Syst Appl 38(5):4698–4704
Article Google Scholar
Chung HY, Ho CH, Hsu CC (2011) Support vector machines using Bayesian-based approach in the issue of unbalanced classifications. Expert Syst Appl 38(9):11447–11452
Article Google Scholar
Visa S, Ralescu A (2003) Learning imbalanced and overlapping classes using fuzzy sets. In: Proceedings of the ICML’2003 workshop on learning from imbalanced data sets II, Washington, pp 97–104
Prati RC, Batista GEAPA, Monard MC (2004) Class imbalances versus class overlapping: an analysis of a learning system behavior. In: MICAI, pp 312–321
Batista GEAPA, Prati RC, Monard MC (2004) A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor 6(1):20–29
Article Google Scholar
Japkowicz N (2001) Concept-learning in the presence of between-class and within-class imbalances. In: Proceedings of the fourteenth conference of the Canadian society for computational studies of intelligence, pp 67–77
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit 30(7):1145–1159
Article Google Scholar
Duda RO, Hart PE, Stork DG (2001) Pattern classification, 2nd edn. Wiley, New York, pp 20–64
MATH Google Scholar
Krishnaiah PR, Kanal LN (1982) Classification, pattern recognition, and reduction of dimensionality. Handbook of statistics 2. North-Holland, Amsterdam
Google Scholar
Hastie T, Tibshirani R, Friendman J (2001) The elements of statistical learning: data mining, inference and prediction. Springer, Berlin, pp 502–503
Book Google Scholar
Lee JA, Verleysen M (2007) Nonlinear dimensionality reduction. Springer, New York, pp 69–97
Book MATH Google Scholar
Murphy PM (1995) UCI-benchmark repository of artificial and real data sets. http://www.ics.uci.edu/~mlearn. CA, University of California Irvine
Breiman L (1996) Bias, variance and arcing classifiers. Technical report 460, Berkeley, CA: Statistics Department, University of California at Berkeley

Download references

Author information

Authors and Affiliations

Department of Mechanical Engineering, National Central University, No. 300, Jhongda Rd., Jhongli City, Taoyuan County, Taiwan, ROC
Che-Chang Hsu & Kuo-Shong Wang
Department of Electrical Engineering, National Central University, No. 300, Jhongda Rd., Jhongli City, Taoyuan County, Taiwan, ROC
Hung-Yuan Chung
Institute of Business and Management, Vanung University, No. 1 Van-Nung Rd., Jhongli City, Taoyuan County, 320614, Taiwan, ROC
Shih-Hsing Chang

Authors

Che-Chang Hsu
View author publications
You can also search for this author inPubMed Google Scholar
Kuo-Shong Wang
View author publications
You can also search for this author inPubMed Google Scholar
Hung-Yuan Chung
View author publications
You can also search for this author inPubMed Google Scholar
Shih-Hsing Chang
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Che-Chang Hsu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hsu, CC., Wang, KS., Chung, HY. et al. A study of visual behavior of multidimensional scaling for kernel perceptron algorithm. Neural Comput & Applic 26, 679–691 (2015). https://doi.org/10.1007/s00521-014-1746-2

Download citation

Received: 03 January 2014
Accepted: 14 September 2014
Published: 18 October 2014
Issue Date: April 2015
DOI: https://doi.org/10.1007/s00521-014-1746-2

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A study of visual behavior of multidimensional scaling for kernel perceptron algorithm

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Kernel-based linear classification on categorical data

Parameter investigation of support vector machine classifier with kernel functions

Kernel-Based SVM

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now