Abstract
For most practical supervised learning applications, the training datasets are often linearly nonseparable based on the traditional Euclidean metric. To strive for more effective classification capability, a new and flexible distance metric has to be adopted. There exist a great variety of kernel-based classifiers, each with their own favorable domain of applications. They are all based on a new distance metric induced from a kernel-based inner-product. It is also known that classifier’s effectiveness depends strongly on the distribution of training and testing data. The problem lies in that we just do not know in advance the right models for the observation data and measurement noise. As a result, it is impossible to pinpoint an appropriate model for the best tradeoff between the classifier’s training accuracy and error resilience. The objective of this paper is to develop a versatile classifier endowed with a broad array of parameters to cope with various kinds of real-world data. More specifically, a so-called PDA-SVM Hybrid is proposed as a unified model for kernel-based supervised classification. This paper looks into the interesting relationship between existing classifiers (such as KDA, PDA, and SVM) and explains why they are special cases of the unified model. It further explores the effects of key parameters on various aspects of error analysis. Finally, simulations were conducted on UCI and biological data and their performance compared.
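The abstract's remark that kernel-based classifiers operate on "a new distance metric induced from a kernel-based inner-product" can be made concrete. The sketch below (a minimal illustration, not taken from the paper) computes the induced squared distance d(x, y)² = K(x, x) − 2K(x, y) + K(y, y); with the linear kernel it recovers the ordinary Euclidean metric, while the Gaussian (RBF) kernel yields a bounded, nonlinear metric.

```python
import numpy as np

def rbf_kernel(x, y, gamma=1.0):
    # Gaussian (RBF) kernel: inner product in the induced feature space.
    return np.exp(-gamma * np.sum((x - y) ** 2))

def kernel_distance_sq(x, y, kernel):
    # Squared distance induced by a kernel inner product:
    # d(x, y)^2 = K(x, x) - 2 K(x, y) + K(y, y).
    return kernel(x, x) - 2.0 * kernel(x, y) + kernel(y, y)

x = np.array([1.0, 2.0])
y = np.array([2.0, 0.0])

d2_linear = kernel_distance_sq(x, y, lambda a, b: a @ b)  # recovers ||x - y||^2 = 5
d2_rbf = kernel_distance_sq(x, y, rbf_kernel)             # bounded in [0, 2]
```

Swapping the kernel changes the metric without changing the classifier's algebra, which is why the choice of kernel (and its parameters) governs the accuracy/resilience tradeoff the abstract describes.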
Notes
Generally speaking, the formula for an optimal linear decision function is x^T w + b. For the special case considered here, we happen to have b = 0.
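As a minimal illustration of the note above (the weight vector here is hypothetical, chosen only for the example), the decision function f(x) = x^T w + b assigns a class by the sign of f(x); the special case simply sets b = 0:

```python
import numpy as np

# Hypothetical weight vector for illustration; the note's special case has b = 0.
w = np.array([0.5, -1.0])
b = 0.0

def decide(x, w, b=0.0):
    # Linear decision function f(x) = x^T w + b; the class label is sign(f(x)).
    return np.sign(x @ w + b)

label = decide(np.array([2.0, 0.5]), w)  # f = 1.0 - 0.5 = 0.5 > 0, so label = +1
```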
Note that a vector with α_i &lt; 0 will have a safety margin greater than or equal to 1.0. Such vectors are arguably too far away from the decision boundary and may therefore be regarded as non-critical for decision making. Therefore, in the conventional SVM, they are excluded from the pool of selected vectors (i.e., those with nonzero α_i's).
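The conventional SVM's selection rule described above can be checked on a toy separable set whose hard-margin solution is known in closed form (this example is illustrative, not from the paper): the two points on the margin carry nonzero multipliers, while the point with safety margin above 1.0 is excluded.

```python
import numpy as np

# Toy separable set with a known hard-margin SVM solution:
# x1, x2 are support vectors (alpha = 0.5); x3 lies beyond the margin.
X = np.array([[1.0, 0.0],
              [-1.0, 0.0],
              [2.0, 0.5]])
y = np.array([1.0, -1.0, 1.0])
alpha = np.array([0.5, 0.5, 0.0])  # KKT multipliers for this toy problem

w = (alpha * y) @ X        # w = sum_i alpha_i y_i x_i = (1, 0)
b = 0.0
margins = y * (X @ w + b)  # safety margin y_i (x_i^T w + b) of each sample

# Support vectors are those with nonzero alpha; x3 has margin 2 >= 1,
# so it is non-critical and excluded from the selected pool.
support = np.flatnonzero(alpha != 0.0)
```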
References
Kung, S. Y. (2009). Kernel approaches to unsupervised and supervised machine learning. In Proc. PCM’2009. Lecture notes in computer science (Vol. 5879, pp. 1–32). Springer-Verlag.
Aizerman, M., et al. (1964). Theoretical foundation of the potential function method in pattern recognition learning. Automation and Remote Control, 25, 821–837.
Vapnik, V. N. (1995). The nature of statistical learning theory. New York: Springer-Verlag.
Mitchell, T. M. (1997). Machine learning. McGraw-Hill.
Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179–188.
McLachlan, G. J. (1992). Discriminant analysis and statistical pattern recognition. John Wiley & Sons.
Friedman, J. (1989). Regularized discriminant analysis. Journal of the American Statistical Association, 84, 165–175.
Fukunaga, K. (1990). Introduction to statistical pattern recognition. Boston: Academic.
Schölkopf, B., Burges, C. J. C., & Smola, A. J. (1999). Advances in kernel methods: Support vector learning. Cambridge: MIT Press.
Mika, S., Ratsch, G., Weston, J., Scholkopf, B., & Mullers, K. R. (1999). Fisher discriminant analysis with kernels. In Y. H. Hu, J. Larsen, E. Wilson, & S. Douglas (Eds.), Neural networks for signal processing IX (pp. 41–48).
Mika, S., Ratsch, G., & Muller, K. R. (2001). A mathematical programming approach to the kernel Fisher algorithm. Advances in Neural Information Processing Systems, 13, 591–597.
Mika, S., Smola, A. J., & Scholkopf, B. (2001). An improved training algorithm for kernel Fisher discriminants. In T. Jaakkola, & T. Richardson (Eds.), Proceedings AISTATS (Vol. 2001, pp. 98–104). San Francisco: Morgan Kaufmann.
Muller, K. R., Mika, S., Ratsch, G., Tsuda, K., & Scholkopf, B. (2001). An introduction to kernel-based learning algorithms. IEEE Transactions on Neural Networks, 12(2), 181–201.
Gestel, T. V., Suykens, J. A. K., Lanckriet, G., Lambrechts, A., Moor, B. D., & Vandewalle, J. (2002). Bayesian framework for least-squares support vector machine classifiers, Gaussian processes, and kernel Fisher discriminant analysis. Neural Computation, 14(5), 1115–1147.
Woodbury, M. A. (1950). Inverting modified matrices. Memorandum Report 42, Statistical Research Group, Princeton University, Princeton, NJ. MR38136.
Joachims, T. (1999). Making large-scale SVM learning practical. In B. Schölkopf, C. Burges, & A. Smola (Eds.), Advances in Kernel methods—Support vector learning. Cambridge: MIT Press.
Schwaighofer, A. (2005). SVM toolbox for Matlab.
Pochet, N., De Smet, F., Suykens, J. A. K., & De Moor, B. L. R. (2004). Systematic benchmarking of microarray data classification: Assessing the role of nonlinearity and dimensionality reduction. Bioinformatics, 20(17), 3185–3195.
Iizuka, N., et al. (2003). Oligonucleotide microarray for prediction of early intrahepatic recurrence of hepatocellular carcinoma after curative resection. The Lancet, 361(9361), 923–929.
Alon, U., Barkai, N., Notterman, D. A., Gish, K., Ybarra, S., Mack, D., et al. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceedings of the National Academy of Sciences, 96(12), 6745–6750.
Nutt, C. L., et al. (2003). Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Research, 63(7), 1602–1607.
Golub, T. R., Slonim, D. K., Huard, C., Tamayo, P., Gaasenbeek, M., Mesirov, J. P., et al. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286, 531–537.
Singh, D., et al. (2002). Gene expression correlates of clinical prostate cancer behavior. Cancer Cell, 1(2), 203–209.
van ’t Veer, L., et al. (2002). Gene expression profiling predicts clinical outcome of breast cancer. Nature, 415, 530–535.
Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46, 389–422.
Additional information
This manuscript was based on the keynote paper at PCM2009 by Kung [1]. This work benefited greatly from our research collaboration with Ms. Yuhui Luo of Princeton University. The work was supported in part by The Hong Kong Research Grants Council, Grant Nos. PolyU5251/08E and PolyU5264/09E. Some of the research was conducted while S.Y. Kung was a Distinguished Visiting Professor at The University of Hong Kong.
Cite this article
Kung, S.Y., Mak, MW. PDA-SVM Hybrid: A Unified Model for Kernel-Based Supervised Classification. J Sign Process Syst 65, 5–21 (2011). https://doi.org/10.1007/s11265-011-0588-8