Abstract
We describe general conditions for data classification which can serve as a unifying framework in the study of kernel based Machine Learning Algorithms. From these conditions we derive a new algorithm called SBC (for Similarity Based Classification), which has attractive theoretical properties regarding underfitting, overfitting, power of generalization, computational complexity and robustness. Compared to classical algorithms, such as Parzen windows and non-linear Perceptrons, SBC can be seen as an optimized version of them. Finally it is a conceptually simpler and a more efficient alternative to Support Vector Machines for an arbitrary number of classes. Its practical significance is illustrated through a number of benchmark classification problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brown, M.P.S., Grundy, W.N., Lin, D., Cristianini, N., Sugnet, C.W., Furey, T.S., Ares, M., Haussler, D.: Knowledge-based analysis of microarray gene expression data by using support vector machines. Proceedings of the National Academy of Sciences 97, 262–267 (2000)
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)
Chang, C.C., Lin, C.J.: Training ν-support vector classifiers, Theory and Algorithms. Neural Computation 9(13), 2119–2147 (2001)
Crisp, D.J., Burges, C.J.C.: 2000. In: Solla, S., Leen, T., Muller, K.R. (eds.) A Geometric Interpretation of N=bsvm Classifiers. Advances in Neural Information Processing Systems, vol. 12, MIT Press, Cambridge (2000)
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
Freund, Y., Schapire, R.E.: Large Margin Classification using the Perceptron Algorithm. Machine Learning 3(37), 277–296 (1999)
Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer, Dordrecht (2002)
Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: A fast iterative nearest point algorithm for support vector machine classifier design. IEEE Transactions on Neural Networks 11, 124–136 (2000)
Link, A.J., Robinson, K., Church, G.M.: Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia Coli. Electrophoresis 18, 1259–1313 (1997)
Murty, K.G.: Linear Complementarity, Linear and Nonlinear Programming. Helderman-Verlag (1988)
Reese, M.G., Eeckman, F.H., Kulp, D., Haussler, D.: Improved Splice Site Detection in Genie. In: Waterman, M. (ed.) RECOMB, Santa Fe (1997)
Rudd, K.E.: Ecogene: a genome sequence database for Escherichia Coli K-12. Nucleic Acid Research 28, 60–64 (2000)
Scholkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2001)
Thanaraj, T.A.: A clean data set of EST-confirmed splice sites from Homo sapiens amnd standards for clean-up procedures. Nucleic Acids Research 27(13), 2627–2637 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bernal, A.E., Hospevian, K., Karadeniz, T., Lassez, JL. (2003). Similarity Based Classification. In: R. Berthold, M., Lenz, HJ., Bradley, E., Kruse, R., Borgelt, C. (eds) Advances in Intelligent Data Analysis V. IDA 2003. Lecture Notes in Computer Science, vol 2810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45231-7_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-45231-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40813-0
Online ISBN: 978-3-540-45231-7
eBook Packages: Springer Book Archive