Similarity Based Classification

Bernal, Axel E.; Hospevian, Karen; Karadeniz, Tayfun; Lassez, Jean-Louis

doi:10.1007/978-3-540-45231-7_18

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2810)

Included in the following conference series:

International Symposium on Intelligent Data Analysis

Abstract

We describe general conditions for data classification that can serve as a unifying framework for the study of kernel-based machine learning algorithms. From these conditions we derive a new algorithm called SBC (Similarity Based Classification), which has attractive theoretical properties regarding underfitting, overfitting, power of generalization, computational complexity, and robustness. SBC can be seen as an optimized version of classical algorithms such as Parzen windows and non-linear Perceptrons. Finally, it is a conceptually simpler and more efficient alternative to Support Vector Machines for an arbitrary number of classes. Its practical significance is illustrated on a number of benchmark classification problems.
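
The abstract does not spell out SBC itself, so the following is only a minimal sketch of one of the classical baselines it names: a Parzen-window-style classifier that treats a kernel as a similarity measure and assigns a query point to the class with the highest average similarity. The Gaussian kernel, the `gamma` parameter, the equal-prior averaging, and all function and variable names are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import defaultdict


def rbf_similarity(x, y, gamma=1.0):
    """Gaussian (RBF) kernel used as a similarity measure (an assumed choice)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def parzen_classify(train_points, train_labels, query, gamma=1.0):
    """Return the label whose training points are, on average, most similar to `query`."""
    totals = defaultdict(float)  # summed similarity per class
    counts = defaultdict(int)    # number of training points per class
    for point, label in zip(train_points, train_labels):
        totals[label] += rbf_similarity(point, query, gamma)
        counts[label] += 1
    # Averaging per class corresponds to assuming equal class priors.
    return max(totals, key=lambda label: totals[label] / counts[label])


# Toy data with three classes (hypothetical, for illustration only).
train_points = [(0.0, 0.0), (0.2, 0.1), (3.0, 3.0), (3.1, 2.9), (0.0, 3.0)]
train_labels = ["a", "a", "b", "b", "c"]

prediction = parzen_classify(train_points, train_labels, (0.1, 0.0))
print(prediction)
```

Every training point contributes equally here; according to the abstract, SBC can be viewed as an optimized version of this kind of scheme, but how that optimization is carried out is not shown in this sketch.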

Author information

Authors and Affiliations

CIS Department, University of Pennsylvania, Philadelphia, PA, 19104, USA
Axel E. Bernal
Computer Science Dept., New Mexico Institute of Mining and Technology, Socorro, New Mexico, USA
Karen Hospevian
Department of Computer Science, Coastal Carolina University, Conway, SC, 29528, USA
Tayfun Karadeniz & Jean-Louis Lassez

Editor information

Editors and Affiliations

Berkeley Initiative in Soft Computing (BISC), University of California at Berkeley, USA
Michael R. Berthold
Freie Universität Berlin, Garystr. 21, 14195, Berlin, Germany
Hans-Joachim Lenz
Department of Computer Science, University of Colorado, Boulder, Colorado, USA
Elizabeth Bradley
Otto-von-Guericke-University of Magdeburg, Germany
Rudolf Kruse
Department of Knowledge Processing and Language Engineering, University of Magdeburg, Universitätsplatz 2, 39106, Magdeburg, Germany
Christian Borgelt

About this paper

Cite this paper

Bernal, A.E., Hospevian, K., Karadeniz, T., Lassez, JL. (2003). Similarity Based Classification. In: Berthold, M.R., Lenz, HJ., Bradley, E., Kruse, R., Borgelt, C. (eds) Advances in Intelligent Data Analysis V. IDA 2003. Lecture Notes in Computer Science, vol 2810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45231-7_18

DOI: https://doi.org/10.1007/978-3-540-45231-7_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40813-0
Online ISBN: 978-3-540-45231-7
eBook Packages: Springer Book Archive
