Skip to main content

Similarity Based Classification

  • Conference paper
Advances in Intelligent Data Analysis V (IDA 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2810))

Included in the following conference series:

Abstract

We describe general conditions for data classification which can serve as a unifying framework in the study of kernel based Machine Learning Algorithms. From these conditions we derive a new algorithm called SBC (for Similarity Based Classification), which has attractive theoretical properties regarding underfitting, overfitting, power of generalization, computational complexity and robustness. Compared to classical algorithms, such as Parzen windows and non-linear Perceptrons, SBC can be seen as an optimized version of them. Finally it is a conceptually simpler and a more efficient alternative to Support Vector Machines for an arbitrary number of classes. Its practical significance is illustrated through a number of benchmark classification problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brown, M.P.S., Grundy, W.N., Lin, D., Cristianini, N., Sugnet, C.W., Furey, T.S., Ares, M., Haussler, D.: Knowledge-based analysis of microarray gene expression data by using support vector machines. Proceedings of the National Academy of Sciences 97, 262–267 (2000)

    Article  Google Scholar 

  2. Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2(2), 121–167 (1998)

    Article  Google Scholar 

  3. Chang, C.C., Lin, C.J.: Training ν-support vector classifiers, Theory and Algorithms. Neural Computation 9(13), 2119–2147 (2001)

    Article  Google Scholar 

  4. Crisp, D.J., Burges, C.J.C.: 2000. In: Solla, S., Leen, T., Muller, K.R. (eds.) A Geometric Interpretation of N=bsvm Classifiers. Advances in Neural Information Processing Systems, vol. 12, MIT Press, Cambridge (2000)

    Google Scholar 

  5. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)

    Google Scholar 

  6. Freund, Y., Schapire, R.E.: Large Margin Classification using the Perceptron Algorithm. Machine Learning 3(37), 277–296 (1999)

    Article  Google Scholar 

  7. Joachims, T.: Learning to Classify Text using Support Vector Machines. Kluwer, Dordrecht (2002)

    Google Scholar 

  8. Keerthi, S.S., Shevade, S.K., Bhattacharyya, C., Murthy, K.R.K.: A fast iterative nearest point algorithm for support vector machine classifier design. IEEE Transactions on Neural Networks 11, 124–136 (2000)

    Article  Google Scholar 

  9. Link, A.J., Robinson, K., Church, G.M.: Comparing the predicted and observed properties of proteins encoded in the genome of Escherichia Coli. Electrophoresis 18, 1259–1313 (1997)

    Article  Google Scholar 

  10. Murty, K.G.: Linear Complementarity, Linear and Nonlinear Programming. Helderman-Verlag (1988)

    Google Scholar 

  11. Reese, M.G., Eeckman, F.H., Kulp, D., Haussler, D.: Improved Splice Site Detection in Genie. In: Waterman, M. (ed.) RECOMB, Santa Fe (1997)

    Google Scholar 

  12. Rudd, K.E.: Ecogene: a genome sequence database for Escherichia Coli K-12. Nucleic Acid Research 28, 60–64 (2000)

    Article  Google Scholar 

  13. Scholkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2001)

    Google Scholar 

  14. Thanaraj, T.A.: A clean data set of EST-confirmed splice sites from Homo sapiens amnd standards for clean-up procedures. Nucleic Acids Research 27(13), 2627–2637 (1999)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bernal, A.E., Hospevian, K., Karadeniz, T., Lassez, JL. (2003). Similarity Based Classification. In: R. Berthold, M., Lenz, HJ., Bradley, E., Kruse, R., Borgelt, C. (eds) Advances in Intelligent Data Analysis V. IDA 2003. Lecture Notes in Computer Science, vol 2810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45231-7_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45231-7_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40813-0

  • Online ISBN: 978-3-540-45231-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics