Abstract
This paper extends the idea of weighted distance functions to kernels and support vector machines. Here, we focus on applications that rely on sliding a window over a sequence of string data. For this type of problems it is argued that a symbolic, context-based representation of the data should be preferred over a continuous, real format as this is a much more intuitive setting for working with (weighted) distance functions. It is shown how a weighted string distance can be decomposed and subsequently used in different kernel functions and how these kernel functions correspond to real kernels between the continuous, real representations of the symbolic, context-based representations of the vectors.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and other Kernel-based Learning Methods. Cambridge University Press, Cambridge (2000)
Schölkopf, B.: The kernel trick for distances. Technical report, Microsoft Research (2000)
Vanschoenwinkel, B.: Substitution matrix based kernel functions for protein secondary structure prediction. In: Proceedings of ICMLA 2004 (International Conference on Machine Learning and Applications) (2004)
Vanschoenwinkel, B., Manderick, B.: Appropriate kernel functions for support vector machine learning with sequences of symbolic data. In: Winkler, J.R., Niranjan, M., Lawrence, N.D. (eds.) Deterministic and Statistical Methods in Machine Learning. LNCS (LNAI), vol. 3635, pp. 256–280. Springer, Heidelberg (2005)
Joachims, T.: Learning to Classify Text Using Support Vector Machines. Kluwer Academic Publishers, Dordrecht (2002)
Vanschoenwinkel, B., Liu, F., Manderick, B.: Weighted kernel functions for svm learning in string domains: A distance function viewpoint. In: International Conference on Machine Learning and Cybernetics (ICMLC), Guangzhou, China. Vrije Universiteit Brussel, Belgium (2005)
Berg, C., Christensen, J.P.R., Ressel, P.: Harmonic Analysis on Semigroups. Springer, Heidelberg (1984)
Vanschoenwinkel, B., Manderick, B.: Context-sensitive kernel functions: A comparison between different context weights. In: Belgisch Nederlandse Artifcial Intelligence Conference (BNAIC), Brussels. Vrije Universiteit Brussel, Belgium (2005)
Chih-Chung, C., Chi-Jen, L.: LIBSVM: A Library for Support Vector Machines (2004)
Daelemans, W., Zavrel, J., Berck, S.: Mbt: A memorybased part of speech tagger-generator (1996)
Noreen, E.W.: Computer-Intensive Methods for Testing Hypotheses. John Wiley & Sons, Chichester (1989)
Degroeve, S.: Design and Evaluation of a Linear Classification Strategy for Gene Structural Element Recognition. PhD thesis, Universiteit Gent, Faculty of Sciences, Gent, Belgium (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vanschoenwinkel, B., Liu, F., Manderick, B. (2006). Context-Sensitive Kernel Functions: A Distance Function Viewpoint. In: Yeung, D.S., Liu, ZQ., Wang, XZ., Yan, H. (eds) Advances in Machine Learning and Cybernetics. Lecture Notes in Computer Science(), vol 3930. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11739685_90
Download citation
DOI: https://doi.org/10.1007/11739685_90
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33584-9
Online ISBN: 978-3-540-33585-6
eBook Packages: Computer ScienceComputer Science (R0)