Abstract
Interest in kernel-based methods in Data Mining is growing. Applying these methods to real-world data stored in databases raises the problem of designing kernels for complex structured data. Since many Data Mining systems use relational databases, designing kernels for relational data is an important task. In this paper we show that, for relational data, the structure of a single data instance in the input space can be described by nested relation schemes. For such data we propose a method for constructing kernels based on the convolution kernel framework developed by Haussler. As a demonstration, we construct a simple convolution Gaussian kernel and apply it, using the k-nearest neighbor algorithm, to the problem of outlier detection in a sample relational database.
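The abstract gives no formulas, so the following Python sketch is only an illustration of the general idea, not the kernel defined in the paper. It assumes that an instance is a dictionary whose atomic attributes are numeric and whose nested relations are lists of sub-tuples with the same schema. Atomic attributes are compared with a Gaussian kernel, nested relations with a sum over all pairs of sub-tuples (in the spirit of Haussler's convolution kernels), and the outlier score is the kernel-induced distance to the k-th nearest neighbor. All function names, the `sigma` parameter, and the sample data are hypothetical.

```python
import math


def gaussian(a, b, sigma=1.0):
    """Gaussian kernel on two numeric atomic values."""
    return math.exp(-((a - b) ** 2) / (2.0 * sigma ** 2))


def tuple_kernel(x, y, sigma=1.0):
    """Kernel on two tuples that share the same nested schema (assumed).

    Atomic attributes contribute a Gaussian factor. A nested relation
    (a list of sub-tuples) contributes the sum of this kernel over all
    pairs of sub-tuples; an empty nested relation therefore yields 0.
    """
    k = 1.0
    for attr, xv in x.items():
        yv = y[attr]
        if isinstance(xv, list):
            k *= sum(tuple_kernel(s, t, sigma) for s in xv for t in yv)
        else:
            k *= gaussian(xv, yv, sigma)
    return k


def normalized_kernel(x, y, sigma=1.0):
    """Cosine-style normalization so that K(x, x) = 1 when K(x, x) > 0."""
    kxx = tuple_kernel(x, x, sigma)
    kyy = tuple_kernel(y, y, sigma)
    if kxx == 0.0 or kyy == 0.0:
        return 0.0
    return tuple_kernel(x, y, sigma) / math.sqrt(kxx * kyy)


def kernel_distance(x, y, sigma=1.0):
    """Feature-space distance: d^2 = K(x,x) + K(y,y) - 2 K(x,y) = 2 - 2 K(x,y)."""
    return math.sqrt(max(0.0, 2.0 - 2.0 * normalized_kernel(x, y, sigma)))


def knn_outlier_scores(instances, k=2, sigma=1.0):
    """Score each instance by its distance to its k-th nearest neighbor."""
    scores = []
    for i, x in enumerate(instances):
        dists = sorted(
            kernel_distance(x, y, sigma)
            for j, y in enumerate(instances)
            if j != i
        )
        scores.append(dists[k - 1])
    return scores


# Hypothetical order records, each with a nested relation of line items.
orders = [
    {"freight": 1.0, "items": [{"qty": 1.0, "price": 2.0}, {"qty": 2.0, "price": 2.5}]},
    {"freight": 1.2, "items": [{"qty": 1.0, "price": 2.1}, {"qty": 2.0, "price": 2.4}]},
    {"freight": 0.9, "items": [{"qty": 1.5, "price": 2.0}]},
    {"freight": 9.0, "items": [{"qty": 8.0, "price": 9.0}]},
]
scores = knn_outlier_scores(orders, k=2)
most_unusual = scores.index(max(scores))  # index of the highest-scoring order
```

In this toy data the last order differs strongly from the rest in every attribute, so it receives the largest score. In practice the attributes would need scaling, or a per-attribute `sigma`, before the Gaussian factors are comparable.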
References
Schölkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. The MIT Press, Cambridge, Massachusetts (2000)
Friedman, N., Getoor, L., Koller, D., Pfeffer, A.: Learning Probabilistic Relational Models. In: Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, pp. 1300–1307 (1999)
Levene, M., Loizou, G.: A Fully Precise Null Extended Nested Relational Algebra. Fundamenta Informaticae 19, 303–343 (1993)
Haussler, D.: Convolution kernels on discrete structures. Technical Report UCSC-CRL-99-10, UC Santa Cruz (1999)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000), ISBN 1-55860-489-8
OLE DB for Data Mining Specification. Version 1.0. Microsoft Corporation (2000), http://www.microsoft.com/data/oledb/dm.htm
Northwind Traders Sample Database. Microsoft Corporation (2003), http://office.microsoft.com/downloads/9798/Nwind.aspx
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Petrovskiy, M. (2003). Convolution Kernels for Outliers Detection in Relational Data. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_89
DOI: https://doi.org/10.1007/978-3-540-45080-1_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40550-4
Online ISBN: 978-3-540-45080-1
eBook Packages: Springer Book Archive