Abstract
Text categorization is one of the most interesting topic, due to the extremely increase of digital documents. The Support Vector Machine algorithm (SVM) is one of the most effective technique for solving this problem. However, SVM requires the user to choose the kernel function and parameters of the function, which directly effect to the performance of the classifiers. This paper proposes a novel method, named Kernel Tree SVM, which represents the multiple kernel function with a tree structure. The functions are selected and formed by using genetic programming (GP). Moreover, the gradient descent method is used to perform fine tune on parameter values in each tree. The method is benchmarked on WebKB and 20Newsgroup datasets. The results prove that the method can find a bettr optimal solution than the SVM tuned with the gradient method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vapnik, V.N.: Statistical Learning Theory. John Wiley and Sons, Chichester (1998)
Burges, C.: A Tutorial on Support Vector Machines for Pattern. Data Mining and Knowledge Discovery 2, 121–267 (1998)
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)
Chapelle, O., et al.: Choosing multiple parameters for support vector machines. In: Advances in Neural Information Processing Systems (2001)
Friedrichs, F., Igel, C.: Evolutionary Tuning of Multiple SVM Parameters. Neurocomputing 64(C), 107–117 (2005)
Chung, K.M., et al.: Radius margin bounds for support vector machines with the RBF kernel. Neural computation 15, 2643–2681 (2003)
Genton, M.G.: Classes of Kernels for Machine Learning: A Statistics. Jour. of Machine Learning 2, 299–312 (2001)
Glamachers, T., Igel, C.: Gradient-based Adaptation of General Gaussian Kernels. Neural Computation 17(10), 2099–2105 (2005)
Tom, H., Michael, G.M.: The Genetic Kernel Support Vector Machine: Description and Evaluation. Artificial Intelligence Review 10, 379–395 (2005)
Keerthi, S.S.: Efficient tuning of SVM hyperparameters using radius/margin bound and iterative algorithms. IEEE Transaction on Neural Network 13, 1225–1229 (2002)
Apte, C., Damerau, F.J., Weiss, S.M.: Automated learning of decision rules for text categorization. Information Systems, 233–251 (1994)
Weiss, Y., Schölkopf, B., Platt, J.: A General and Efficient Multiple Kernel Learning Algorithm. In: Advances in Neural Information Processing Systems 18 (2005)
Nigam, M., et al.: Text classification from labeled and unlabeled documents using EM. Machine Learning 39, 103–134 (2000)
Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceeding of SIGIR-99, pp. 42–49. ACM Press, New York (1999)
Namburu, S.M., et al.: Experiments on Supervised Learning Algorithms for Text Categorization. In: Proceeding of AERO 2005, pp. 1–8 (2005)
Verayuth, L., Thanaruk, T.: Effect of term distributions on centroid-based text categorization. Information Sciences 158, 89–115 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Methasate, I., Theeramunkong, T. (2007). Experiments on Kernel Tree Support Vector Machines for Text Categorization. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_78
Download citation
DOI: https://doi.org/10.1007/978-3-540-71701-0_78
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer ScienceComputer Science (R0)