Experiments on Kernel Tree Support Vector Machines for Text Categorization

Methasate, Ithipan; Theeramunkong, Thanaruk

doi:10.1007/978-3-540-71701-0_78

Ithipan Methasate¹,² & Thanaruk Theeramunkong¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4426)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

Text categorization has attracted considerable interest because of the rapid growth in the number of digital documents. The Support Vector Machine (SVM) algorithm is one of the most effective techniques for this problem. However, an SVM requires the user to choose a kernel function and its parameters, both of which directly affect classifier performance. This paper proposes a novel method, named Kernel Tree SVM, which represents a multiple-kernel function as a tree structure. The functions are selected and combined using genetic programming (GP). In addition, gradient descent is used to fine-tune the parameter values in each tree. The method is benchmarked on the WebKB and 20 Newsgroups datasets. The results show that the method can find a better solution than an SVM tuned with the gradient method.
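
To make the idea of a tree-structured kernel concrete, the following is a minimal sketch and not the authors' implementation. It assumes that leaves are standard base kernels (linear, RBF, polynomial) and that internal nodes are addition and element-wise multiplication, two operations known to preserve positive semi-definiteness. The tuple encoding, the function names, and the parameter values are hypothetical and chosen only for illustration; the GP search and the gradient-based tuning described in the abstract are not shown.

```python
import numpy as np

# Base kernels (leaves of the tree). Each takes two sample matrices and
# returns the Gram matrix between their rows.
def linear(X, Y):
    return X @ Y.T

def rbf(gamma):
    def k(X, Y):
        # Squared Euclidean distances between all row pairs.
        sq = (X**2).sum(1)[:, None] + (Y**2).sum(1)[None, :] - 2.0 * X @ Y.T
        return np.exp(-gamma * np.maximum(sq, 0.0))
    return k

def poly(degree, coef0=1.0):
    def k(X, Y):
        return (X @ Y.T + coef0) ** degree
    return k

# Hypothetical tree encoding: a leaf is a callable kernel; an internal
# node is a tuple (op, left, right) with op in {"add", "mul"}.
def evaluate(tree, X, Y):
    if callable(tree):
        return tree(X, Y)
    op, left, right = tree
    a = evaluate(left, X, Y)
    b = evaluate(right, X, Y)
    if op == "add":
        return a + b      # sum of PSD kernels is PSD
    if op == "mul":
        return a * b      # element-wise (Schur) product of PSD kernels is PSD
    raise ValueError(f"unknown operator: {op}")

# Example tree: (rbf * linear) + poly, with illustrative parameter values.
tree = ("add", ("mul", rbf(0.5), linear), poly(2))

rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))   # six toy samples with four features
K = evaluate(tree, X, X)      # Gram matrix of the composite kernel
```

A Gram matrix computed this way could be passed to an SVM trainer that accepts precomputed kernels, for example scikit-learn's `SVC(kernel="precomputed")`. In a GP setting, candidate trees of this shape would be mutated and recombined, with each tree scored by the resulting classifier's validation performance; that scoring scheme is an assumption here, since the abstract does not state the fitness function.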

Author information

Authors and Affiliations

Sirindhorn International Institute of Technology (SIIT), Thammasat University, 131 Moo 5, Tiwanont Road, Muang, Pathumthani 12000, Thailand
Ithipan Methasate & Thanaruk Theeramunkong
National Electronics and Computer Technology Center, 112 Phahon Yothin Road, Klong Luang, Pathumthani 12120, Thailand
Ithipan Methasate

Editor information

Zhi-Hua Zhou, Hang Li, Qiang Yang

Cite this paper

Methasate, I., Theeramunkong, T. (2007). Experiments on Kernel Tree Support Vector Machines for Text Categorization. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science, vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_78

DOI: https://doi.org/10.1007/978-3-540-71701-0_78
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71700-3
Online ISBN: 978-3-540-71701-0
eBook Packages: Computer Science, Computer Science (R0)
