Skip to main content

Experiments on Kernel Tree Support Vector Machines for Text Categorization

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4426))

Included in the following conference series:

Abstract

Text categorization is one of the most interesting topic, due to the extremely increase of digital documents. The Support Vector Machine algorithm (SVM) is one of the most effective technique for solving this problem. However, SVM requires the user to choose the kernel function and parameters of the function, which directly effect to the performance of the classifiers. This paper proposes a novel method, named Kernel Tree SVM, which represents the multiple kernel function with a tree structure. The functions are selected and formed by using genetic programming (GP). Moreover, the gradient descent method is used to perform fine tune on parameter values in each tree. The method is benchmarked on WebKB and 20Newsgroup datasets. The results prove that the method can find a bettr optimal solution than the SVM tuned with the gradient method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Vapnik, V.N.: Statistical Learning Theory. John Wiley and Sons, Chichester (1998)

    MATH  Google Scholar 

  2. Burges, C.: A Tutorial on Support Vector Machines for Pattern. Data Mining and Knowledge Discovery 2, 121–267 (1998)

    Article  Google Scholar 

  3. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines. Cambridge University Press, Cambridge (2000)

    Google Scholar 

  4. Chapelle, O., et al.: Choosing multiple parameters for support vector machines. In: Advances in Neural Information Processing Systems (2001)

    Google Scholar 

  5. Friedrichs, F., Igel, C.: Evolutionary Tuning of Multiple SVM Parameters. Neurocomputing 64(C), 107–117 (2005)

    Google Scholar 

  6. Chung, K.M., et al.: Radius margin bounds for support vector machines with the RBF kernel. Neural computation 15, 2643–2681 (2003)

    Article  MATH  Google Scholar 

  7. Genton, M.G.: Classes of Kernels for Machine Learning: A Statistics. Jour. of Machine Learning 2, 299–312 (2001)

    Article  Google Scholar 

  8. Glamachers, T., Igel, C.: Gradient-based Adaptation of General Gaussian Kernels. Neural Computation 17(10), 2099–2105 (2005)

    Article  MathSciNet  Google Scholar 

  9. Tom, H., Michael, G.M.: The Genetic Kernel Support Vector Machine: Description and Evaluation. Artificial Intelligence Review 10, 379–395 (2005)

    Google Scholar 

  10. Keerthi, S.S.: Efficient tuning of SVM hyperparameters using radius/margin bound and iterative algorithms. IEEE Transaction on Neural Network 13, 1225–1229 (2002)

    Article  Google Scholar 

  11. Apte, C., Damerau, F.J., Weiss, S.M.: Automated learning of decision rules for text categorization. Information Systems, 233–251 (1994)

    Google Scholar 

  12. Weiss, Y., Schölkopf, B., Platt, J.: A General and Efficient Multiple Kernel Learning Algorithm. In: Advances in Neural Information Processing Systems 18 (2005)

    Google Scholar 

  13. Nigam, M., et al.: Text classification from labeled and unlabeled documents using EM. Machine Learning 39, 103–134 (2000)

    Article  MATH  Google Scholar 

  14. Yang, Y., Liu, X.: A re-examination of text categorization methods. In: Proceeding of SIGIR-99, pp. 42–49. ACM Press, New York (1999)

    Chapter  Google Scholar 

  15. Namburu, S.M., et al.: Experiments on Supervised Learning Algorithms for Text Categorization. In: Proceeding of AERO 2005, pp. 1–8 (2005)

    Google Scholar 

  16. Verayuth, L., Thanaruk, T.: Effect of term distributions on centroid-based text categorization. Information Sciences 158, 89–115 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zhi-Hua Zhou Hang Li Qiang Yang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Methasate, I., Theeramunkong, T. (2007). Experiments on Kernel Tree Support Vector Machines for Text Categorization. In: Zhou, ZH., Li, H., Yang, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4426. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71701-0_78

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71701-0_78

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71700-3

  • Online ISBN: 978-3-540-71701-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics