skip to main content
10.1145/3638264.3638270acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmicmlConference Proceedingsconference-collections
research-article

Selecting Hyperparameters of Nonlinear Support Vector Machine Using Bayesian Inference

Published:29 January 2024Publication History

ABSTRACT

The performance of support vector machine(SVM) highly depends on selection of hyperparameters(kernel function parameters and penalty parameter). Currently selection methods of hyperparameters in nonlinear SVMs tend to fall into local optimized solution or rather is time consuming. Taking account of this respect, we propose a new automatic selection method of hyperparameters of nonlinear support vector machines. The theoretical analysis for the appropriate selection of hyperparameters is conducted by Bayesian inference. On the basis of this analysis, we determine the parameter by using Markov chain Monte Carlo(MCMC) algorithm. By using the determined parameters, we construct a classifier with low complexity of recognizing computation. Experimental results from various databases and the evaluation of toxicity in water samples demonstrate that the proposed method provides high classification performance.

References

  1. V. Vapnik, The Nature of Statistical Learning Theory, Springer Verlag, New York, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  2. Jianfeng Wang,Lei Zhang, “A parameter optimization method for an SVM based on improved grid search algorithm,” Applied Science and Technology, Vol.39,No.3 ,2012.Google ScholarGoogle Scholar
  3. S. Keerthi, V. Sindhwani, O. Chapelle, An efficient method for gradient-based adaptation of hyperparameters in SVM models, Adv. Neural Inf. Process. Syst.19 (2007) 673–680.Google ScholarGoogle Scholar
  4. S.T. Li, M.K. Tan, S.V.M. Tuning, parameters by using a hybrid CLPSO–BFGS algorithm, Neurocomputing 73 (2010) 2089–2096.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C.-L. Huang, ACO-based hybrid classification system with feature subset selection and model parameters optimization, Neurocomputing 73 (2009) 438–448.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. X.L. Zhang, X.F. Chen, Z.J. He, An ACO-based algorithm for parameter optimization of support vector machines, Expert Syst. Appl. 37 (2010) 6618–6628.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. İ. İlhan, G. Tezel, A genetic algorithm-support vector machine method with parameter optimization for selecting the tag SNPs, J. Biomed. Inf. 46 (2013) 328–340.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S.W. Lin, K.C. Ying, S.C. Chen, Z.J. Lee, Particle swarm optimization for parameter determination and feature selection of support vector machines, Expert Syst. Appl. 35 (2008) 1817–1824.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. X.-l. Li, L.-h. Li, B.-l. Zhang, Q.-j. Guo, Hybrid self-adaptive learning based particle swarm optimization and support vector regression model for grade estimation, Neurocomputing 118 (2013) 179–190.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Bao, Z. Hu, T. Xiong, A PSO and pattern search based memetic algorithm for SVMs parameters optimization, Neurocomputing 117 (2013) 98–106.Google ScholarGoogle ScholarCross RefCross Ref
  11. S. W. Lin, "Parameter determination of support vector machine and f eature selection using simulated annealing approach", Applied Soft Computin g, 8(4):1505-1512, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T.Van Gestel Bayesian Framework for Least-Squares Support Vector Machine Classifiers, Gaussian Processes, and Kernel Fisher Discriminant Analysis [J]. Neural Computation 14 1115-1147. 2002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Polson, N.G., Scott, S.L.: Data augmentation for support vector machines. Bayesian Anal, 2011.Google ScholarGoogle Scholar
  14. Shounak Datta, Swagatam Das, "Near-Bayesian Support Vector Machines for imbalanced data classification with equal or unequal misclassification costs", Neural Networks, vol.70, 39–52, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Sun, K.-A. Toh and Z. Lin, “A center sliding Bayesian binary classifier adopting orthogonal polynomials”, Pattern Recognition, vol. 48, no. 6, pp. 2013-2028, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Henao, R., Yuan, X., Carin, L.: Bayesian Nonlinear Support Vector Machines and Discriminative Factor Modeling. NIPS (2017)Google ScholarGoogle Scholar
  17. Polson, N.G., Scott, S.L.: Data augmentation for support vector machines. B ayesian Anal. 2011.Google ScholarGoogle Scholar
  18. Carl Edward Rasmussen, " Adaptive stellar spectral subclass classification based on Bayesian SVMs", New Astronomy 51 (2017) 51–58.Google ScholarGoogle ScholarCross RefCross Ref
  19. Henao, R., Yuan, X., Carin, L.: Bayesian Nonlinear Support Vector Machine s and Discriminative Factor Modeling. NIPS 2014.Google ScholarGoogle Scholar
  20. Marin, J. M., and C. P. Robert. Bayesian Core: A Practical Approach to Computati onal Bayesian Statistics. New York: Springer Science+Business Media, LLC, 2007.Google ScholarGoogle Scholar
  21. Gareth O. Roberts, "General state space Markov chains and MCMC algorithms" [J]. Probability Surveys 20, No 7, 20–71. 2004.Google ScholarGoogle Scholar
  22. I. Murray and R. P. Adams. "Slice sampling covariance hyperparameters of latent Gaussian models" NIPS, 23, pp 1723–1731, 2010.Google ScholarGoogle Scholar
  23. Kevin P. Mur phy, "Machine Learning", 864∼866, MIT press 2012.Google ScholarGoogle Scholar
  24. Adam Bownik, “Daphnia swimming behaviour as a biomarker in toxicity assessment”, Science of the Total Environment 601 –602 (2017) 194 –205.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Selecting Hyperparameters of Nonlinear Support Vector Machine Using Bayesian Inference
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            MICML '23: Proceedings of the 2023 International Conference on Mathematics, Intelligent Computing and Machine Learning
            December 2023
            109 pages
            ISBN:9798400709258
            DOI:10.1145/3638264

            Copyright © 2023 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 29 January 2024

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)6
            • Downloads (Last 6 weeks)0

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format