Skip to main content
Log in

A hybrid-model optimization algorithm based on the Gaussian process and particle swarm optimization for mixed-variable CNN hyperparameter automatic search

一种基于高斯过程与粒子群算法的CNN超参数自动搜索混合模型优化算法

  • Research Article
  • Published:
Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Abstract

Convolutional neural networks (CNNs) have been developed quickly in many real-world fields. However, CNN’s performance depends heavily on its hyperparameters, while finding suitable hyperparameters for CNNs working in application fields is challenging for three reasons: (1) the problem of mixed-variable encoding for different types of hyperparameters in CNNs, (2) expensive computational costs in evaluating candidate hyperparameter configuration, and (3) the problem of ensuring convergence rates and model performance during hyperparameter search. To overcome these problems and challenges, a hybrid-model optimization algorithm is proposed in this paper to search suitable hyperparameter configurations automatically based on the Gaussian process and particle swarm optimization (GPPSO) algorithm. First, a new encoding method is designed to efficiently deal with the CNN hyperparameter mixed-variable problem. Second, a hybrid-surrogate-assisted model is proposed to reduce the high cost of evaluating candidate hyperparameter configurations. Third, a novel activation function is suggested to improve the model performance and ensure the convergence rate. Intensive experiments are performed on image-classification benchmark datasets to demonstrate the superior performance of GPPSO over state-of-the-art methods. Moreover, a case study on metal fracture diagnosis is carried out to evaluate the GPPSO algorithm performance in practical applications. Experimental results demonstrate the effectiveness and efficiency of GPPSO, achieving accuracy of 95.26% and 76.36% only through 0.04 and 1.70 GPU days on the CIFAR-10 and CIFAR-100 datasets, respectively.

摘要

卷积神经网络(CNN)在许多实际应用领域中有着快速发展. 然而, CNN性能很大程度上取决于其超参数, 而为CNN配置合适的超参数通常面临着以下3个挑战: (1)不同类型CNN超参数的混合变量编码问题; (2)评估候选模型的昂贵计算成本问题; (3)确保搜索过程中收敛速率和模型性能问题. 针对上述问题, 提出一种基于高斯过程(GP)和粒子群优化算法(PSO)的混合模型优化算法(GPPSO), 用于自动搜索最优的CNN超参数配置. 首先, 设计一种新的编码方法高效编码CNN中不同类型的超参数. 其次, 提出一种混合代理辅助(HSA)模型降低评估候选模型的高计算成本. 最后, 设计一种新的激活函数改善模型性能并确保收敛速率. 在图像分类基准数据集上进行了大量实验, 验证GPPSO优于最先进的方法. 以金属断口诊断为例, 验证GPPSO算法在实际应用中的有效性. 实验结果表明, GPPSO仅需0.04和1.70 GPU天即可在CIFAR-10和CIFAR-100数据集上实现95.26%和76.36%识别准确率.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Data availability

Due to the nature of this research, all authors of this paper did not agree for their data to be shared publicly, so supporting data are not available.

References

Download references

Author information

Authors and Affiliations

Authors

Contributions

Han YAN designed the research and performed the experiments. Han YAN and Chongquan ZHONG implemented the software and drafted the paper. Yuhu WU, Liyong ZHANG, and Wei LU revised and finalized the paper.

Corresponding author

Correspondence to Wei Lu  (卢伟).

Ethics declarations

Han YAN, Chongquan ZHONG, Yuhu WU, Liyong ZHANG, and Wei LU declare that they have no conflict of interest.

Additional information

Project supported by the National Natural Science Foundation of China (Nos. 62073056 and 61876029), the Applied Basic Research Project of Liaoning Province, China (No. 2023JH2/101300207), and the Key Field Innovation Team Project of Dalian, China (No. 2021RT14)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yan, H., Zhong, C., Wu, Y. et al. A hybrid-model optimization algorithm based on the Gaussian process and particle swarm optimization for mixed-variable CNN hyperparameter automatic search. Front Inform Technol Electron Eng 24, 1557–1573 (2023). https://doi.org/10.1631/FITEE.2200515

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/FITEE.2200515

Key words

关键词

CLC number

Navigation