
Surface defect identification method for hot-rolled steel plates based on random data balancing and lightweight convolutional neural network

  • Original Paper
  • Published in: Signal, Image and Video Processing

Abstract

Hot-rolled strip steel is an essential foundational material in industry. Rapid and precise identification of surface defects in hot-rolled strip steel helps enhance steel quality and reduce economic losses. Current research primarily relies on convolutional neural networks (CNNs) for strip steel surface defect identification. Although identification accuracy has improved remarkably compared with traditional machine learning methods, existing work largely overlooks dataset preprocessing and depends on non-lightweight CNN models with large parameter counts and high computational complexity. To address these issues, this study proposes a hot-rolled strip steel surface defect identification method based on random data balancing and the lightweight CNN MobileNet-Pro. Random data balancing employs image augmentation to eliminate the class imbalance among hot-rolled strip steel surface defect categories, providing diverse images that alleviate overfitting during model training. MobileNet-Pro enlarges the model's effective receptive field: building upon MobileNetV1, it introduces large convolutional kernels and improves depth-wise separable convolution. Experiments show that MobileNet-Pro, with random data balancing on the X-SDD dataset, achieves an accuracy of 96.47%, surpassing RepVGG + SA (95.10% accuracy, non-lightweight) and ResNet50 (93.86% accuracy, non-lightweight). Additionally, MobileNet-Pro outperforms mainstream lightweight networks, including the MobileNet series, ShuffleNetV2, and GhostNetV2, on the CIFAR-100 and PASCAL VOC 2007 datasets, demonstrating excellent generalization capability. All our code and models are available on GitHub: https://github.com/OnlyForWW/MobileNet-Pro.
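As a concrete illustration of the large-kernel, depth-wise separable design described above, the following PyTorch sketch shows a generic depth-wise separable block with an enlarged depth-wise kernel. The module name, the 7×7 kernel size, and the BatchNorm/ReLU ordering are illustrative assumptions and are not taken from the MobileNet-Pro implementation; the authors' actual code is available in the linked repository.

```python
# Hypothetical sketch: a depth-wise separable convolution block with a large
# depth-wise kernel, in the spirit of the design described in the abstract.
# Kernel size, normalization, and activation choices are illustrative assumptions.
import torch
import torch.nn as nn


class LargeKernelDWSeparable(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, kernel_size: int = 7, stride: int = 1):
        super().__init__()
        padding = kernel_size // 2  # preserve spatial size when stride == 1
        # Depth-wise convolution: one large-kernel filter per input channel
        self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size, stride,
                                   padding, groups=in_ch, bias=False)
        self.bn1 = nn.BatchNorm2d(in_ch)
        # Point-wise (1x1) convolution mixes information across channels
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_ch)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.act(self.bn1(self.depthwise(x)))
        return self.act(self.bn2(self.pointwise(x)))


# Example: a 224x224 feature map with 32 channels mapped to 64 channels
if __name__ == "__main__":
    block = LargeKernelDWSeparable(32, 64)
    print(block(torch.randn(1, 32, 224, 224)).shape)  # torch.Size([1, 64, 224, 224])
```

Relative to a standard convolution, the depth-wise stage keeps the parameter cost of the enlarged kernel roughly proportional to the channel count rather than to its square, which is why large kernels remain affordable in lightweight networks of this kind.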



Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (No. 32201666), the AIMS Commissioned Project (2022340101001837), and the Open Project of Anhui University Power Quality Engineering Research Center, Ministry of Education (KFKT202304).

Author information


Contributions

WZ and JW provided the theoretical viewpoints in the manuscript; JW completed the validation experiments in the manuscript; PC organized and led the project; all authors reviewed and evaluated the manuscript.

Corresponding author

Correspondence to Peng Chen.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Zeng, W., Wang, J., Chen, P. et al. Surface defect identification method for hot-rolled steel plates based on random data balancing and lightweight convolutional neural network. SIViP 18, 5775–5786 (2024). https://doi.org/10.1007/s11760-024-03270-6


  • DOI: https://doi.org/10.1007/s11760-024-03270-6
