Abstract
Metal surface defect detection is an important task for quality control in industrial production processes, and the requirements for accuracy, and running speed are becoming increasingly high. However, maintaining the realization of real-time surface defect segmentation remains a challenge due to the complex edge details of metal defects, inter-class similarity, and intra-class differences. For this reason, we propose Dual-branch Real-time Segmentation NetWork (DBRNet) for pixel-level defect classification on metal surfaces. First, we propose the Low-params Feature Enhancement Module (LFEM), which improves the feature extraction capability of the model with fewer parameters and does not significantly reduce the inference speed. Then, to solve the problem of inter-class similarity, we design the Attention Flow-semantic Fusion Module (AFFM) to effectively integrate the high-dimensional semantic information into the low-dimensional detail feature map by generating flow-semantic offset positions and using global attention. Finally, the Deep Connection Pyramid Pooling Module (DCPPM) is proposed to aggregate multi-scale context information to realize the overall perception of the defect. Experiments on NEU-Seg, MT, and Severstal Steel Defect Dataset show that the DBRNet outperforms the other state-of-the-art approaches in balance accuracy, speed, and params. The code is publicly available at https://github.com/fffcompu/DBRNet-Defect.
This work was supported in part by Major innovation projects of the pilot project of science, education and industry integration (2022JBZ01-01), and in part by Taishan Scholars Program (tsqn202211203).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: a deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: YOLOv4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
Damacharla, P., Rao, A., Ringenberg, J., Javaid, A.Y.: TLU-Net: a deep learning approach for automatic steel surface defect detection. In: 2021 International Conference on Applied Artificial Intelligence (ICAPAI), pp. 1–6. IEEE (2021)
Dong, H., Song, K., He, Y., Xu, J., Yan, Y., Meng, Q.: PGA-Net: pyramid feature fusion and global context attention network for automated surface defect detection. IEEE Trans. Ind. Inf. 16(12), 7448–7458 (2019)
Fan, M., et al.: Rethinking BiSeNet for real-time semantic segmentation. In: Proceedings of the IEEE/CVF Conference on CVPR, pp. 9716–9725 (2021)
Hong, Y., Pan, H., Sun, W., Jia, Y.: Deep dual-resolution networks for real-time and accurate semantic segmentation of road scenes. arXiv preprint arXiv:2101.06085 (2021)
Huang, Y., Qiu, C., Yuan, K.: Surface defect saliency of magnetic tile. Vis. Comput. 36, 85–96 (2020). https://doi.org/10.1007/s00371-018-1588-5
Jocher, G.: Yolov5 (2021). https://github.com/ultralytics/yolov5
Li, H., Xiong, P., Fan, H., Sun, J.: DFANet: deep feature aggregation for real-time semantic segmentation. In: Proceedings of the IEEE/CVF Conference on CVPR, pp. 9522–9531 (2019)
Li, X., et al.: Semantic flow for fast and accurate scene parsing. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 775–793. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_45
Ma, N., Zhang, X., Zheng, H.-T., Sun, J.: ShuffleNet V2: practical guidelines for efficient CNN architecture design. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 122–138. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_8
Pan, Y., Zhang, L.: Dual attention deep learning network for automatic steel surface defect segmentation. Comput.-Aided Civ. Infrastruct. Eng. 37(11), 1468–1487 (2022)
Paszke, A., Chaurasia, A., Kim, S., Culurciello, E.: ENet: a deep neural network architecture for real-time semantic segmentation. arXiv preprint arXiv:1606.02147 (2016)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
Romera, E., Alvarez, J.M., Bergasa, L.M., Arroyo, R.: ERFNet: efficient residual factorized convnet for real-time semantic segmentation. IEEE Trans. Intell. Transp. Syst. 19(1), 263–272 (2017)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Severstal: Steel defect detection, kaggle challange 2019 (2019). https://www.kaggle.com/c/severstal-steel-defect-detection
Wang, Y., et al.: LEDNet: a lightweight encoder-decoder network for real-time semantic segmentation. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 1860–1864. IEEE (2019)
Yu, C., Gao, C., Wang, J., Yu, G., Shen, C., Sang, N.: BiSeNet V2: bilateral network with guided aggregation for real-time semantic segmentation. Int. J. Comput. Vis. 129, 3051–3068 (2021). https://doi.org/10.1007/s11263-021-01515-2
Zhang, D., Song, K., Xu, J., He, Y., Niu, M., Yan, Y.: MCnet: multiple context information segmentation network of no-service rail surface defects. IEEE Trans. Instrum. Meas. 70, 1–9 (2020)
Zhang, X., Du, B., Wu, Z., Wan, T.: LAANet: lightweight attention-guided asymmetric network for real-time semantic segmentation. Neural Comput. Appl. 34(5), 3573–3587 (2022). https://doi.org/10.1007/s00521-022-06932-z
Zhao, H., Qi, X., Shen, X., Shi, J., Jia, J.: ICNet for real-time semantic segmentation on high-resolution images. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11207, pp. 418–434. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01219-9_25
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, T., Wei, X., Wu, X., Jiang, X. (2024). DBRNet: Dual-Branch Real-Time Segmentation NetWork for Metal Defect Detection. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14430. Springer, Singapore. https://doi.org/10.1007/978-981-99-8537-1_34
Download citation
DOI: https://doi.org/10.1007/978-981-99-8537-1_34
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8536-4
Online ISBN: 978-981-99-8537-1
eBook Packages: Computer ScienceComputer Science (R0)