skip to main content
10.1145/3647649.3647656acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicigpConference Proceedingsconference-collections
research-article

DCS-YOLOv8: An Improved Steel Surface Defect Detection Algorithm Based on YOLOv8

Published:03 May 2024Publication History

ABSTRACT

In response to the challenges posed by low detection accuracy resulting from a wide range of surface defects, intricate textures, and minute defect targets in steel surfaces, this paper introduces an innovative defect detection model called DCS-YOLOv8, which builds upon the foundation of YOLOv8. Firstly, Real-ESRGAN (Real-Enhanced Super-Resolution GAN) is used to enhance image resolution, effectively addressing the challenge of identifying minuscule defects within the dataset. Furthermore, DCN (Deformable Convolutions) are seamlessly integrated into the backbone network to amplify the network's capability for multi-scale feature extraction, which empowers the network to adeptly navigate intricate background information and concentrate on pinpointing target objects. Lastly, to tackle the issues of elevated false negative rates and diminished detection precision, this paper designs a module based on the CBAM (Concentration-Based Attention Module) and SCSE (Concurrent Spatial and Channel Squeeze and Excitation) attention modules. It facilitates comprehensive information acquisition, enriches the fusion of channel and spatial features, and elevates feature map expression. Regarding experimental outcomes, the enhanced YOLOv8 algorithm shows outstanding detection performance, achieving 78.6% mAP on the NEU-DET dataset, which marks a 4.4% enhancement over the original YOLOv8 network. Notably, the algorithm attains a detection speed of 143 FPS. When juxtaposed with other prominent object detection algorithms, it unequivocally affirms the efficacy and supremacy of this approach, underscoring its potential significance in industrial applications.

References

  1. Xiong Z, Li Q, Mao Q, A 3D laser profiling system for rail surface defect detection. Sensors, 2017, 17(8): 1791.Google ScholarGoogle ScholarCross RefCross Ref
  2. Liang Y, Xu K, Zhou P. Mask gradient response-based threshold segmentation for surface defect detection of milled aluminum ingot. Sensors, 2020, 20(16): 4519.Google ScholarGoogle ScholarCross RefCross Ref
  3. Yefan Zhou, "Research on Image-Based Automatic Wafer Surface Defect Detection Algorithm," Journal of Image and Graphics, Vol. 7, No. 1, pp. 26-31, March 2019. doi: 10.18178/joig.7.1.26-31Google ScholarGoogle ScholarCross RefCross Ref
  4. Teow Wee Teo and Mohd Zaid Abdullah, "Solar Cell Micro-Crack Detection Using Localised Texture Analysis," Journal of Image and Graphics, Vol. 6, No. 1, pp. 54-58, June 2018. doi: 10.18178/joig.6.1.54-58Google ScholarGoogle ScholarCross RefCross Ref
  5. Girshick R, Donahue J, Darrell T, Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-587.Google ScholarGoogle Scholar
  6. Ren S, He K, Girshick R, Faster R-CNN: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 2015,28: 1440-1448.Google ScholarGoogle Scholar
  7. He K, Gkioxari G, Dollár P, Mask R-CNN. Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.Google ScholarGoogle Scholar
  8. Zhao W, Chen F, Huang H, A new steel defect detection algorithm based on deep learning. Computational Intelligence and Neuroscience, 2021, 2021: 1-13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Liu W, Anguelov D, Erhan D, Ssd: Single shot multibox detector. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.Google ScholarGoogle Scholar
  10. Redmon J, Farhadi A. Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767, 2018.Google ScholarGoogle Scholar
  11. Redmon J, Divvala S, Girshick R, You only look once: Unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.Google ScholarGoogle Scholar
  12. Redmon J, Farhadi A. YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 7263-7271.Google ScholarGoogle Scholar
  13. Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934, 2020.Google ScholarGoogle Scholar
  14. Li C, Li L, Jiang H, YOLOv6: A single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976, 2022.Google ScholarGoogle Scholar
  15. Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 7464-7475.Google ScholarGoogle Scholar
  16. Junjie X, Minping J, Feiyun X, A method for workpiece surface small-defect detection based on CutMix and YOLOv3. Journal of Southeast University (English Edition), 2021, 37(2): 128-136.Google ScholarGoogle Scholar
  17. Zhao, H., Wan, F., Lei, G., Xiong, Y., Xu, L., Xu, C., & Zhou, W. LSD-YOLOv5: A Steel Strip Surface Defect Detection Algorithm Based on Lightweight Network and Enhanced Feature Fusion Mode. Sensors, 2023, 23(14), 6558.Google ScholarGoogle Scholar
  18. Zhu X, Hu H, Lin S, Deformable convnets v2: More deformable, better results. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 9308-9316.Google ScholarGoogle Scholar
  19. Bolya D, Zhou C, Xiao F, Yolact: Real-time instance segmentation. Proceedings of the IEEE/CVF international conference on computer vision. 2019: 9157-9166.Google ScholarGoogle Scholar
  20. Wang C Y, Liao H Y M, Wu Y H, CSPNet: A new backbone that can enhance learning capability of CNN. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 2020: 390-391.Google ScholarGoogle Scholar
  21. Lin T Y, Dollár P, Girshick R, Feature pyramid networks for object detection. Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117-2125.Google ScholarGoogle Scholar
  22. Liu S, Qi L, Qin H, Path aggregation network for instance segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 8759-8768.Google ScholarGoogle Scholar
  23. Ge Z, Liu S, Wang F, Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430, 2021.Google ScholarGoogle Scholar
  24. Li X, Wang W, Wu L, Generalized focal loss: Learning qualified and distributed bounding boxes for dense object detection. Advances in Neural Information Processing Systems, 2020, 33: 21002-21012.Google ScholarGoogle Scholar
  25. Cubuk E D, Zoph B, Mane D, Autoaugment: Learning augmentation strategies from data. Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 113-123.Google ScholarGoogle Scholar
  26. Wang X, Xie L, Dong C, Real-ESRGAN: Training real-world blind super-resolution with pure synthetic data. Proceedings of the IEEE/CVF international conference on computer vision. 2021: 1905-1914.Google ScholarGoogle Scholar
  27. Lin T Y, Maire M, Belongie S, Microsoft coco: Common objects in context. Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer International Publishing, 2014: 740-755.Google ScholarGoogle Scholar
  28. Woo S, Park J, Lee J Y, Cbam: Convolutional block attention module. Proceedings of the European conference on computer vision (ECCV). 2018: 3-19.Google ScholarGoogle Scholar
  29. Roy A G, Navab N, Wachinger C. Concurrent spatial and channel ‘squeeze & excitation’ in fully convolutional networks. Medical Image Computing and Computer Assisted Intervention–MICCAI 2018: 21st International Conference, Granada, Spain, September 16-20, 2018, Proceedings, Part I. Springer International Publishing, 2018: 421-429.Google ScholarGoogle Scholar
  30. Song K, Yan Y. A noise robust method based on completed local binary patterns for hot-rolled steel strip surface defects. Applied Surface Science, 2013, 285: 858-864.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. DCS-YOLOv8: An Improved Steel Surface Defect Detection Algorithm Based on YOLOv8

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICIGP '24: Proceedings of the 2024 7th International Conference on Image and Graphics Processing
      January 2024
      480 pages
      ISBN:9798400716720
      DOI:10.1145/3647649

      Copyright © 2024 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 May 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)13

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format