PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects

Wang, Penglin; Shi, Donghui; Aguilar, Jose

doi:10.1007/s11760-024-03666-4

PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects

Original Paper
Published: 05 December 2024

Volume 19, article number 71, (2025)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Penglin Wang¹,
Donghui Shi^1,2 &
Jose Aguilar^3,4

347 Accesses
Explore all metrics

Abstract

The detection of small objects within multiscale defects amidst complex background interference presents a formidable challenge in industrial defect detection. To address this issue and achieve precise and expeditious identification in industrial defect detection, this study proposes PCP-YOLO, a novel network that incorporates a non-deep feature extraction module and a polarized filtering feature fusion module for small object defect detection. Initially, YOLOv8 is employed as the foundational model. Subsequently, a lightweight, non-deep feature extraction module, PotentNet, is designed and integrated into the backbone network. In the neck network, a feature fusion module incorporating polarized self-attention, C2f_ParallelPolarized, has been developed. Finally, CARAFE is utilized to substitute the original upsampling module in the neck network. The efficacy of this approach has been rigorously evaluated using three datasets: the publicly available NEU-DET and PKU-PCB datasets, and the real-world industrial dataset GC10-DET. The mAP@0.5 values achieved are 79.4%, 96.1%, and 77.6%, significantly outperforming other detection methods. The method also has a fast inference speed. These results demonstrate that PCP-YOLO exhibits substantial potential for rapid and accurate defect detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

YOLO-FGD: a fast lightweight PCB defect method based on FasterNet and the Gather-and-Distribute mechanism

Article 03 July 2024

DMC-Net: a lightweight network for real-time surface defect segmentation

Article 18 February 2025

CSC-YOLO: An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates

Article 23 April 2024

Data availability

No datasets were generated or analysed during the current study.

References

Luo, Q., Fang, X., Liu, L., Yang, C., Sun, Y.: Automated visual defect detection for flat steel surface: a survey. IEEE Trans. Instrum. Meas. 69(3), 626–644 (2020)
Article MATH Google Scholar
Zhang, Y., Zhang, H., Huang, Q., Han, Y., Zhao, M.: DsP-YOLO: an anchor-free network with DsPAN for small object detection of multiscale defects. Expert Syst. Appl. 241, 122669–122685 (2024)
Article MATH Google Scholar
Dong, X., Zhang, C., Wang, J., Chen, Y., Wang, D.: Real-time detection of surface cracking defects for large-sized stamped parts. Comput. Ind. 159, 104105–104119 (2024)
Article MATH Google Scholar
Gao, Y., Gao, L., Li, X., Yan, X.: A semi-supervised convolutional neural network-based method for steel surface defect recognition. Robot. Comput. Integr. Manuf. 61, 101825–101832 (2020)
Article MATH Google Scholar
Wang, R., Yu, H., Tang, J., Feng, B., Kang, Y., Song, K.: Optimal design of iron-cored coil sensor in magnetic flux leakage detection of thick-walled steel pipe. Meas. Sci. Technol. 34(8), 085123–085133 (2023)
Article Google Scholar
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: Ssd: single shot multibox detector. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14, pp. 21–37. Springer (2016)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Girshick, R.: Fast r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intel. 39(6), 1137–1149 (2016)
Article MATH Google Scholar
Tian, R., Jia, M.: Dcc-centernet: a rapid detection method for steel surface defects. Measurement 187, 110211–110225 (2022)
Article MATH Google Scholar
Aboah, A., Wang, B., Bagci, U., Adu-Gyamfi, Y.: Real-time multi-class helmet violation detection using few-shot data sampling technique and yolov8. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5349–5357 (2023)
Safaldin, M., Zaghden, N., Mejdoub, M.: An improved yolov8 to detect moving objects. IEEE Access 12, 59782–59806 (2024)
Article Google Scholar
Dong, H., Yuan, M., Wang, S., Zhang, L., Bao, W., Liu, Y., Hu, Q.: Pham-yolo: a parallel hybrid attention mechanism network for defect detection of meter in substation. Sensors 23(13), 6052–6061 (2023)
Article Google Scholar
Xu, L., Dong, S., Wei, H., Ren, Q., Huang, J., Liu, J.: Defect signal intelligent recognition of weld radiographs based on yolo v5-improvement. J. Manuf. Process. 99, 373–381 (2023)
Article Google Scholar
Lu, Q., Lin, J., Luo, L., Zhang, Y., Zhu, W.: A supervised approach for automated surface defect detection in ceramic tile quality control. Adv. Eng. Inform. 53, 101692–101704 (2022)
Article Google Scholar
Ling, Q., Isa, N.A.M., Asaari, M.S.M.: Precise detection for dense pcb components based on modified yolov8. IEEE Access 11, 116545–116560 (2023)
Article Google Scholar
Yang, S., Wang, W., Gao, S., Deng, Z.: Strawberry ripeness detection based on YOLOv8 algorithm fused with LW-Swin transformer. Comput. Electron. Agric. 215, 108360–108369 (2023)
Article Google Scholar
Cao, Y., Pang, D., Zhao, Q., Yan, Y., Jiang, Y., Tian, C., Wang, F., Li, J.: Improved yolov8-gd deep learning model for defect detection in electroluminescence images of solar photovoltaic modules. Eng. Appl. Artif. Intell. 131, 107866–107876 (2024)
Article MATH Google Scholar
Chen, G., Hou, Y., Cui, T., Li, H., Shangguan, F., Cao, L.: Yolov8-cml: a lightweight target detection method for color-changing melon ripening in intelligent agriculture. Sci. Rep. 14(1), 14400–14410 (2024)
Article Google Scholar
Zhao, L., Liu, J., Ren, Y., Lin, C., Liu, J., Abbas, Z., Islam, M.S., Xiao, G.: Yolov8-qr: an improved yolov8 model via attention mechanism for object detection of qr code defects. Comput. Electr. Eng. 118, 109376–109390 (2024)
Article Google Scholar
Zhao, C., Shu, X., Yan, X., Zuo, X., Zhu, F.: Rdd-yolo: a modified yolo for detection of steel surface defects. Measurement 214, 112776–112780 (2023)
Article Google Scholar
Wang, Y., Wang, H., Xin, Z.: Efficient detection model of steel strip surface defects based on yolo-v7. IEEE Access 10, 133936–133944 (2022)
Article MATH Google Scholar
Qian, X., Wang, X., Yang, S., Lei, J.: Lff-yolo: a yolo algorithm with lightweight feature fusion network for multi-scale defect detection. IEEE Access 10, 130339–130349 (2022)
Article Google Scholar
Xie, W., Sun, X., Ma, W.: A light weight multi-scale feature fusion steel surface defect detection model based on yolov8. Meas. Sci. Technol. 35, 055017–055037 (2024)
Article Google Scholar
Su, B., Chen, H., Zhou, Z.: Baf-detector: an efficient cnn-based detector for photovoltaic cell defect detection. IEEE Trans. Ind. Electron. 69(3), 3161–3171 (2021)
Article MATH Google Scholar
Li, L., Wang, Z., Zhang, T.: Gbh-yolov5: ghost convolution with bottleneckcsp and tiny target prediction head incorporating yolov5 for pv panel defect detection. Electronics 12(3), 561–571 (2023)
Liu, Z., Abeyrathna, R.R.D., Sampurno, R.M., Nakaguchi, V.M., Ahamed, T.: Faster-yolo-ap: a lightweight apple detection algorithm based on improved yolov8 with a new efficient pdwconv in orchard. Comput. Electron. Agric. 223, 109118–109127 (2024)
Article Google Scholar
Li, Y., Fan, Q., Huang, H., Han, Z., Gu, Q.: A modified yolov8 detection network for UAV aerial image recognition. Drones 7(5), 304–314 (2023)
Article MATH Google Scholar
Ding, X., Zhang, X., Ma, N., Han, J., Ding, G., Sun, J.: Repvgg: making vgg-style convnets great again. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13733–13742 (2021)
Wang, J., Chen, K., Xu, R., Liu, Z., Loy, C.C., Lin, D.: Carafe: content-aware reassembly of features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 3007–3016 (2019)
He, Y., Song, K., Meng, Q., Yan, Y.: An end-to-end steel surface defect detection approach via fusing multiple hierarchical features. IEEE Trans. Instrum. Meas. 69(4), 1493–1504 (2019)
Article MATH Google Scholar
Lv, X., Duan, F., Jiang, J.-J., Fu, X., Gan, L.: Deep metallic surface defect detection: the new benchmark and detection network. Sensors 20(6), 1562–1571 (2020)
Article MATH Google Scholar
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Panboonyuen, T., Thongbai, S., Wongweeranimit, W., Santitamnont, P., Suphan, K., Charoenphon, C.: Object detection of road assets using transformer-based yolox with feature pyramid decoder on Thai highway panorama. Information 13(1), 5–15 (2021)
Article Google Scholar
Wang, C.-Y., Bochkovskiy, A., Liao, H.-Y.M.: Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7464–7475 (2023)
Liu, R., Huang, M., Gao, Z., Cao, Z., Cao, P.: Msc-dnet: an efficient detector with multi-scale context for defect detection on strip steel surface. Measurement 209, 112467–112482 (2023)
Article MATH Google Scholar
Zhang, D., Hao, X., Liang, L., Liu, W., Qin, C.: A novel deep convolutional neural network algorithm for surface defect detection. J. Comput. Des. Eng. 9(5), 1616–1632 (2022)
MATH Google Scholar

Download references

Acknowledgements

This study was supported by the Director’s Fund of the Anhui Province Key Laboratory of Intelligent Building and Building Energy Saving, Anhui Jianzhu University (Grant No. IBES2024ZR01), the Mass Spectrometry Key Technology R&D and Clinical Application of Anhui Province Jointly Constructed Discipline Key Experiments (GRANT: 2023ZPLH07), and the Anhui Province Graduate Education Quality Engineering Project (GRANT: 2023cxcysj129).

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Anhui Jianzhu University, Hefei, 230601, Anhui, China
Penglin Wang & Donghui Shi
Anhui Province Key Laboratory of Intelligent Building and Building Energy Saving, Anhui Jianzhu University, Hefei, 230022, Anhui, China
Donghui Shi
Grupo de Investigación en I+D+i en TIC, Universidad EAFIT, Medellín, Colombia
Jose Aguilar
Centro de Estudios en Microelectrónica y Sistemas Distribuidos, Universidad de Los Andes, Merida, Venezuela
Jose Aguilar

Authors

Penglin Wang
View author publications
You can also search for this author inPubMed Google Scholar
Donghui Shi
View author publications
You can also search for this author inPubMed Google Scholar
Jose Aguilar
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

PW (First Author): made substantial contributions to the conception of the work; the acquisition, analysis and interpretation of data; drafted the work. DS (Corresponding author): revised it critically for important intellectual content. JA: agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy.

Corresponding author

Correspondence to Donghui Shi.

Ethics declarations

Conflict of interest

The authors declare no Conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, P., Shi, D. & Aguilar, J. PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects. SIViP 19, 71 (2025). https://doi.org/10.1007/s11760-024-03666-4

Download citation

Received: 11 July 2024
Revised: 20 September 2024
Accepted: 28 September 2024
Published: 05 December 2024
DOI: https://doi.org/10.1007/s11760-024-03666-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PCP-YOLO: an approach integrating non-deep feature enhancement module and polarized self-attention for small object detection of multiscale defects

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

YOLO-FGD: a fast lightweight PCB defect method based on FasterNet and the Gather-and-Distribute mechanism

DMC-Net: a lightweight network for real-time surface defect segmentation

CSC-YOLO: An Image Recognition Model for Surface Defect Detection of Copper Strip and Plates

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now