Skip to main content

Advertisement

Log in

YOLOX with CBAM for insulator detection in transmission lines

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The traditional manual inspection mode is inefficient for detecting transmission line insulators. Even in the case of the detection system generated by combining aerial images by unmanned aerial vehicles with traditional machine vision algorithms, the detection accuracy and response speed have been increasingly unable to meet the requirements of modern power-grid construction. However, with the development of deep learning image processing technology, its deep level neural network can simulate the human brain to automatically extract the rich feature expression of the insulator image coupled with network training to quickly provide the final recognition results, improving the detection performance of the insulator defect detection technology based on this optimization method. Therefore, this study uses the deep learning object detection network YOLOX to classify and locate transmission line insulators. Accordingly, this study introduces the convolutional block attention module (CBAM) theory to optimize the YOLOX network, further enhancing the performance of the network model. The experimental results show that after introducing the CBAM, the detection accuracy of YOLOX on the insulator dataset herein has been improved by ~ 3% and the performance of the model has been optimized to some extent.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Data availability

All data generated or analysed during this study are included in this published article.

References

  1. Fan P, Shen HM, Zhao C, Wei Z, Yao JG, Zhou ZQ, Fu R, Hu Q (2021) Defect identification detection research for insulator of transmission lines based on deep learning. J Phys Conf Ser 1828:012019. https://doi.org/10.1088/1742-6596/1828/1/012019

    Article  Google Scholar 

  2. Siddiqui ZA, Park U (2020) A drone based transmission line components inspection system with deep learning technique. Energies 13:3348. https://doi.org/10.3390/en13133348

    Article  Google Scholar 

  3. Loukas C, Kelekis N (2022) Editorial for MRI-based multiple instance convolutional neural network (MICNN) for increased accuracy in the differentiation of borderline and malignant epithelial ovarian tumors. J Magn Reson Imaging 56:182–183. https://doi.org/10.1002/jmri.28008

    Article  Google Scholar 

  4. Park J, Jun MB, Yun H (2022) Development of robotic bin picking platform with cluttered objects using human guidance and convolutional neural network (CNN). J Manuf Syst 63:539–549. https://doi.org/10.1016/j.jmsy.2022.05.011

    Article  Google Scholar 

  5. Rahman S, Ramli M, Arnia F, Muharar R, Sembiring A (2021) Performance analysis of mAlexnet by training option and activation function tuning on parking images. IOP Conf Ser: Mater Sci Eng 1087:012084. https://doi.org/10.1088/1757-899X/1087/1/012084

    Article  Google Scholar 

  6. Malini A, Priyadharshini P, Sabeena S (2021) An automatic assessment of road condition from aerial imagery using modified VGG architecture in faster-RCNN framework. J Intell Fuzzy Sys Appl Eng Technol 40:11411–11422. https://doi.org/10.3233/JIFS-202596

    Article  Google Scholar 

  7. Ahmed KT, Jaffar S, Hussain MG, Fareed S, Mehmood A, Choi GS (2021) Maximum response deep learning using markov, retinal & primitive patch binding with GoogLeNet & VGG-19 for large image retrieval. IEEE Access 9:41934–41957. https://doi.org/10.1109/ACCESS.2021.3063545

    Article  Google Scholar 

  8. Ikechukwu AV, Murali S, Deepu R, Shivamurthy RC (2021) ResNet-50 vs VGG-19 vs training from scratch: a comparative analysis of the segmentation and classification of pneumonia from chest X-ray images. Glob Transit Proc 2:375–381. https://doi.org/10.1016/j.gltp.2021.08.027

    Article  Google Scholar 

  9. Mehta P (2021) The darknet and smarter crime: methods for investigating criminal entrepreneurs and the illicit drug economy. Comput Rev 7:62

    Google Scholar 

  10. Al-Wajih E, Ghazali R (2023) Threshold center-symmetric local binary convolutional neural networks for bilingual handwritten digit recognition. Knowl Based Syst 259:110079. https://doi.org/10.1016/j.knosys.2022.110079

    Article  Google Scholar 

  11. Al-Wajih E, Ghazali R (2021) Improving the performance of local binary convolutional neural networks for bilingual digit recognition. International Conference on Data Analytics for Business and Industry (ICDABI), Sakheer, Bahrain, pp 587–591

  12. Einy S, Sen E, Saygin H, Hivehchi H, Dorostkar Navaei Y (2023) Local binary convolutional neural networks’ long short-term memory model for human embryos’ anomaly detection. Sci Program 2023:11. https://doi.org/10.1155/2023/2426601

    Article  Google Scholar 

  13. Mei H, Jiang H, Yin F, Wang L, Farzaneh M (2021) Terahertz imaging method for composite insulator defects based on edge detection algorithm. IEEE Trans Instrum Meas 70:1–10. https://doi.org/10.1109/TIM.2021.3075031

    Article  Google Scholar 

  14. Alkentar SM, Alsahwa B, Assalem A, Karakolla D (2021) Practical comparation of the accuracy and speed of YOLO, SSD and faster RCNN for drone detection. J Eng 27:19–31. https://doi.org/10.31026/j.eng.2021.08.02

    Article  Google Scholar 

  15. Xing Z, Chen X, Pang F (2022) DD-YOLO: an object detection method combining knowledge distillation and differentiable architecture search. IET Comput Vis 16:418–430. https://doi.org/10.1049/cvi2.12097

    Article  Google Scholar 

  16. Ge Z, Liu S, Wang F, Li Z, Sun J (2021) YOLOX: exceeding YOLO series in 2021. arXiv e-prints 2107.08430

  17. Wang P, Niu T, He D (2021) Tomato young fruits detection method under near color background based on improved faster R-CNN with attention mechanism. Agriculture 11. https://doi.org/10.3390/agriculture11111059

  18. Zhou H, Zhao Y, Xiang W (2022) Method for judging parking status based on yolov2 target detection algorithm. Proced Comput Sci 199:1355–1362. https://doi.org/10.1016/j.procs.2022.01.171

    Article  Google Scholar 

  19. Xu H, Guo M, Nedjah N, Zhang J, Li P (2022) Vehicle and pedestrian detection algorithm based on lightweight YOLOv3-promote and semi-precision acceleration. IEEE Trans Intell Transp Syst 99:1–12. https://doi.org/10.1109/TITS.2021.3137253

    Article  Google Scholar 

  20. Wang K, Liu M (2022) Toward structural learning and enhanced YOLOv4 network for object detection in optical remote sensing images. Adv Theor Simul 5:5. https://doi.org/10.1002/adts.202200002

    Article  Google Scholar 

  21. Li Y, Ni M, Lu Y (2022) Insulator defect detection for power grid based on light correction enhancement and YOLOv5 model. Energy Rep 8:807–814

    Article  Google Scholar 

  22. Wang C-Y, Bochkovskiy A, Liao H-YM (2022) YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv:2207.02696v1. https://doi.org/10.48550/arXiv.2207.02696

  23. Armstrong Aboah B, Wang U, Bagci, Yaw Adu-Gyamfi (2023) Real-time multi-class helmet violation detection using few-shot data sampling technique and YOLOv8. Computer Vision and Pattern Recognition. arXiv:2304.08256. https://doi.org/10.48550/arXiv.2304.08256

  24. Tan YS, Lim KM, Tee C, Lee CP, Low CY (2021) Convolutional neural network with spatial pyramid pooling for hand gesture recognition. Neural Comput Appl 33:5339–5351

    Article  Google Scholar 

  25. Bhatti UA, Yu Z, Chanussot J, Zeeshan Z, Yuan L, Luo W, Nawaz SA, Bhatti MA, Ain QU, Mehmood A (2022) Local similarity-based spatial–spectral fusion hyperspectral image classification with deep CNN and gabor filtering. IEEE Trans Geosci Remote Sens 60:1–15. https://doi.org/10.1109/TGRS.2021.3090410

    Article  Google Scholar 

  26. Chen J, Zhang D, Suzauddola M, Nanehkaran YA, Sun Y (2021) Identification of plant disease images via a squeeze-and‐excitation mobileNet model and twice transfer learning. I E T Image Process 15:1115–1127. https://doi.org/10.1049/ipr2.12090

    Article  Google Scholar 

  27. Wang X, Jia X, Zhang M, Lu H (2023) Object detection in 3D point cloud based on ECA mechanism. J Circuits Syst Comput 32. https://doi.org/10.1142/S0218126623500809

  28. Lin F, Hou T, Jin Q, You A (2021) Improved YOLO based detection algorithm for floating debris in waterway. Entropy 23:1111. https://doi.org/10.3390/e23091111

    Article  Google Scholar 

  29. Cao Z, Shao M, Xu L, Mu S, Qu H (2020) MaskHunter: real-time object detection of face masks during the COVID-19 pandemic. IET Image Process 14:4359–4367. https://doi.org/10.1049/iet-ipr.2020.1119

    Article  Google Scholar 

  30. Zheng L, Zhang X, Hu J, Gao Y, Zhang X, Zhang M, Li S, Zhou X, Niu T, Lu Y, Wang D (2020) Establishment and applicability of a diagnostic system for advanced gastric cancer T staging based on a faster region-based convolutional neural network. Front Oncol 10:1238. https://doi.org/10.3389/fonc.2020.01238

    Article  Google Scholar 

  31. Nawwar NM, Kasban H, Salama M (2021) Improvement of confusion matrix for hand vein recognition based on deep-learning multi-classifier decisions. Egypt Soc Nucl Sci Appl 1–14. https://doi.org/10.21608/ajnsa.2021.70450.1460

  32. Feng X, Zhang T (2021) Panoptic segmentation algorithm based on grouped convolution for feature fusion. J Comput Appl 41:2054–2061. https://doi.org/10.11772/j.issn.1001-9081.2020091523

    Article  Google Scholar 

  33. Hripcsak G, Rothschild AS (2005) Agreement, the F-measure, and reliability in information retrieval. J Am Med Inform Assoc 12:296–298

    Article  Google Scholar 

  34. Rojas-Perez LO, Martinez-Carranza J (2021) Towards autonomous drone racing without GPU using an OAK-D smart camera. Sensors 21:7436. https://doi.org/10.3390/s21227436

    Article  Google Scholar 

  35. Zhang Z-D et al (2022) FINet: an Insulator dataset and detection Benchmark based on Synthetic Fog and Improved YOLOv5. IEEE Trans Instrum Meas 71:1–8

    Google Scholar 

  36. Chen Z, Wu R, Lin Y, Li C, Chen S, Yuan Z, Chen S, Zou X (2022) Plant disease recognition model based on improved YOLOv5. Agronomy 12(2):365. https://doi.org/10.3390/agronomy12020365

    Article  Google Scholar 

  37. Lin J, Bai D, Xu R, Lin H (2023) TSBA-YOLO: an improved tea diseases detection model based on attention mechanisms and feature fusion. Forests 14(3):619. https://doi.org/10.3390/f14030619

    Article  Google Scholar 

Download references

Acknowledgements

This work was sponsored by Shanxi Provincial Higher Education Science and Technology Innovation Project (Grant no. 2022L524) and Shanxi Provincial Basic Research Program(Grant no. 202103021223048).

Funding

There was no specific funding for this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kai Yang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, K., Zhang, Y., Zhang, X. et al. YOLOX with CBAM for insulator detection in transmission lines. Multimed Tools Appl 83, 43419–43437 (2024). https://doi.org/10.1007/s11042-023-17245-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17245-1

Keywords

Navigation