Abstract
The traditional manual inspection mode is inefficient for detecting transmission line insulators. Even in the case of the detection system generated by combining aerial images by unmanned aerial vehicles with traditional machine vision algorithms, the detection accuracy and response speed have been increasingly unable to meet the requirements of modern power-grid construction. However, with the development of deep learning image processing technology, its deep level neural network can simulate the human brain to automatically extract the rich feature expression of the insulator image coupled with network training to quickly provide the final recognition results, improving the detection performance of the insulator defect detection technology based on this optimization method. Therefore, this study uses the deep learning object detection network YOLOX to classify and locate transmission line insulators. Accordingly, this study introduces the convolutional block attention module (CBAM) theory to optimize the YOLOX network, further enhancing the performance of the network model. The experimental results show that after introducing the CBAM, the detection accuracy of YOLOX on the insulator dataset herein has been improved by ~ 3% and the performance of the model has been optimized to some extent.
Similar content being viewed by others
Data availability
All data generated or analysed during this study are included in this published article.
References
Fan P, Shen HM, Zhao C, Wei Z, Yao JG, Zhou ZQ, Fu R, Hu Q (2021) Defect identification detection research for insulator of transmission lines based on deep learning. J Phys Conf Ser 1828:012019. https://doi.org/10.1088/1742-6596/1828/1/012019
Siddiqui ZA, Park U (2020) A drone based transmission line components inspection system with deep learning technique. Energies 13:3348. https://doi.org/10.3390/en13133348
Loukas C, Kelekis N (2022) Editorial for MRI-based multiple instance convolutional neural network (MICNN) for increased accuracy in the differentiation of borderline and malignant epithelial ovarian tumors. J Magn Reson Imaging 56:182–183. https://doi.org/10.1002/jmri.28008
Park J, Jun MB, Yun H (2022) Development of robotic bin picking platform with cluttered objects using human guidance and convolutional neural network (CNN). J Manuf Syst 63:539–549. https://doi.org/10.1016/j.jmsy.2022.05.011
Rahman S, Ramli M, Arnia F, Muharar R, Sembiring A (2021) Performance analysis of mAlexnet by training option and activation function tuning on parking images. IOP Conf Ser: Mater Sci Eng 1087:012084. https://doi.org/10.1088/1757-899X/1087/1/012084
Malini A, Priyadharshini P, Sabeena S (2021) An automatic assessment of road condition from aerial imagery using modified VGG architecture in faster-RCNN framework. J Intell Fuzzy Sys Appl Eng Technol 40:11411–11422. https://doi.org/10.3233/JIFS-202596
Ahmed KT, Jaffar S, Hussain MG, Fareed S, Mehmood A, Choi GS (2021) Maximum response deep learning using markov, retinal & primitive patch binding with GoogLeNet & VGG-19 for large image retrieval. IEEE Access 9:41934–41957. https://doi.org/10.1109/ACCESS.2021.3063545
Ikechukwu AV, Murali S, Deepu R, Shivamurthy RC (2021) ResNet-50 vs VGG-19 vs training from scratch: a comparative analysis of the segmentation and classification of pneumonia from chest X-ray images. Glob Transit Proc 2:375–381. https://doi.org/10.1016/j.gltp.2021.08.027
Mehta P (2021) The darknet and smarter crime: methods for investigating criminal entrepreneurs and the illicit drug economy. Comput Rev 7:62
Al-Wajih E, Ghazali R (2023) Threshold center-symmetric local binary convolutional neural networks for bilingual handwritten digit recognition. Knowl Based Syst 259:110079. https://doi.org/10.1016/j.knosys.2022.110079
Al-Wajih E, Ghazali R (2021) Improving the performance of local binary convolutional neural networks for bilingual digit recognition. International Conference on Data Analytics for Business and Industry (ICDABI), Sakheer, Bahrain, pp 587–591
Einy S, Sen E, Saygin H, Hivehchi H, Dorostkar Navaei Y (2023) Local binary convolutional neural networks’ long short-term memory model for human embryos’ anomaly detection. Sci Program 2023:11. https://doi.org/10.1155/2023/2426601
Mei H, Jiang H, Yin F, Wang L, Farzaneh M (2021) Terahertz imaging method for composite insulator defects based on edge detection algorithm. IEEE Trans Instrum Meas 70:1–10. https://doi.org/10.1109/TIM.2021.3075031
Alkentar SM, Alsahwa B, Assalem A, Karakolla D (2021) Practical comparation of the accuracy and speed of YOLO, SSD and faster RCNN for drone detection. J Eng 27:19–31. https://doi.org/10.31026/j.eng.2021.08.02
Xing Z, Chen X, Pang F (2022) DD-YOLO: an object detection method combining knowledge distillation and differentiable architecture search. IET Comput Vis 16:418–430. https://doi.org/10.1049/cvi2.12097
Ge Z, Liu S, Wang F, Li Z, Sun J (2021) YOLOX: exceeding YOLO series in 2021. arXiv e-prints 2107.08430
Wang P, Niu T, He D (2021) Tomato young fruits detection method under near color background based on improved faster R-CNN with attention mechanism. Agriculture 11. https://doi.org/10.3390/agriculture11111059
Zhou H, Zhao Y, Xiang W (2022) Method for judging parking status based on yolov2 target detection algorithm. Proced Comput Sci 199:1355–1362. https://doi.org/10.1016/j.procs.2022.01.171
Xu H, Guo M, Nedjah N, Zhang J, Li P (2022) Vehicle and pedestrian detection algorithm based on lightweight YOLOv3-promote and semi-precision acceleration. IEEE Trans Intell Transp Syst 99:1–12. https://doi.org/10.1109/TITS.2021.3137253
Wang K, Liu M (2022) Toward structural learning and enhanced YOLOv4 network for object detection in optical remote sensing images. Adv Theor Simul 5:5. https://doi.org/10.1002/adts.202200002
Li Y, Ni M, Lu Y (2022) Insulator defect detection for power grid based on light correction enhancement and YOLOv5 model. Energy Rep 8:807–814
Wang C-Y, Bochkovskiy A, Liao H-YM (2022) YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv:2207.02696v1. https://doi.org/10.48550/arXiv.2207.02696
Armstrong Aboah B, Wang U, Bagci, Yaw Adu-Gyamfi (2023) Real-time multi-class helmet violation detection using few-shot data sampling technique and YOLOv8. Computer Vision and Pattern Recognition. arXiv:2304.08256. https://doi.org/10.48550/arXiv.2304.08256
Tan YS, Lim KM, Tee C, Lee CP, Low CY (2021) Convolutional neural network with spatial pyramid pooling for hand gesture recognition. Neural Comput Appl 33:5339–5351
Bhatti UA, Yu Z, Chanussot J, Zeeshan Z, Yuan L, Luo W, Nawaz SA, Bhatti MA, Ain QU, Mehmood A (2022) Local similarity-based spatial–spectral fusion hyperspectral image classification with deep CNN and gabor filtering. IEEE Trans Geosci Remote Sens 60:1–15. https://doi.org/10.1109/TGRS.2021.3090410
Chen J, Zhang D, Suzauddola M, Nanehkaran YA, Sun Y (2021) Identification of plant disease images via a squeeze-and‐excitation mobileNet model and twice transfer learning. I E T Image Process 15:1115–1127. https://doi.org/10.1049/ipr2.12090
Wang X, Jia X, Zhang M, Lu H (2023) Object detection in 3D point cloud based on ECA mechanism. J Circuits Syst Comput 32. https://doi.org/10.1142/S0218126623500809
Lin F, Hou T, Jin Q, You A (2021) Improved YOLO based detection algorithm for floating debris in waterway. Entropy 23:1111. https://doi.org/10.3390/e23091111
Cao Z, Shao M, Xu L, Mu S, Qu H (2020) MaskHunter: real-time object detection of face masks during the COVID-19 pandemic. IET Image Process 14:4359–4367. https://doi.org/10.1049/iet-ipr.2020.1119
Zheng L, Zhang X, Hu J, Gao Y, Zhang X, Zhang M, Li S, Zhou X, Niu T, Lu Y, Wang D (2020) Establishment and applicability of a diagnostic system for advanced gastric cancer T staging based on a faster region-based convolutional neural network. Front Oncol 10:1238. https://doi.org/10.3389/fonc.2020.01238
Nawwar NM, Kasban H, Salama M (2021) Improvement of confusion matrix for hand vein recognition based on deep-learning multi-classifier decisions. Egypt Soc Nucl Sci Appl 1–14. https://doi.org/10.21608/ajnsa.2021.70450.1460
Feng X, Zhang T (2021) Panoptic segmentation algorithm based on grouped convolution for feature fusion. J Comput Appl 41:2054–2061. https://doi.org/10.11772/j.issn.1001-9081.2020091523
Hripcsak G, Rothschild AS (2005) Agreement, the F-measure, and reliability in information retrieval. J Am Med Inform Assoc 12:296–298
Rojas-Perez LO, Martinez-Carranza J (2021) Towards autonomous drone racing without GPU using an OAK-D smart camera. Sensors 21:7436. https://doi.org/10.3390/s21227436
Zhang Z-D et al (2022) FINet: an Insulator dataset and detection Benchmark based on Synthetic Fog and Improved YOLOv5. IEEE Trans Instrum Meas 71:1–8
Chen Z, Wu R, Lin Y, Li C, Chen S, Yuan Z, Chen S, Zou X (2022) Plant disease recognition model based on improved YOLOv5. Agronomy 12(2):365. https://doi.org/10.3390/agronomy12020365
Lin J, Bai D, Xu R, Lin H (2023) TSBA-YOLO: an improved tea diseases detection model based on attention mechanisms and feature fusion. Forests 14(3):619. https://doi.org/10.3390/f14030619
Acknowledgements
This work was sponsored by Shanxi Provincial Higher Education Science and Technology Innovation Project (Grant no. 2022L524) and Shanxi Provincial Basic Research Program(Grant no. 202103021223048).
Funding
There was no specific funding for this study.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yang, K., Zhang, Y., Zhang, X. et al. YOLOX with CBAM for insulator detection in transmission lines. Multimed Tools Appl 83, 43419–43437 (2024). https://doi.org/10.1007/s11042-023-17245-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17245-1