Abstract
Bird's nests and suspended foreign objects such as plastic and rags are serious potential safety hazards on transmission lines. In unmanned aerial vehicle (UAV) images, these objects usually appear as small targets that occupy few pixels, carry more noise, and are susceptible to interference, which places higher demands on the detection algorithm. In this paper, a deep learning-based algorithm is designed to detect these two kinds of small targets. An attention mechanism module is added to the backbone network so that the importance of each part of the feature map extracted from the UAV image is refined along two different dimensions, and sufficient context learning is carried out to improve the detection of small targets. Furthermore, a post-processing algorithm based on Soft-NMS is designed to prevent small targets from being filtered out, which further improves small-target detection. Compared with the baseline Faster R-CNN, the proposed algorithm achieves a \(4.7\%\) improvement in average precision (AP).
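The Soft-NMS idea that the post-processing step builds on can be illustrated with a short sketch. The Python below implements the generic Gaussian variant of Soft-NMS (Bodla et al.), not the modified algorithm proposed in this paper, whose details are not given in the abstract. The function names, the (x1, y1, x2, y2) box format, and the values of sigma and score_thresh are assumptions chosen for illustration.

```python
import math


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Gaussian Soft-NMS (illustrative sketch, hypothetical parameter values).

    Instead of deleting boxes that overlap the current best box, their
    scores are decayed by exp(-IoU^2 / sigma). A box is dropped only when
    its decayed score falls below score_thresh.

    Returns a list of (original_index, final_score) in selection order.
    """
    scores = list(scores)  # copy so the caller's list is not modified
    remaining = list(range(len(boxes)))
    kept = []
    while remaining:
        # Select the highest-scoring box among those still remaining.
        best = max(remaining, key=lambda i: scores[i])
        remaining.remove(best)
        kept.append((best, scores[best]))
        survivors = []
        for i in remaining:
            overlap = iou(boxes[best], boxes[i])
            scores[i] *= math.exp(-(overlap ** 2) / sigma)
            if scores[i] >= score_thresh:
                survivors.append(i)
        remaining = survivors
    return kept


# Two heavily overlapping boxes and one isolated box (made-up coordinates).
example_boxes = [(10, 10, 30, 30), (12, 12, 32, 32), (100, 100, 120, 120)]
example_scores = [0.9, 0.8, 0.7]
example_result = soft_nms(example_boxes, example_scores)
```

In this made-up example the first two boxes overlap with an IoU of about 0.68. Hard NMS with a typical IoU threshold of 0.5 would discard the second box outright, whereas the sketch keeps it with a reduced score, which is the behavior that helps avoid filtering out closely spaced small targets.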
Acknowledgement
This work was supported in part by the Key R&D Project of Shandong Province under Grant No. 2022CXGC010503, the Youth Foundation of Shandong Province under Grant No. ZR202102230323, the National Natural Science Foundation for Young Scientists of China under Grant No. 61903155, and the Doctoral Scientific Fund Project under Grant No. xbs1910.
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Wang, W., Meng, P., Huang, W., Zhang, M., Qiao, J., Zhang, Y. (2022). Improved Faster R-CNN Algorithm for Transmission Line Small Target Detection. In: Zhang, H., et al. Neural Computing for Advanced Applications. NCAA 2022. Communications in Computer and Information Science, vol 1638. Springer, Singapore. https://doi.org/10.1007/978-981-19-6135-9_32
DOI: https://doi.org/10.1007/978-981-19-6135-9_32
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-6134-2
Online ISBN: 978-981-19-6135-9
eBook Packages: Computer Science (R0)