Abstract
Aiming at the problems of insufficient feature fusion, large number of network parameters and low target saliency in the current digital meter detection algorithm, a digital meter detection algorithm based on YOLOv5s is designed. First, a new feature fusion structure Bi-Directional Feature Pyramid Network Based on Multi-Scale Feature Fusion is designed to realize the full fusion between different scale feature maps and improve the detection accuracy; second, a new convolutional module Ghostconv Combined Channel Shuffle, is designed to realize the lightweight design of the network and meet the requirements of real-time substation detection tasks; finally, to improve the network’s ability to characterize the instrumented digits, the Convolutional Block Attention Module is introduced in the backbone network to further enhance the network performance. Experiments are carried out on the homemade dataset, and the experimental results show that the algorithm proposed in this paper improves the average accuracy by 2.84–98.58% compared with the original network; the amount of network parameters is reduced by 26.4%, and the detection speed is improved by 25 FPS, and the detection time for each image is only 0.012 s. Compared with other digital meter detection algorithms, the network performance and the number of parameters also have a great advantage.
Similar content being viewed by others
Data Availability
The code involved in this paper will not be disclosed for the time being due to the need for subsequent research, and will be available for others to use after the overall project is completed.
References
Liu, Y., Liu, J., Ke, Y.: A detection and recognition system of pointer meters in substations based on computer vision. Measurement 152, 107333 (2020)
Xiong, S., Liu, Y., Yan, Y., Pei, L., Xu, P., Fu, X., Jiang, X.: Object recognition for power equipment via human-level concept learning. IET Gen. Transm. Distrib. 15, 1578–1587 (2021)
Xu, Q., Huang, H., Zhou, C., Zhang, X.: Research on real-time infrared image fault detection of substation high-voltage lead connectors based on improved YOLOv3 network. Electronics 10, 544 (2021)
Li, Y., Huang, H., Xie, Q., Yao, L., Chen, Q.: Research on a surface defect detection algorithm based on MobileNet-SSD. Appl. Sci. 8, 1678 (2018)
Alegria, E.C., Serra, A.C.: Automatic calibration of analog and digital measuring instruments using computer vision. IEEE Trans. Instrum. Meas. 49, 94–99 (2000)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Zhang, T., Zhang, X., Ke, X., Liu, C., Xu, X., Zhan, X., Wang, C., Ahmad, I., Zhou, Y., Pan, D., et al.: HOG-ShipCLSNet: a novel deep learning network with hog feature fusion for SAR ship classification. IEEE Trans. Geosci. Remote Sens. 60, 1–22 (2021)
Yu, S., Li, X., Zhang, X., Wang, H.: The OCS-SVM: an objective-cost-sensitive SVM with sample-based misclassification cost invariance. IEEE Access 7, 118931–118942 (2019)
Xiaoxiao, C., Hua, F., Guoqing, Y., Hao, Z., Yan, D.: A new method of digital number recognition for substation inspection robot. In: 2016 4th International Conference on Applied Robotics for the Power Industry (CARPI), pp. 1–4 (2016)
Lv, Q., Rao, Y., Zeng, S., Huang, C., Cheng, Z.: Small-scale robust digital recognition of meters under unstable and complex conditions. IEEE Trans. Instrum. Meas. 71, 1–13 (2022)
Tan, M., Pang, R., Le, Q.V.: Efficientdet: Scalable and efficient object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)
Liu, J., Wu, H., Chen, Z.: Automatic detection and recognition method of digital instrument representation. In: 2021 4th International Conference on Intelligent Autonomous Systems (ICoIAS), pp. 139–143 (2021)
Xie, Y., Wang, C., Hu, X., Lin, X., Zhang, Y., Li, W.: An MPC-based control strategy for electric vehicle battery cooling considering energy saving and battery lifespan. IEEE Trans. Veh. Technol. 69, 14657–14673 (2020)
Pan, X., Tang, F., Dong, W., Gu, Y., Song, Z., Meng, Y., Xu, P., Deussen, O., Xu, C.: Self-supervised feature augmentation for large image object detection. IEEE Trans. Image Process. 29, 6745–6758 (2020)
Li, G., Gan, Y., Wu, H., Xiao, N., Lin, L.: Cross-modal attentional context learning for RGB-D object detection. IEEE Trans. Image Process. 28, 1591–1601 (2018)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Girshick, R.: Fast r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Zhang, Z., Hua, Z., Tang, Y., Zhang, Y., Lu, W., Dai, C.: Recognition method of digital meter readings in substation based on connected domain analysis algorithm. Actuators 10, 170 (2021)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 28 (2015)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: Ssd: single shot multibox detector. In: Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14, pp. 21–37 (2016)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–778 (2016)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Redmon, J., Farhadi, A.: Yolov3: An incremental improvement (2018). arXiv preprint arXiv:1804.02767
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: Yolov4: Optimal speed and accuracy of object detection (2020). arXiv preprint arXiv:2004.10934
Jocher, G., Stoken, A., Borovec, J., Chaurasia, A., Changyu, L., Hogan, A., Hajek, J., Diaconu, L., Kwon, Y., Defretin, Y., et al.: ultralytics/yolov5: v5. 0-YOLOv5-P6 1280 models, AWS, Supervise. ly and YouTube integrations, Zenodo (2021)
Zhou, W., Peng, J., Han, Y.: Deep learning-based intelligent reading recognition method of the digital multimeter. In: 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 3272–3277 (2021)
Wang, C.-Y., Liao, H.-Y.M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., Yeh, I.-H.: CSPNet: a new backbone that can enhance learning capability of CNN. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp. 390–391 (2020)
Liu, S., Qi, L., Qin, H., Shi, J., Jia, J.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)
Zheng, Z., Wang, P., Ren, D., Liu, W., Ye, R., Hu, Q., Zuo, W.: Enhancing geometric factors in model learning and inference for object detection and instance segmentation. IEEE Trans. Cybern. 52, 8574–8586 (2021)
Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: Ghostnet: more features from cheap operations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1580–1589 (2020)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: Cbam: Convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018)
Acknowledgements
This work is supported by the National Nature Science Foundation of China (NSFC, Grant no. 52074213), and by the National key research and development program in China ( Grant no. 2021YFE0105000).
Author information
Authors and Affiliations
Contributions
HZ: Put forward suggestions for paper revision; ZX: The main algorithm design and improvement optimization and writing papers; LH: Provide the main ideas and suggestions of the thesis; XM: Collect part of the data needed for the paper; ZZ: Draw part of the model diagram; WZ: Provide experimental equipment; WW: Provide financial support; all the authors reviewed the paper.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hao, Z., Zhang, X., Li, H. et al. Real-time detection algorithm for digital meters based on multi-scale feature fusion and GCS. J Real-Time Image Proc 21, 35 (2024). https://doi.org/10.1007/s11554-023-01408-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11554-023-01408-2