Abstract
To address the issues of low real-time performance and poor algorithm accuracy in detecting miner behavior underground, we propose a high-precision real-time detection method named DSY-YOLOv8n based on the characteristics of human body behavior. This method integrates DSConv into the backbone network to enhance multi-scale feature extraction. Additionally, SCConv-C2f replaces C2f modules, reducing redundant calculations and improving model training speed. The optimization strategy of the loss function is employed, and MPDIoU is used to improve the model’s accuracy and speed. The experimental results show: (1) With almost no increase in parameters and calculation amount, the mAP50 of the DSY-YOLOv8n model is 97.4%, which is a 3.2% great improvement over the YOLOv8n model. (2) Compared to Faster-R-CNN, YOLOv5s, and YOLOv7, DYS-YOLOv8n has improved the average accuracy to varying degrees while significantly increasing the detection speed. (3) DYS-YOLOv8n meets the real-time requirements for behavioral detection in mines with a detection speed of 243FPS. In summary, the DYS-YOLOv8n offers a real-time, efficient, and lightweight method for detecting miner behavior in mines, which has high practical value.








We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Data availability
Data will be made available on request.
References
Xiaobin, Y., Shilu, Z., Na, L.I., Xiaoyao, W.: Deep learning and its application in coal mine safety. Safety in Coal Mines (2019)
Wu, B., Wang, J., Zhong, M., Xu, C., Qu, B.: Multidimensional analysis of coal mine safety accidents in china—70 years review. In: Mining, Metallurgy & Exploration, pp. 1–10 (2022)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Computer vision & pattern recognition (2016)
Farhadi, A., Redmon, J.: Yolo9000: better, faster, stronger (2016)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv e-prints (2018)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection (2020)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. IEEE (2016)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks (2017)
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications (2017)
Zhu, X., Lyu, S., Wang, X., Zhao, Q.: Tph-yolov5: Improved yolov5 based on transformer prediction head for object detection on drone-captured scenarios (2021)
Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: exceeding yolo series in 2021 (2021)
Cao, X., Zhang, C., Wang, P., Wei, H., Huang, S., Li, H.: Unsafe mining behavior identification method based on an improved st-gcn. Sustainability 15(2) (2023)
Shi, X., Huang, J., Huang, B.: An underground abnormal behavior recognition method based on an optimized alphapose-st-gcn. J. Circuits Syst. Comput. (2022)
Liu, S., Bai, X., Fang, M., Li, L., Hung, C.C.: Mixed graph convolution and residual transformation network for skeleton-based action recognition. Appl. Intell. 1–12 (2021)
Zhang, P., Lan, C., Zeng, W., Xing, J., Zheng, N.: Semantics-guided neural networks for efficient skeleton-based human action recognition. In: 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020)
Yang, H., Gu, Y., Zhu, J., Hu, K., Zhang, X.: Pgcn-tca: pseudo graph convolutional network with temporal and channel-wise attention for skeleton-based action recognition. IEEE Access 8, 8 (2020)
Shi, L., Zhang, Y., Cheng, J., Lu, H.: Two-stream adaptive graph convolutional networks for skeleton-based action recognition (2018)
Rijayanti, R., Hwang, M., Jin, K.: Detection of anomalous behavior of manufacturing workers using deep learning-based recognition of human-object interaction. Appl. Sci. 13(15), 8584 (2023)
Li, X., Hao, T., Li, F., Zhao, L., Wang, Z.: Faster r-cnn-lstm construction site unsafe behavior recognition model. Appl. Sci. 13(19), 10700 (2023)
Yao, W., Wang, A., Nie, Y., Lv, Z., Nie, S., Huang, C., Liu, Z.: Study on the recognition of coal miners’ unsafe behavior and status in the hoist cage based on machine vision. Sensors 23(21), 8794 (2023)
Li, L., Zhang, P., Yang, S., Jiao, W.: Yolov5-sfe: an algorithm fusing spatio-temporal features for detecting and recognizing workers’ operating behaviors. Adv. Eng. Inform. 56, 101988 (2023)
Shao, X., Liu, S., Li, X., Lyu, Z., Li, H.: Rep-yolo: an efficient detection method for mine personnel. J. Real-Time Image Proc. 21(2), 1–16 (2024)
Li, X., Wang, S., Liu, B., Chen, W., Fan, W., Tian, Z.: Improved yolov4 network using infrared images for personnel detection in coal mines. J. Electron. Imaging 31(1), 013017 (2022)
Zhao, D., Guoyong, S., Cheng, G., Wang, P., Chen, W., Yang, Y.: Research on real-time perception method of key targets in the comprehensive excavation working face of coal mine. Meas. Sci. Technol. 35(1), 015410 (2023)
Zhi, X., Li, J., Meng, Y., Zhang, X.: Cap-yolo: channel attention based pruning yolo for coal mine real-time intelligent monitoring. Sensors 22(12), 4331 (2022)
Fan, Y., Mao, S., Li, M., Wu, Z., Kang, J.: Cm-yolov8: lightweight yolo for coal mine fully mechanized mining face (2024)
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. In: ICLR (2016)
Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., Wei, Y.: Deformable convolutional networks. IEEE (2017)
Qi, Y., He, Y., Qi, X., Zhang, Y., Yang, G.: Dynamic snake convolution based on topological geometric constraints for tubular structure segmentation. In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6047–6056 (2023)
Li, J., Wen, Y., He, L.: Scconv: spatial and channel reconstruction convolution for feature redundancy. In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6153–6162 (2023)
Ma, S., Yong, X.: Mpdiou: a loss for efficient and accurate bounding box regression (2023)
Yang, W., Zhang, X., Ma, B., Wang, Y., Wu, Y., Yan, J., Liu, Y., Zhang, C., Wan, J., Wang, Y.: An open dataset for intelligent recognition and classification of abnormal condition in longwall mining. Sci. Data 10(1) (2023)
Acknowledgements
This work is supported by the Xi’an Science and Technology Program (23ZDCYJSGG0025-2022), the General Project of Science and Technology Department of Shaanxi Province (2021JQ-574), the Science ResearchProgram of Shaanxi Educational Committee under Grant 23JC049, Science and Technology Innovation Fund Special project of Tiandi (Changzhou) Automation Co., Ltd. (2022TY2012).
Author information
Authors and Affiliations
Contributions
FX: methodology, software, formal analysis, investigation. XH: supervision, writing—original draft, writing—review editing, funding acquisition. CY: conceptualization, methodology, validation, supervision. SL: conceptualization, methodology, validation, supervision, modification. BM: conceptualization, methodology, validation, supervision, modification. HP: conceptualization, methodology, validation, supervision.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xin, F., He, X., Yao, C. et al. A real-time detection for miner behavior via DYS-YOLOv8n model. J Real-Time Image Proc 21, 92 (2024). https://doi.org/10.1007/s11554-024-01466-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11554-024-01466-0