Abstract
Conveyor belts in coal mines are critical for coal extraction and safety. Detecting foreign objects in low-light underground environments is challenging. This paper presents an enhanced foreign body detection algorithm using an improved Dual-Model Low-Light Enhancement Algorithm (DLEA) and a lightweight Star Attention Region-based Convolutional Detection Transformer (SARC-DETR). The DLEA improves image quality in low-light conditions, while SARC-DETR, with its StarNet backbone and Efficient Additive Attention mechanism, reduces computational costs without compromising accuracy. A lightweight dynamic group efficient module network is proposed for optimized feature extraction, and the CIoU loss function further enhances positioning accuracy. Experimental results demonstrate a 4.7% precision improvement, a 2.7% increase in average precision, a 47.01% reduction in parameters, and an inference speed of 97.3 FPS. This approach significantly boosts detection accuracy and real-time performance in coal mine conveyor belt foreign object detection.












Similar content being viewed by others
Data availability
No datasets were generated or analysed during the current study.
Abbreviations
- DLEA:
-
Dual-model low-light enhancement algorithm
- LDGEM-Net:
-
Lightweight dynamic group efficient module network
- EAA:
-
Efficient additive attention
- CIoU:
-
Complete intersection over union
- GIoU:
-
Generalized intersection over union
- DIoU:
-
Distance intersection over union
- MPDIoU:
-
Mean projected distance intersection over union
- EIoU:
-
Efficient intersection over union
- SIoU:
-
Self-consistency intersection over union
- YOLO:
-
You only look once
- R-CNN:
-
Region-based convolutional neural network
- DUAL:
-
Dual illumination estimation
- LIME:
-
Lowlight image enhancement
- StarNet:
-
Star network
- DGSM:
-
Dynamic group convolution shuffle module
- DGST:
-
Dynamic group convolution shuffle transformer
References
Dai, L., Zhang, X., Gardoni, P., Lu, H., Liu, X., Krolczyk, G., Li, Z.: A new machine vision detection method for identifying and screening out various large foreign objects on coal belt conveyor lines. Complex Intell. Syst. 9, 5221–5234 (2023)
You, Q., Yao, Q., Song, R., Yu, K., Xu, C., Cao, H.: Multi-dimensional safety risk assessment on coal mines under the profitability dilemma. Sci. Rep. 13, 2687 (2023)
Zhao, M., Liu, H., Liu, C., Li, X., Li, F., Yang, X., Yang, Q., Ma, Q.: Spatial effect analysis of coal and gangue recognition detector based on natural gamma ray method. Nat. Resour. Res. 31, 953–969 (2022)
Xiao, J., Yao, Y., Zhou, J., Guo, H., Yu, Q., Wang, Y.F.: FDLR-Net: a feature decoupling and localization refinement network for object detection in remote sensing images. Expert Syst. Appl. 225, 120068 (2023)
Sharma, A., Shrivastava, B.P., Tyagi, P.K., Siddiqui, E.A., Prasad, R., Gautam, S., Pranjal, P.: Enhanced satellite image resolution with a residual network and correlation filter. Chemomet. Intell. Lab. Syst. 256, 105277 (2025)
Sharma, A., Shrivastava, B.P.: Complex wavelet transform with progressive network for medical imaging super resolution. Multimed. Tools Appl. (2024). https://doi.org/10.1007/s11042-024-19448-6
Sharma, A., Shrivastava, B.P.: Medical image super-resolution using correlation filter interleaved progressive convolution network (CFIPC). Electron. Lett. 58, 360–362 (2022)
Sharma, A., Shrivastava, B., Gautam, S.: A review on image super-resolution using GAN. In: Meta-Learning Frameworks for Imaging Applications, pp. 12–31 (2023)
Sharma, A., Shrivastava, B.P., Priya, A.: Multilevel progressive recursive dilated networks with correlation filter (MPRDNCF) for image super-resolution. Multimed. Syst. 29, 2455–2467 (2023)
Sharmaz, A., Shrivastava, B.P.: Different techniques of image SR using deep learning: a review. IEEE Sens. J. 23, 724–1733 (2023)
Tao, H., Cheng, L., Qiu, J., Stojanovic, V.: Few shot cross equipment fault diagnosis method based on parameter optimization and feature mertic. Meas. Sci. Technol. 33(11), 115005 (2022)
Song, S., Jing, J., Cheng, W.: Online monitoring system for macro-fatigue characteristics of glass fiber composite materials based on machine vision. IEEE Trans. Instrum. Meas. 71, 1–12 (2022)
Hong, Y., Pan, R., Su, J., Pang, R.: Detection of coal gangue based on MSRCR algorithm and improved lightweight YOLOv8n. Int. J. Coal Prep. Util. (2024). https://doi.org/10.1080/19392699.2024.2398522
Yan, P., Wen, Z., Wu, Z., Li, G., Zhao, Y., Wang, J., Wang, W.: Intelligent detection of coal gangue in mining Operations using multispectral imaging and enhanced RT-DETR algorithm for efficient sorting. Microchem. J. 207, 111789 (2024)
Zhang, Q., Nie, Y., Zheng, W.S.: Dual illumination estimation for robust exposure correction. Comput. Graph. Forum 38, 243–252 (2019)
Guo, X., Li, Y., Ling, H.: LIME: low-light image enhancement via illumination map estimation. IEEE Trans. Image Process. 26(2), 982–993 (2016)
Cheng, D.Q., Xu, J.Y., Kou, Q.Q., Zhang, H.X., Han, C.G., Yu, B., Qian, J.S.: Lightweight network based on residual information for foreign body classification on coal conveyor belt. J. China Coal Soc. 47(3), 1361–1369 (2022)
Chen, P., Liu, S., Zhao, H., Wang, X., Jia, J.: Gridmask data augmentation. arXiv preprint arXiv:2001.04086 (2020)
Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers, Springer. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 213–229(2020)
Ma, X., Dai, X., Bai, Y., Wang, Y., Fu Y.: Rewrite the stars. In: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5694–5703 (2024)
Shaker, A., Maaz, M., Rasheed, H., Khan, S., Yang, M.H., Khan, F.S.: SwiftFormer: efficient additive attention for transformer-based real-time mobile vision applications. In: 2023 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 17379–17390 (2023)
Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IoU loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 12993–13000 (2020)
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: a metric and a loss for bounding box regression. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019)
Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1800–1807 (2017)
Ioffe, S.: Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167 (2015)
Pan, X., Ge, C., Lu, R., Song, S., Chen, G., Huang, Z., Huang, G.: On the integration of self-attention and convolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 815–825 (2022)
Messaoud, W., Trabelsi, R., Cabani, A., Abdelkefi, F.: Multi-head self attention for enhanced object detection in the maritime domain. In: 2023 International Conference on Cyberworlds (CW), pp. 179–184 (2023)
Lin, Z., Feng M., dos Santos, C., Yu, M., Xiang, B., Zhou, B., Bengio, Y.: A structured self-attentive sentence embedding. arXiv:1703.03130
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7464–7475 (2023)
Chen, Y., Dai, X., Liu, M., Chen, D., Yuan, L., Liu, Z.: Dynamic convolution: attention over convolution kernels. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11030–11039 (2020)
Vaswani, A.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Gong, W.: Lightweight object detection: a study based on YOLOv7 integrated with ShuffleNetv2 and vision transformer. arXiv:2403.01736 (2024)
Dong, C., Duoqian, M.: Control distance IoU and control distance IoU loss for better bounding box regression. Pattern Recognit. 137, 109256 (2023)
Zhang, Y.F., Ren, W., Zhang, Z., Jia, Z., Wang, L., Tan, T.: Focal and efficient IOU loss for accurate bounding box regression. Neurocomputing 506, 146–157 (2022)
Gevorgyan, Z.: SIoU loss: more powerful learning for bounding box regression. arXiv:2205.12740 (2022)
Ma, S., Xu, Y.: Mpdiou: a loss for efficient and accurate bounding box regression. arXiv:2307.07662 (2023)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M., (eds.), Computer Vision—ECCV 2016, pp 21–37 (2016)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2016)
Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad-cam: visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE International Conference on Computer Vision, pp 618–626 (2017)
Acknowledgements
Special thanks to the Intelligent Detection and Pattern Recognition Research Center of China University of Mining and Technology for providing the dataset for this study. We would also like to thank every reviewer and editorial team member for their hard work and team support.
Funding
This research was funded by the National Natural Science Foundation of China, grant number 52174141, 62105004; the Anhui Mining Machinery and Electrical Equipment Coordination Innovation Center, Anhui University of Science and Technology, grant number KSJD202304; the Anhui Digital Agriculture Engineering Technology Research Center Open Project of China, grant number AHSZNYGCZXKF021;the Graduate Innovation Fund Project of Anhui University of Science and Technology, grant number 2024cx2067, 2024cx2064; the College Student Innovation and Entrepreneurship Fund project of China, grant number 202210361053, 202310361037.
Author information
Authors and Affiliations
Contributions
YH and LW wrote the main text of the manuscript, and the other authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hong, Y., Wang, L., Su, J. et al. Enhanced foreign body detection on coal mine conveyor belts using improved DLEA and lightweight SARC-DETR model. SIViP 19, 349 (2025). https://doi.org/10.1007/s11760-025-03922-1
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-025-03922-1