Abstract
While rice pests and diseases significantly impact crop yields, existing deep learning methods for their detection face challenges with accuracy and deployment complexity. Addressing these issues, this study proposes the YOLOv8-HSFPN, an advanced detection framework. Firstly, it features an innovative High-level Select Feature Pyramid Network (HSFPN) neck network that effectively integrates high-level and low-level feature sets for enhanced feature fusion. Secondly, the addition of a deformable self-attention module further refines the model’s adaptability to the varying shapes and locations of targets, dynamically adjusting to the salient features. The proposed model has undergone comparative and ablation studies alongside YOLOv8, YOLOv9, and YOLOv5, confirming its improved accuracy and streamlined deployment. This integration results in a robust detection model that not only marks a significant leap in accuracy, evidenced by a 3% empirical increase over the standard YOLOv8, but is also remarkably compact. At a mere 3.97MB, this substantial 49.87% size reduction compared to its predecessors renders it exceptionally suitable for devices with limited computational resources, thereby enhancing its viability in practical, real-world applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Watt, M.S., et al.: Early prediction of regional red needle cast outbreaks using climatic data trends and satellite-derived observations. Remote Sens. 16(8), 1401 (2024)
Xia, Y., et al.: Detection of surface defects for maize seeds based on YOLOv5. J. Stored Prod. Res. 105, 102242 (2024)
Chen, Y., et al.: Accurate leukocyte detection based on deformable-DETR and multi-level feature fusion for aiding diagnosis of blood diseases. Comput. Biol. Med. 170, 107917 (2024)
Nguyen, D.K., Ju, J., Booij, O., Oswald, M.R., Snoek, C.G.: Boxer: box-attention for 2D and 3D transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4773–4782 (2022)
Patterson, J., Gibson, A.: Deep Learning: A Practitioner’s Approach. O’Reilly Media, Inc. (2017)
Teixeira, A.C., Ribeiro, J., Morais, R., Sousa, J.J., Cunha, A.: A systematic review on automatic insect detection using deep learning. Agriculture 13(3), 713 (2023)
Liu, J., Wang, X.: Plant diseases and pests detection based on deep learning: a review. Plant Methods 17, 1–18 (2021)
Tian, L., et al.: VMF-SSD: a novel v-space based multi-scale feature fusion SSD for apple leaf disease detection. IEEE/ACM Trans. Comput. Biol. Bioinform. (2022)
Zhang, Y., Ma, B., Hu, Y., Li, C., Li, Y.: Accurate cotton diseases and pests detection in complex background based on an improved YOLOX model. Comput. Electron. Agric. 203, 107484 (2022)
Jiang, M., Wang, Y., Guo, M., Liu, L., Yu, F.: UPDN: pedestrian detection network for unmanned aerial vehicle perspective. In: Computer Graphics International Conference, pp. 27–39. Springer (2023)
Shahbaz, A., Jo, K.H.: Deep Atrous spatial features-based supervised foreground detection algorithm for industrial surveillance systems. IEEE Trans. Ind. Inf. 17(7), 4818–4826 (2020)
Panda, M.K., Sharma, A., Bajpai, V., Subudhi, B.N., Thangaraj, V., Jakhetiya, V.: Encoder and decoder network with ResNet-50 and global average feature pooling for local change detection. Comput. Vis. Image Underst. 222, 103501 (2022)
Scherer, D., Müller, A., Behnke, S.: Evaluation of pooling operations in convolutional architectures for object recognition. In: International Conference on Artificial Neural Networks, pp. 92–101. Springer (2010)
Yuan, X., Qiao, Z., Meyarian, A.: Scale attentive network for scene recognition. Neurocomputing 492, 612–623 (2022)
Jiao, X., Chen, Y., Dong, R.: An unsupervised image segmentation method combining graph clustering and high-level feature representation. Neurocomputing 409, 83–92 (2020)
Lu, W., Song, Z., Chu, J.: A novel 3D medical image super-resolution method based on densely connected network. Biomed. Sig. Process. Control 62, 102120 (2020)
Zhang, H., Zhang, S.: Focaler-IoU: more focused intersection over union loss. arXiv preprint arXiv:2401.10525 (2024)
Acknowledgments
This work was supported in part by the Fund for Academic Innovation Teams and Research Platform of South-Central Minzu University (Grant Number: XTZ24003, PTZ24001), Knowledge Innovation Program of Wuhan-Basic Research (Project No.: 2023010201010151), and the Research Start-up Funds of South-Central Minzu University under grant YZZ18006, and the Spring Sunshine Program of Ministry of Education of the People’s Republic of China under grant HZKY20220331.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
1 Electronic supplementary material
Below is the link to the electronic supplementary material.
Supplementary material 1 (mp4 22762 KB)
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Yang, Y., Zhu, J., Yang, B., Zhang, X., Huang, J. (2025). An Improved YOLOv8-Based Rice Pest and Disease Detection. In: Magnenat-Thalmann, N., Kim, J., Sheng, B., Deng, Z., Thalmann, D., Li, P. (eds) Advances in Computer Graphics. CGI 2024. Lecture Notes in Computer Science, vol 15340. Springer, Cham. https://doi.org/10.1007/978-3-031-82024-3_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-82024-3_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-82023-6
Online ISBN: 978-3-031-82024-3
eBook Packages: Computer ScienceComputer Science (R0)