Abstract
Wheat spike detection is of great research importance for wheat yield estimation as well as wheat quality management. For the detection of wheat spikes, object detection methods based on machine learning are often employed. The accuracy of wheat spikes photographs is significantly hampered by the images’ small size, extreme density, and heavy overlapping. For the features of wheat spikes images, we present an enhanced YOLOv5-based small wheat spikes detection technique in this research. This approach addresses the issue of erroneous and missed detection in the wheat spikes detection process. Specifically, by introducing the application of SPD-Conv in the wheat spikes detection algorithm, the performance of small-size object detection is improved and the missed detection of wheat spikes is reduced. By introducing the CA attention mechanism in the neck of the small wheat sheaf detection algorithm, thus improving the accuracy of object feature information extraction and detection, the problem of wrong and missed detection of small-sized wheat sheaves is effectively improved. By introducing an efficient RepGFPN module to replace the C3 module in the wheat spikes detection algorithm, the feature extraction capability of the model for small-sized wheat spikes is improved, and better detection accuracy is achieved. The experimental outcomes demonstrate that the enhanced algorithm for detecting wheat spikes can increase detection precision of wheat spikes with an mean average accuracy (mAP) of 94.3\(\%\), which is better than the general object detection model.
Similar content being viewed by others
Availability of data and materials
Not applicable.
Code Availability
Not applicable.
References
Sun, J., Yang, K., Chen, C., Shen, J., Yang, Y., Wu, X., Norton, T.: Wheat head counting in the wild by an augmented feature pyramid networks-based convolutional neural network. Comput. Electron. Agric. 193, 106705 (2022)
Zang, H., Wang, Y., Ru, L., Zhou, M., Chen, D., Zhao, Q., Zhang, J., Li, G., Zheng, G.: Detection method of wheat spike improved yolov5s based on the attention mechanism. Front. Plant Sci. 168, 16648714 (2023)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Ren, S., He, K., Girshick, R., Sun, J.: Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 28 (2015)
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask r-cnn. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2961–2969 (2017)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Wang, D., Fu, Y., Yang, G., Yang, X., Liang, D., Zhou, C., Zhang, N., Wu, H., Zhang, D.: Combined use of FCN and Harris corner detection for counting wheat ears in field conditions. IEEE Access 7, 178930–178941 (2019)
He, M.-X., Hao, P., Xin, Y.-Z.: A robust method for wheatear detection using UAV in natural scenes. IEEE Access 8, 189043–189053 (2020)
Hong, Q., Jiang, L., Zhang, Z., Ji, S., Gu, C., Mao, W., Li, W., Liu, T., Li, B., Tan, C.: A lightweight model for wheat ear fusarium head blight detection based on RGB images. Remote Sensing 14(14), 3481 (2022)
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Li, J., Li, C., Fei, S., Ma, C., Chen, W., Ding, F., Wang, Y., Li, Y., Shi, J., Xiao, Z.: Wheat ear recognition based on retinanet and transfer learning. Sensors 21(14), 4845 (2021)
Bao, W., Lin, Z., Hu, G., Liang, D., Huang, L., Zhang, X.: Method for wheat ear counting based on frequency domain decomposition of msvf-isct. Inf. Process. Agric. (2022)
Dong, Y., Liu, Y., Kang, H., Li, C., Liu, P., Liu, Z.: Lightweight and efficient neural network with SPSA attention for wheat ear detection. PeerJ Comput. Sci. 8, 931 (2022)
Khaki, S., Safaei, N., Pham, H., Wang, L.: Wheatnet: a lightweight convolutional neural network for high-throughput image-based wheat head detection and counting. Neurocomputing 489, 78–89 (2022)
Zhao, J., Zhang, X., Yan, J., Qiu, X., Yao, X., Tian, Y., Zhu, Y., Cao, W.: A wheat spike detection method in UAV images based on improved yolov5. Remote Sensing 13(16), 3095 (2021)
Sunkara, R., Luo, T.: No more strided convolutions or pooling: a new cnn building block for low-resolution images and small objects. In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2022, Grenoble, France, September 19–23, 2022, Proceedings, Part III, pp. 443–459 . Springer (2023)
Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
Obsie, E.Y., Qu, H., Zhang, Y.-J., Annis, S., Drummond, F.: Yolov5s-ca: an improved yolov5 based on the attention mechanism for mummy berry disease detection. Agriculture 13(1), 78 (2022)
Xu, X., Jiang, Y., Chen, W., Huang, Y., Zhang, Y., Sun, X.: Damo-yolo: A report on real-time object detection design. arXiv preprint arXiv:2211.15444 (2022)
David, E., Madec, S., Sadeghi-Tehran, P., Aasen, H., Zheng, B., Liu, S., Kirchgessner, N., Ishikawa, G., Nagasawa, K., Badhon, M.A., et al.: Global wheat head detection (gwhd) dataset: a large and diverse dataset of high-resolution RGB-labelled images to develop and benchmark wheat head detection methods. Plant Phenomics (2020)
David, E., Serouart, M., Smith, D., Madec, S., Velumani, K., Liu, S., Wang, X., Espinosa, F.P., Shafiee, S., Tahir, I.S., et al.: Global wheat head dataset 2021: more diversity to improve the benchmarking of wheat head localization methods. arXiv preprint arXiv:2105.07660 (2021)
Acknowledgements
The authors thank the reviewers who remained anonymous for their constructive criticism and recommendations, which helped to make the manuscript better.
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
LL provided the rationale and methodology for the improved algorithm. PL did the writing-original manuscript preparation, data collection and analysis, and editing. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethical approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, L., Li, P. An improved YOLOv5-based algorithm for small wheat spikes detection. SIViP 17, 4485–4493 (2023). https://doi.org/10.1007/s11760-023-02682-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02682-0