PlantDet: A Benchmark for Plant Detection in the Three-Rivers-Source Region

Li, Huanhuan; Zhang, Yu-an; Zou, Xuechao; Li, Zhiyong; Zhaba, Jiangcai; Li, Guomei; Yongga, Lamao

doi:10.1007/978-3-031-44201-8_14

Huanhuan Li ORCID: orcid.org/0009-0004-1825-5025¹¹,
Yu-an Zhang ORCID: orcid.org/0000-0002-7450-9876¹¹,
Xuechao Zou¹¹,
Zhiyong Li¹¹,
Jiangcai Zhaba¹²,
Guomei Li¹² &
…
Lamao Yongga¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14262))

Included in the following conference series:

International Conference on Artificial Neural Networks

887 Accesses

Abstract

The Three-River-Source region is a highly significant natural reserve in China that harbors a plethora of botanical resources. To meet the practical requirements of botanical research and intelligent plant management, we construct a dataset for Plant detection in the Three-River-Source region (PTRS). It comprises 21 types, 6965 high-resolution images of 2160 $\times $ 3840 pixels, captured by diverse sensors and platforms, and featuring objects of varying shapes and sizes. The PTRS presents us with challenges such as dense occlusion, varying leaf resolutions, and high feature similarity among plants, prompting us to develop a novel object detection network named PlantDet. This network employs a window-based efficient self-attention module (ST block) to generate robust feature representation at multiple scales, improving the detection efficiency for small and densely-occluded objects. Our experimental results validate the efficacy of our proposed plant detection benchmark, with a precision of 88.1%, a mean average precision (mAP) of 77.6%, and a higher recall compared to the baseline. Additionally, our method effectively overcomes the issue of missing small objects.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Multi-stage progressive detection method for water deficit detection in vertical greenery plants

Article Open access 26 April 2024

An annotated satellite imagery dataset for automated river barrier object detection

Article Open access 10 February 2025

Feature augmentation and scale penalty for tiny floating detection

Article 23 September 2023

References

Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., Berg, A.C.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Redmon, J., Farhadi, A.: Yolo9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7263–7271 (2017)
Google Scholar
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Li, C., et al.: Yolov6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)
Wang, C.Y., Bochkovskiy, A., Liao, H.Y.M.: Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv preprint arXiv:2207.02696 (2022)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Liu, J., Wang, X.: Plant diseases and pests detection based on deep learning: a review. Plant Methods 17, 1–18 (2021)
Article Google Scholar
Mohammadi, V., Kheiralipour, K., Ghasemi-Varnamkhasti, M.: Detecting maturity of persimmon fruit based on image processing technique. Scientia Horticulturae 184, 123–128 (2015)
Article Google Scholar
Tian, Y., Yang, G., Wang, Z., Wang, H., Li, E., Liang, Z.: Apple detection during different growth stages in orchards using the improved yolo-v3 model. Comput. Electron. Agric. 157, 417–426 (2019)
Article Google Scholar
Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., Zagoruyko, S.: End-to-end object detection with transformers. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 213–229. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_13
Chapter Google Scholar
Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Liu, Z., et al.: Swin transformer: Hierarchical vision transformer using shifted windows. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022 (2021)
Google Scholar
Yang, J., et al.: Focal self-attention for local-global interactions in vision transformers. arXiv preprint arXiv:2107.00641 (2021)
Wang, N., Zhou, W., Wang, J., Li, H.: Transformer meets tracker: Exploiting temporal context for robust visual tracking. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1571–1580 (2021)
Google Scholar
Sun, P., et al.: TransTrack: multiple object tracking with transformer. arXiv preprint arXiv:2012.15460 (2020)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Wang, W., et al.: Efficient and accurate arbitrary-shaped text detection with pixel aggregation network. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8440–8449 (2019)
Google Scholar
Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., Ren, D.: Distance-IOU loss: Faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993–13000 (2020)
Google Scholar
Buzzy, M., Thesma, V., Davoodi, M., Mohammadpour Velni, J.: Real-time plant leaf counting using deep object detection networks. Sensors 20(23), 6896 (2020)
Article Google Scholar
Oh, S., et al.: Plant counting of cotton from UAS imagery using deep learning-based object detection framework. Remote Sens. 12(18), 2981 (2020)
Article Google Scholar
Fuentes, A., Yoon, S., Kim, S.C., Park, D.S.: A robust deep-learning-based detector for real-time tomato plant diseases and pests recognition. Sensors 17(9), 2022 (2017)
Article Google Scholar
Reckling, W., Mitasova, H., Wegmann, K., Kauffman, G., Reid, R.: Efficient drone-based rare plant monitoring using a species distribution model and AI-based object detection. Drones 5(4), 110 (2021)
Article Google Scholar
Basavegowda, D.H., Mosebach, P., Schleip, I., Weltzien, C.: Indicator plant species detection in grassland using efficientdet object detector. 42. GIL-Jahrestagung, Künstliche Intelligenz in der Agrar-und Ernährungswirtschaft (2022)
Google Scholar
Tian, Z., Shen, C., Chen, H., He, T.: FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9627–9636 (2019)
Google Scholar
Law, H., Deng, J.: CornerNet: detecting objects as paired keypoints. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 734–750 (2018)
Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., Sun, J.: You only look one-level feature. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13039–13048 (2021)
Google Scholar

Download references

Acknowledgments

This study is supported by the Science and Technology Plan of Qinghai Province (2020-QY-218), and China Agriculture Research System of MOF and MARA (CARS-37).

Author information

Authors and Affiliations

Department of Computer Technology and Applications, Qinghai University, Xining, China
Huanhuan Li, Yu-an Zhang, Xuechao Zou & Zhiyong Li
Forestry and Grassland Comprehensive Service Center of Yushu Prefecture, Yushu, China
Jiangcai Zhaba, Guomei Li & Lamao Yongga

Authors

Huanhuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Yu-an Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xuechao Zou
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyong Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiangcai Zhaba
View author publications
You can also search for this author in PubMed Google Scholar
Guomei Li
View author publications
You can also search for this author in PubMed Google Scholar
Lamao Yongga
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yu-an Zhang .

Editor information

Editors and Affiliations

Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
Democritus University of Thrace, Xanthi, Greece
Antonios Papaleonidas
Lancaster University, Lancaster, UK
Plamen Angelov
Teesside University, Middlesbrough, UK
Chrisina Jayne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, H. et al. (2023). PlantDet: A Benchmark for Plant Detection in the Three-Rivers-Source Region. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14262. Springer, Cham. https://doi.org/10.1007/978-3-031-44201-8_14

Download citation

DOI: https://doi.org/10.1007/978-3-031-44201-8_14
Published: 23 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44200-1
Online ISBN: 978-3-031-44201-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

PlantDet: A Benchmark for Plant Detection in the Three-Rivers-Source Region

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-stage progressive detection method for water deficit detection in vertical greenery plants

An annotated satellite imagery dataset for automated river barrier object detection

Feature augmentation and scale penalty for tiny floating detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

PlantDet: A Benchmark for Plant Detection in the Three-Rivers-Source Region

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Multi-stage progressive detection method for water deficit detection in vertical greenery plants

An annotated satellite imagery dataset for automated river barrier object detection

Feature augmentation and scale penalty for tiny floating detection

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation