Detection of tomato organs based on convolutional neural network under the overlap and occlusion backgrounds

  • Original Paper
  • Published:
Machine Vision and Applications

Abstract

Traditional detection methods are not sensitive to small tomato organs (flowers and fruits), because immature green tomatoes are close in color to the background. Overlap among fruits and occlusion by stems and leaves lead to false and missed detections, which reduce the accuracy and generalization ability of the model. Therefore, this paper proposes a tomato organ recognition method based on an improved Feature Pyramid Network. First, multi-scale feature fusion combines detailed low-level features with high-level semantic features to improve the recognition rate of small tomato organs. Then, repulsion loss replaces the original smooth L1 loss function. In addition, Soft-NMS (soft non-maximum suppression) replaces standard non-maximum suppression to screen the bounding boxes of tomato organs, yielding a recognition model for the key tomato organs. Finally, the network was trained and validated on the collected image data set. Compared with the traditional Faster R-CNN model, performance improved substantially (mean average precision rose from 90.7% to 99.5%). The trained model can subsequently be compressed and embedded into a microcontroller to support precision pesticide targeting and automatic picking of tomato organs.
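
The article does not include source code, so the sketches below are only illustrative reconstructions of two of the generic techniques named in the abstract, not the authors' implementation. The first shows FPN-style top-down multi-scale feature fusion (Lin et al., CVPR 2017): lateral 1x1 convolutions bring each backbone level to a common channel width, and coarser levels are upsampled and added to finer ones. The channel widths, module names, and the choice of PyTorch are assumptions.

```python
import torch
import torch.nn.functional as F
from torch import nn

class TopDownFusion(nn.Module):
    """Minimal FPN-style top-down fusion sketch (not the paper's code):
    1x1 lateral convs reduce each backbone level to a common width,
    then each coarser level is upsampled and added to the finer one."""

    def __init__(self, in_channels, out_channels=256):
        super().__init__()
        self.lateral = nn.ModuleList(
            [nn.Conv2d(c, out_channels, kernel_size=1) for c in in_channels])
        self.smooth = nn.ModuleList(
            [nn.Conv2d(out_channels, out_channels, kernel_size=3, padding=1)
             for _ in in_channels])

    def forward(self, feats):
        # feats: backbone feature maps ordered fine -> coarse, e.g. [C2, C3, C4, C5]
        laterals = [lat(f) for lat, f in zip(self.lateral, feats)]
        # Top-down pathway: add the upsampled coarse map to the next finer map.
        for i in range(len(laterals) - 2, -1, -1):
            laterals[i] = laterals[i] + F.interpolate(
                laterals[i + 1], size=laterals[i].shape[-2:], mode="nearest")
        # A 3x3 conv per level smooths the fused maps before detection heads.
        return [sm(l) for sm, l in zip(self.smooth, laterals)]
```

The second sketch illustrates the Gaussian variant of Soft-NMS (Bodla et al., ICCV 2017), the general technique the abstract says replaces hard non-maximum suppression: scores of overlapping boxes are decayed rather than zeroed, so boxes on overlapping or occluded fruits are less likely to be discarded outright. The parameter values `sigma` and `score_thresh` are illustrative defaults, not values reported in the paper.

```python
import numpy as np

def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Gaussian Soft-NMS sketch: decay the scores of boxes that overlap
    the current best box instead of removing them.

    boxes  : (N, 4) array of [x1, y1, x2, y2]
    scores : (N,) array of detection confidences
    Returns the indices of kept boxes, in descending score order.
    """
    scores = scores.astype(np.float64).copy()
    idxs = np.arange(len(scores))
    keep = []

    while len(idxs) > 0:
        # Pick the highest-scoring remaining box.
        top = np.argmax(scores[idxs])
        best = idxs[top]
        keep.append(best)
        idxs = np.delete(idxs, top)
        if len(idxs) == 0:
            break

        # IoU of the best box with every remaining box.
        x1 = np.maximum(boxes[best, 0], boxes[idxs, 0])
        y1 = np.maximum(boxes[best, 1], boxes[idxs, 1])
        x2 = np.minimum(boxes[best, 2], boxes[idxs, 2])
        y2 = np.minimum(boxes[best, 3], boxes[idxs, 3])
        inter = np.maximum(0.0, x2 - x1) * np.maximum(0.0, y2 - y1)
        area_best = (boxes[best, 2] - boxes[best, 0]) * (boxes[best, 3] - boxes[best, 1])
        area_rest = (boxes[idxs, 2] - boxes[idxs, 0]) * (boxes[idxs, 3] - boxes[idxs, 1])
        iou = inter / (area_best + area_rest - inter)

        # Gaussian decay: strongly overlapping boxes are down-weighted, not removed.
        scores[idxs] *= np.exp(-(iou ** 2) / sigma)

        # Drop boxes whose score has decayed below the threshold.
        idxs = idxs[scores[idxs] > score_thresh]

    return keep
```

Repulsion loss, the third modification named in the abstract, adds penalty terms that push a predicted box away from surrounding non-target ground-truth boxes and other predictions; it is not sketched here because its term weights are not given in the abstract.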

Acknowledgements

This work is supported by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), Six Talent Peaks Project in Jiangsu Province (ZBZZ-019) and Project of Agricultural Equipment Department of Jiangsu University (4121680001).

Author information

Corresponding author

Correspondence to Jun Sun.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Sun, J., He, X., Wu, M. et al. Detection of tomato organs based on convolutional neural network under the overlap and occlusion backgrounds. Machine Vision and Applications 31, 31 (2020). https://doi.org/10.1007/s00138-020-01081-6
