Abstract
Plant diseases are the leading cause of crop yield reduction. Rapid diagnosis using deep learning-based methods can effectively control the deterioration and spread of diseases. Convolutional Neural Network (CNN)-based methods are the current mainstream disease classification solution. However, most methods based on CNN are aimed at different diseases of a single crop, and they are difficult to distinguish similar diseases, which does not perform well in large-scale and fine-grained disease diagnosis tasks. In this paper, an image classification model for large-scale and fine-grained diseases named Squeeze-and-Excitation Vision Transformer (SEViT) is proposed to solve the above problems. SEViT uses ResNet embedded with channel attention module as the preprocessing network, ViT as the feature classification network. It aims to improve the model’s classification accuracy in the case of many types of diseases and high similarity of disease features. Experimental results show that the classification accuracy of SEViT in the test set achieves 88.34%, higher than comparison models. Compared with the baseline model, the classification accuracy of SEViT is improved by 5.15%.








Similar content being viewed by others
Availability of data and materials
Data are available from the authors upon request.
References
Strange, R.N., Scott, P.R.: Plant disease: a threat to global food security. Annu. Rev. Phytopathol. 43, 83–116 (2005)
Arivazhagan, S., Shebiah, R.N., Ananthi, S., Varthini, S.V.: Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features. Agric. Eng. Int. CIGR J. 15(1), 211–217 (2013)
Guo, P., Liu, T., Li, N.: Design of automatic recognition of cucumber disease image. Inf. Technol. J. 13(13), 2129 (2014)
Zhang, S., Wu, X., You, Z., Zhang, L.: Leaf image based cucumber disease recognition using sparse representation classification. Comput. Electron. Agric. 134, 135–141 (2017)
Panchal, P., Raman, V.C., Mantri, S.: Plant diseases detection and classification using machine learning models. In: 2019 4th International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS), 4, 1–6 (2019). IEEE
Zhang, S., Zhang, S., Zhang, C., Wang, X., Shi, Y.: Cucumber leaf disease identification with global pooling dilated convolutional neural network. Comput. Electron. Agric. 162, 422–430 (2019)
Too, E.C., Yujian, L., Njuki, S., Yingchun, L.: A comparative study of fine-tuning deep learning models for plant disease identification. Comput. Electron. Agric. 161, 272–279 (2019)
Karthik, R., Hariharan, M., Anand, S., Mathikshara, P., Johnson, A., Menaka, R.: Attention embedded residual CNN for disease detection in tomato leaves. Appl. Soft Comput. 86, 105933 (2020)
Zhao, Y., Chen, J., Xu, X., Lei, J., Zhou, W.: Sev-net: residual network embedded with attention mechanism for plant disease severity detection. Concurr. Comput.: Pract. Exper. 33(10), 6161 (2021)
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017)
Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
Deng, J.: A large-scale hierarchical image database. In: Proceeding of IEEE Computer Vision and Pattern Recognition, 2009 (2009)
Hirani, E., Magotra, V., Jain, J., Bide, P.: Plant disease detection using deep learning. In: 2021 6th International Conference for Convergence in Technology (I2CT), pp. 1–4 (2021). IEEE
Hughes, D., Salathé, M., et al.: An open access repository of images on plant health to enable the development of mobile disease diagnostics. arXiv preprint arXiv:1511.08060 (2015)
Wu, S., Sun, Y., Huang, H.: Multi-granularity feature extraction based on vision transformer for tomato leaf disease recognition. In: 2021 3rd International Academic Exchange Conference on Science and Technology Innovation (IAECST), pp. 387–390 (2021). IEEE
Zhuang, L.: Deep-learning-based diagnosis of cassava leaf diseases using vision transformer. In: 2021 4th Artificial Intelligence and Cloud Computing Conference, pp. 74–79 (2021)
Zhang, Z., Gong, Z., Hong, Q., Jiang, L.: Swin-transformer based classification for rice diseases recognition. In: 2021 International Conference on Computer Information Science and Artificial Intelligence (CISAI), pp. 153–156 (2021). IEEE
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Howard, A., Sandler, M., Chu, G., Chen, L.-C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., Vasudevan, V., et al.: Searching for mobilenetv3. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314–1324 (2019)
Tan, M., Le, Q.V.: Efficientnet: rethinking model scaling for convolutional neural networks. ArXiv arXiv:1905.11946 (2019)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)
Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI Conference on Artificial Intelligence (2017)
Chollet, F.: Xception: Deep learning with depthwise separable convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1251–1258 (2017)
Liu, Z., Mao, H., Wu, C.-Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. arXiv preprint arXiv:2201.03545 (2022)
Funding
This work was supported in part by NSFC (U1931207 and 61702306), Sci. Tech. Development Fund of Shandong Province of China (ZR2022MF288, ZR2017MF02 and ZR2022MF319), and the Taishan Scholar Program of Shandong Province.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This paper contains no cases of studies with human participants performed by any of the authors.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zeng, Q., Niu, L., Wang, S. et al. SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution. Multimedia Systems 29, 1001–1010 (2023). https://doi.org/10.1007/s00530-022-01034-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-022-01034-1