Do we really need deep CNN for plant diseases identification?

doi:10.1016/j.compag.2020.105803

Computers and Electronics in Agriculture

Volume 178, November 2020, 105803

https://doi.org/10.1016/j.compag.2020.105803 Get rights and content

Highlights

•
The information extraction ability of shallow CNN was explored.
•
The shallow CNN and classic machine learning classifiers were combined.
•
The proposed methods outperform other deep models on several indicators.

Abstract

Timely identification of plant diseases plays crucial roles in the management and decision-making to protect the agricultural yield and quality. In this research field, there have been so many efforts focused on deep learning, namely deep CNN. The CNN is powerful and essential for image processing; however, do we really need deep CNN for plant diseases identification, cannot the shallow CNN extract enough information? We proposed two methods namely SCNN-KSVM (Shallow CNN with Kernel SVM) and SCNN-RF (Shallow CNN with Random Forest) to solve this confusion. The comparison experiments with other deep learning models were carried out on three different datasets. The results show that the SCNN-KSVM and SCNN-RF outperform other pretrained deep models on the indicators of precision, recall, and F1-score, with fewer parameters. The combination of shallow CNN and classic machine learning classification algorithm is a positive attempt to deal with the plant diseases identification in a simple manner.

Introduction

The plant diseases are important issues in agricultural production, if they cannot be identified in time, there will be a negative impact on the yield and quality of crops (Sethy et al., 2020). As known, the early detection and warning are basis of effective prevention and control for plant diseases, which play crucial roles in the management and decision-making (Fang and Ramasamy, 2015). However, the visual observations by experts or experienced farmers are still the primary approach for the plant diseases detection in many countries and areas. This traditional approach has many disadvantages, e.g. manual observations in large farm will be time-consuming and the high-frequency expert consultations will be too expensive. So, the automatic identification of plant diseases is of great realistic significance, which aims at detecting the symptoms of plant diseases as soon as they appear on leaves (Singh and Misra, 2017, Pooja et al., 2017).

Plenty of previous works have considered this problem by processing plant leaf images and designing some particular classifier to categorize the samples into healthy or diseased images. The reason to select plant leaf images as analysed data is that the plant leaves are generally the first area for symptoms of most plant diseases to appear (Barbedo, 2016, Barbedo, 2019). With the help of computer science and technology, there are two main categories of methods: classical machine learning and deep learning. The classical machine learning algorithms used for plant diseases identification include k-nearest neighbours (KNN) (Singh and Kaur, 2018), support vector machine (SVM) (Naik and Sivappagari, 2016), random forest (RF) (Chaudhary et al., 2016), etc. But these classical approaches despond heavily on the hand-designed features by various ways, such as HOG (histogram of oriented gradient), SIFT (scale-invariant feature transform), Gabor transform, PCA (principal components analysis), etc. On the other hand, the deep learning technology, particularly the convolutional neural network (CNN), have received excessive attentions to deal with the agricultural images, such as plant diseases and crop pests (Kamilaris and Prenafeta-Boldú, 2018, Li and Chao, 2020, Chen et al., 2020, Li and Yang, 2020, Thenmozhi and Reddy, 2019).

The deep CNN models have dominated in numerous fields of image processing research (Too et al., 2019). This phenomenon should be attributed to the powerful image processing ability of CNN, which can automatically extract features by the stack of convolutional layers. As known, the shallow convolutional layers in CNN can extract the primary features, e.g. edges, textures, colors, etc. With the deepening of CNN layers, the extracted features will become more and more abstract. Hence, we want to explore whether the deep CNN is necessary for plant diseases identification, cannot the shallow CNN provide enough features to deal with?

The main motivation of this study is to consider the future practical applications with intelligent algorithms. At present, many researches or algorithms are implemented on the high-powered hardware, such as GPUs and servers. To obtain better performance, the depth of the network becomes deeper with amounts of computational parameters, which consumes a lot of hardware resources and running time. But the high-powered hardware may not be suitable for the field applications, lacking of the convenience. In contrast, the embedded applications are more promising, due to the convenience and low cost (de Castro et al., 2020). Besides, the applications on smart phone running intelligent classification algorithm are also a promising trend, considering convenience, low power consumption, and low computation cost (Tao et al., 2020). In view of the above considerations, we want to explore the ability of shallow CNN to deal with the plant diseases identification.

In this paper, we proposed two methods namely SCNN-KSVM (Shallow CNN with Kernel SVM) and SCNN-RF (Shallow CNN with Random Forest) to deal with the plant diseases identification. The features of plant images are automatically extracted from the shallow CNN and then fed to the classical machine learning algorithms, such as Kernel SVM and Random Forest. By comparing the machine learning classification algorithms, the SVM algorithm is an important classification algorithm with ability to avoid the problem of overfitting, which is also one of the most frequently used machine learning classification algorithms (Liakos et al., 2018). Simultaneously, the Random Forest algorithm has excellent generalization ability and fast training speed (Liang et al., 2020). Thus, the Kernel SVM and Random Forest are selected to classify the extracted feature by the shallow CNN, which was excerpted from the pretrained VGG-16 model without any new trainable parameters. Our methods were compared with other deep learning models on three different datasets. It turned out that our method outperforms other deep learning models on the indicators of precision, recall, and F1-score, with fewer parameters.

Section snippets

Related works

There are two broad categories of technologies in the field of agricultural images recognition and classification: deep learning and traditional machine learning (Liakos et al., 2018). The deep learning technology develops very fast in this field and achieves many remarkable achievements. For instance, Mohanty et al. train a deep learning model for recognizing 14 crop species and 26 crop diseases (Mohanty et al., 2016). Ferentinos designs a deep CNN model to diagnose and identify the plant

Framework overview

A general overview of our work for plant diseases identification based on shallow CNN and traditional machine learning classification algorithms is presented as Fig. 1.

Firstly, the input plant leaf images are labeled and fed to the shallow CNN, which comes from the pretrained VGG-16 model by transfer learning. The pretrained VGG-16 model is trained by the ImageNet dataset with 1000 classes, with the ability to extract features of different levels. We use the strategy of transfer learning to

Results and discussion

In this section, we carry out amount of experiments on the laptop T490, with CPU of Core I7 and 16 GB memory. The experimental environment is Jupyter Notebook, with Python 3.7. Firstly, the experiments on pixel features after PCA are conducted to show the disadvantage of pixel features or manual features. Then, the experiments on features extracted by shallow CNN are conducted. Finally, the comparison experiments with other deep learning models on all the datasets were carried out, considering

Conclusion

Timely and early identification of plant diseases is helpful for farmers to make fast decisions to protect the agricultural yield and quality. In this paper, we proposed two methods, namely SCNN-KSVM and SCNN-RF, to recognize the plant diseases based on the shallow CNN. The results show that our methods can outperform other deep learning models on three different datasets, considering several evaluation indicators. The advantages of proposed methods are better performance and fewer parameters,

CRediT authorship contribution statement

Yang Li: Conceptualization, Methodology, Writing - original draft. Jing Nie: Investigation, Visualization. Xuewei Chao: Software, Validation, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (31860333) and the Natural Science Program of Shihezi University (KX01230101).

References (28)

P.K. Sethy et al.
Deep feature based rice leaf disease identification using support vector machine
Comput. Electron. Agric.
(2020)
V. Singh et al.
Detection of plant leaf diseases using image segmentation and soft computing techniques
Inform. Process. Agric.
(2017)
J. Barbedo
A review on the main challenges in automatic plant disease identification based on visible range images
Biosyst. Eng.
(2016)
A. Chaudhary et al.
An improved random forest classifier for multi-class classification
Inform. Process. Agric.
(2016)
A. Kamilaris et al.
Deep learning in agriculture: A survey
Comput. Electron. Agric.
(2018)
J. Chen et al.
Using deep transfer learning for image-based plant disease identification
Comput. Electron. Agric.
(2020)
Y. Li et al.
Few-shot cotton pest recognition and terminal realization
Comput. Electron. Agric.
(2020)
K. Thenmozhi et al.
Crop pest classification based on deep convolutional neural network and transfer learning
Comput. Electron. Agric.
(2019)
E. Too et al.
A comparative study of fine-tuning deep learning models for plant disease identification
Comput. Electron. Agric.
(2019)
A. de Castro et al.
BerryIP embedded: An embedded vision system for strawberry crop
Comput. Electron. Agric.
(2020)

M. Tao et al.

Smartphone-based detection of leaf color levels in rice plants

Comput. Electron. Agric.

(2020)

K. Ferentinos

Deep learning models for plant disease detection and diagnosis

Comput. Electron. Agric.

(2018)

Y. Lu et al.

Identification of rice diseases using deep convolutional neural networks

Neurocomputing

(2017)

S. Lee et al.

New perspectives on plant disease characterization based on deep learning

Comput. Electron. Agric.

(2020)

Cited by (0)

View full text

Do we really need deep CNN for plant diseases identification?

Highlights

Abstract

Introduction

Section snippets

Related works

Framework overview

Results and discussion

Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgements

Comput. Electron. Agric.

Inform. Process. Agric.

Biosyst. Eng.

Inform. Process. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Comput. Electron. Agric.

Neurocomputing

Comput. Electron. Agric.