Skip to main content
Log in

Performance evaluation of PCA based reduced features of leaf images extracted by DWT using random Forest and XGBoost classifier

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

The leaf disease classification is a method for putting diseases into groups based on their properties, like texture, shape, and color. Even though DL features are very good at classifying leaf diseases, some authors focused on handcrafted features for leaf disease classification and got quite good results with similar accuracy. In this paper, we have also focused on handcrafted features and ML based shallow classifier to get comparable accuracy of DL models. Handcrafted features and shallow ML based classifier are used for leaf disease detection and classification mainly for three species such as tomato, bell pepper and potato. Here we have used 3- level decomposition based 2D-DWT for image feature extraction and PCA for dimensionality reduction of features. We have used stratified K-Fold validation because the dataset is small and there is a need to maintain the class ratio for classification. For classification Random Forest and XGBoost are used. The proposed method is made up of 4 steps: image pre-processing, feature extraction, feature reduction, and classification. We evaluate the proposed model’s classification accuracy against the classification accuracy of several scholarly works. When applied to Datasets 1, 2, 3, and 4, RF classifiers achieve accuracies of 98.45%, 100%, 98.33%, and 98.55%, respectively, while XGBoost achieves accuracies of 99.11%, 98.72%, 98.23%, and 97.73%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Data availability

Access the datasets used in the current study on the GitHub repository, https://github.com/MonuBhagat11/Leafdisease_data.

Abbreviations

CNN:

Convolutional Neural network

SVM:

Support Vector Machine

KNN:

K-Nearest Neighbor

ROC:

Receiver Operating Characteristic

DL:

Deep Learning

HIS:

Hyperspectral Imaging

MCC:

Moving Center Classifier

LDC:

Linear Discriminant Classifier

FNN:

Fuzzy Neural Network

RF:

Random Forest

EM:

Expectation-maximization

CCD:

Centroid Contour Distance Curve

ML:

Machine Learning

CA:

Classification Accuracy

HSV:

Hue Saturation Value

RGB:

Red Green Blue

LBPs:

Local Binary Patterns

GLCM:

Grey-Level Co-occurrence Matrix

SGDM:

Spatial Gray Level Dependence Matrix

SIFT:

Scale Invariant and Feature Transformation

MSF:

Multiscale fusion

GWO:

Grey Wolf optimizer

GSA:

Gravitational Search Algorithm

RGA:

Real coded Genetic Algorithm

CA:

Classification Accuracy

CHT:

Circular Hough Transform

DRNN:

Deep Residual Neural Network

ABC:

Ant Bee Colony Optimization

IGA:

Improved Genetic Algorithm

PCA:

Principal Component Analysis

ANN:

Artificial Neural Networks

GA:

Genetic Algorithm

PNN:

Probabilistic Neural Network

SURF:

Speeded Up Robust Features

PCA:

Principal Component Analysis

RGA:

Real coded Genetic Algorithm

ACH:

Angle Code Histogram

BoWs:

Bag-of-words

CSM:

Chaotic spider monkey

DWT:

Discrete Wavelet Transform

PSO:

Particle Swarm Optimization

YOP:

Year of Publication

FRT:

Feature Reduction Technique

DiffN:

Difference of Normal Orientations

BRBFNN:

Bacteria Foraging Algorithm Radial Basis Function Neural Network

CFO:

Central force optimization

MDM:

Multiscale Distance Matrix

DNM:

Decomposed Newton’s Method

MMC:

Maximum Margin Criterion

MTS:

Mahalanobis Taguchi System

References

  1. Arjunagi S, Patil NB (2019) Texture based Leaf Disease classification using Machine Learning Techniques

  2. Bhagat M, Kumar D (2022) A comprehensive survey on leaf disease identification & classification. Multimed Tools Appl 81:33897–33925. https://doi.org/10.1007/s11042-022-12984-z

    Article  Google Scholar 

  3. Bhagat M, Kumar D, Kumar D (2019) Role of internet of things (IoT) in smart farming: a brief survey. 2019 Devices for Integrated Circuit (DevIC):141–145. https://doi.org/10.1109/DEVIC.2019.8783800

  4. Bhagat M, Kumar D, Mahmood R, Pati B, Kumar M (2020) "Bell Pepper Leaf Disease Classification Using CNN," 2nd International Conference on Data, Engineering and Applications (IDEA), pp. 1–5, https://doi.org/10.1109/IDEA49133.2020.9170728

  5. Bhagat M, Kumar D, Haque I, Munda HS and Bhagat R, "Plant Leaf Disease Classification Using Grid Search Based SVM," 2nd International Conference on Data, Engineering and Applications (IDEA), 2020, pp. 1–6, https://doi.org/10.1109/IDEA49133.2020.9170725.

  6. Cai J, Miklavcic S (2012) Automated extraction of three-dimensional cereal plant structures from two-dimensional orthographic images. IET Image Process 6(6):687696–687696

    Article  Google Scholar 

  7. Chhikara RR, Sharma P, Singh L (2016) A hybrid feature selection approach based on improved pso and filter approaches for image steganalysis. Int J Mach Learn Cybern 7(6):11951206–11951206

    Article  Google Scholar 

  8. Chouhan SS, Kaul A, Singh UP, Jain S (2018) Bacterial foraging optimization based radial basis function neural network (brbfnn) for identification and classification of plant leaf diseases: An automatic approach towards plant pathology. IEEE Access 6:8852–8863

    Article  Google Scholar 

  9. Chowdhury MEH, … Ali SHM (2021) Automatic and reliable leaf disease detection using deep learning techniques. AgriEngineering 3(2):294–312

    Article  Google Scholar 

  10. Deepa N, … Gadekallu TR (2020) Multiclass model for agriculture development using multivariate statistical method. IEEE Access 8:183749–183758

    Article  MathSciNet  Google Scholar 

  11. Dey AK, Sharma M, Meshram MR (2016) Image processing based leaf rot disease, detection of betel vine (Piper betle L.). Procedia Comput Sci. https://doi.org/10.1016/j.procs.2016.05.262

  12. Dhaka, Singh V et al (2021) A survey of deep convolutional neural networks applied for prediction of plant leaf diseases. Sensors 21.14:4749

    Article  Google Scholar 

  13. Emary E, Zawbaa HM, Grosan C, and Hassenian AE (2015) Feature subset selection approach by gray-wolf optimization. In AfroEuropean Conference for Industrial Advancement, Springer, pp. 1–13

  14. Es-Saady Y, El Massi I, El Yassa M, Mammass D, Benazoun A. Automatic recognition of plant leaves diseases based on serial combination of two SVM classifiers. In: Proc 2016 Int Conf Electr Inf Technol ICEIT 2016 2016:561–6. https://doi.org/10.1109/EITech.2016.7519661

  15. Fuentes A, Yoon S, Kim SC, Park DS (2017) A robust deep-learning- based detector for real-time tomato plant diseases and pests recognition. Sensors (Switzerland) 17. https://doi.org/10.3390/s17092022

  16. Gadekallu TR, … Alazab M (2021) A novel PCA–whale optimization-based deep neural network model for classification of tomato plant diseases using GPU. J Real-Time Image Proc 18(4):1383–1396. https://doi.org/10.1007/s11554-020-00987-8

    Article  Google Scholar 

  17. Hassan SM, Maji AK, Jasiński M, Leonowicz Z, Jasińska E (2021) Identification of plant-leaf diseases using CNN and transfer-learning approach. Electronics 10:1388. https://doi.org/10.3390/electronics10121388

    Article  Google Scholar 

  18. Hlaing CS, Maung Zaw SM. Tomato plant diseases classification using statistical texture feature and color feature. In: Proc - 17th IEEE/ACIS Int Conf Comput Inf Sci ICIS 2018 2018:439–44. https://doi.org/10.1109/ICIS.2018.8466483

  19. Hu R, Jia W, Ling H, Huang D (2012) Multiscale distance matrix for fast plant leaf recognition. IEEE Trans Image Process 21(11):4667–4672

    Article  MathSciNet  MATH  Google Scholar 

  20. Hu J, Chen Z, Yang M, Zhang R, Cui Y (2018) A multiscale fusion convolutional neural network for plant leaf recognition. IEEE Signal Process Lett 25(6):853–857

    Article  Google Scholar 

  21. Huang M-L, Chang Y-H (2020) Dataset of Tomato Leaves. Mendeley Data V1. https://doi.org/10.17632/ngdgg79rzb.1

  22. Hussain R et al. "Revise-Net: Exploiting Reverse Attention Mechanism for Salient Object Detection." Remote Sensing 13.23 (2021): 4941.

  23. Ijaz MF, Attique M, Son Y (2020) Data-driven cervical cancer prediction model with outlier detection and over-sampling methods. Sensors 20(10):2809. https://doi.org/10.3390/s20102809

    Article  Google Scholar 

  24. Jackson RD, Pinter PJ, Reginato RJ, Idso SB (1986) Detection and evaluation of plant stresses for crop management decisions. IEEE Trans Geosci Remote Sensing 1:99–106

    Article  Google Scholar 

  25. Jackson RD, Pinter PJ, Reginato RJ, Idso SB (1986) Detection and evaluation of plant stresses for crop management decisions. IEEE Trans Geosci Remote Sensing 1:99–106

    Article  Google Scholar 

  26. Johannes A, Picon A, Alvarez-Gila A, Echazarra J, Rodriguez-Vaamonde S, Navajas AD et al (2017) Automatic plant disease diagnosis using mobile capture devices, applied on a wheat use case. Comput Electron Agric 138:200–209. https://doi.org/10.1016/j.compag.2017.04.013

    Article  Google Scholar 

  27. Kaur S, Pandey S, Goel S (2019) Plants disease identification and classification through leaf images: a survey. Arch Comput Methods Eng 26(2):507–530

    Article  Google Scholar 

  28. Kharrat A, Gasmi K, Ben M, Nacera (2010) A hybrid approach for automatic classification of Brain using Genetic algorithm and SVM. Leonardo J Sci (17)

  29. Kundu N, et al. "IoT and interpretable machine learning based framework for disease prediction in pearl millet." Sensors 21.16 (2021): 5386.

  30. Li S, Zhang X (2019) Research on orthopedic auxiliary classification and prediction model based on XGBoost algorithm. Neural Comput Applic:1–9

  31. Mohammadi FG, Abadeh MS (2014) Image steganalysis using a bee colony-based feature selection algorithm. Eng Appl Artif Intell 31:35–43

    Article  Google Scholar 

  32. Mohanty SP, Hughes DP, Salathé M (2016) Using deep learning for image-based plant disease detection. Front Plant Sci 7:1419

    Article  Google Scholar 

  33. Naik J, Patel S (2014) Tumor detection and classification using decision tree in brain MRI. Int J Comput Sci Network Sec 14(6):87

    Google Scholar 

  34. Ngugi LC, Abelwahab M, Abo-Zahhad M (2021) Recent advances in image processing techniques for automated leaf pest and disease recognition–a review. Inform Process Agricult 8(1):27–51

    Article  Google Scholar 

  35. Padol PB, Yadav AA (2016) SVM classifier based grape leaf disease detection. In: Conf Adv Signal Process CASP 2016:175–179. https://doi.org/10.1109/CASP.2016.7746160

    Article  Google Scholar 

  36. Pantazi XE, Moshou D, Tamouridou AA (2019) Automated leaf disease detection in different crop species through image features analysis and one class classifiers. Comput Electron Agric 156:96–104

    Article  Google Scholar 

  37. Qin F, Liu D, Sun B, Ruan L, Ma Z, Wang H (2016) Identification of alfalfa leaf diseases using image recognition technology. PLoS One 11:1–26. https://doi.org/10.1371/journal.pone.0168274

    Article  Google Scholar 

  38. Ramesh S, Vydeki D (2020) Recognition and classification of paddy leaf diseases using optimized deep neural network with Jaya algorithm. Inform Process Agricult 7(2):249–260

    Article  Google Scholar 

  39. Rashedi E, Nezamabadi-Pour H, Saryazdi S (2009) Gsa: a gravitational search algorithm. Inf Sci 179(13):2232–2248

    Article  MATH  Google Scholar 

  40. Sabrol H, Satish K (2016) Tomato plant disease classification in digital images using classification tree. Int Conf Commun Signal Process ICCSP 2016:1242–1246. https://doi.org/10.1109/ICCSP.2016.7754351

    Article  Google Scholar 

  41. Sambasivam G, Opiyo GD (2021) A predictive machine learning application in agriculture: cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egyptian Inform J 22(1):27–34

    Article  Google Scholar 

  42. Singh V (2019) Sunflower leaf diseases detection using image segmentation based on particle swarm optimization. Artif Intell Agric 3:62–68. https://doi.org/10.1016/j.aiia.2019.09.002

    Article  Google Scholar 

  43. Singh V, Misra AK (2017) Detection of plant leaf diseases using image segmentation and soft computing techniques. Inform Process Agricult 4(1):41–49

    Article  Google Scholar 

  44. Singh V, Varsha, Misra AK. Detection of unhealthy region of plant leaves using image processing and genetic algorithm. In: Conf Proceeding - 2015 Int Conf Adv Comput Eng Appl ICACEA 2015 2015:1028–32. https://doi.org/10.1109/ICACEA.2015.7164858

  45. Sulistyo SB, Woo WL, Dlay SS (2016) Regularized neural networks fusion and genetic algorithm based oneld nitrogen status estimation of wheat plants. IEEE Trans Industrial Inform 13(1):103–114

    Article  Google Scholar 

  46. Sulistyo SB, Wu D, Woo WL, Dlay SS, Gao B (2017) Computational deep intelligence vision sensing for nutrient content estimation in agricultural automation. IEEE Trans Autom Sci Eng 15(3):1243–1257

    Article  Google Scholar 

  47. Tiwari VM, Tarum G (2017) Plant leaf disease analysis using image processing technique with modified SVM-CS classifier. Int J Eng Manag Technol 5:11–17

    Google Scholar 

  48. Wang Z, Chi Z, Feng D (2003) Shape based leaf image retrieval. IEE Proceed-Vision, Image Signal Process 150(1):34–43

    Article  Google Scholar 

  49. Wang H, Li G, Ma Z, Li X (2012) Image recognition of plant diseases based on backpropagation networks. In: 2012 5th Int Congr Image Signal Process CISP 2012:894–900. https://doi.org/10.1109/CISP.2012.6469998

    Article  Google Scholar 

  50. Wang X, Ma J, Li X, Zhao X, Lin Z, Chen J, Shao Z (2014) Optimization of chemical fungicide combinations targeting the maize fungal pathogen, bipolaris maydis: a systematic quantitative approach. IEEE Trans Biomed Eng 62(1):80–87

    Article  Google Scholar 

  51. Zhang Y-D, Wang S, Lenan W (2010) A novel method for magnetic resonance brain image classification based on adaptive chaotic PSO. Prog Electromagn Res 109:325–343

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to express their gratitude to the esteemed reviewers for their insightful and useful suggestions.

Funding

No public, private, or charitable organizations provided any direct funding for this investigation.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Monu Bhagat.

Ethics declarations

Conflict of interest

The authors have stated that there are no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bhagat, M., Kumar, D. Performance evaluation of PCA based reduced features of leaf images extracted by DWT using random Forest and XGBoost classifier. Multimed Tools Appl 82, 26225–26254 (2023). https://doi.org/10.1007/s11042-023-14370-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-14370-9

Keywords

Navigation