Abstract
Tea category classification is important in industrial applications. We developed a tea-category identification system based on machine learning and computer vision to classify tea types automatically and accurately. We acquired 75 photos of three categories of tea (green, black, and oolong) with a 3-CCD digital camera. After preprocessing, we obtained 7 coefficient subbands using a 2-level wavelet transform and extracted the entropy of each subband as a feature. Finally, a weighted k-Nearest Neighbors algorithm was trained for classification. Experimental results over 5 × 5-fold cross validation showed that the proposed approach achieved sensitivities of 95.2 %, 90.4 %, and 98.4 % for green, oolong, and black tea, respectively, with an overall accuracy of 94.7 %. The average time to identify a new image was 0.0491 s. The method is therefore both accurate and efficient in identifying tea categories.
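The pipeline described in the abstract (2-level wavelet decomposition, one entropy per subband, weighted k-Nearest Neighbors) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the Haar wavelet, the 16-bin histogram estimate of Shannon entropy, the inverse-distance vote weights, and all function names (`haar_dwt2`, `subband_entropy`, `wavelet_entropy_features`, `weighted_knn_predict`) are hypothetical choices made for the example, since the abstract does not specify them.

```python
import math

def haar_dwt2(img):
    """One level of a 2D Haar transform on an image with even height and width.
    Returns the approximation band and three detail bands (Haar is an assumption)."""
    approx, det1, det2, det3 = [], [], [], []
    for r in range(0, len(img), 2):
        a_row, d1_row, d2_row, d3_row = [], [], [], []
        for c in range(0, len(img[0]), 2):
            a, b = img[r][c], img[r][c + 1]
            d, e = img[r + 1][c], img[r + 1][c + 1]
            a_row.append((a + b + d + e) / 2.0)   # low-pass in both directions
            d1_row.append((a + b - d - e) / 2.0)  # difference between the two rows
            d2_row.append((a - b + d - e) / 2.0)  # difference between the two columns
            d3_row.append((a - b - d + e) / 2.0)  # diagonal difference
        approx.append(a_row)
        det1.append(d1_row)
        det2.append(d2_row)
        det3.append(d3_row)
    return approx, det1, det2, det3

def subband_entropy(band, bins=16):
    """Shannon entropy (bits) of a histogram of the coefficients (assumed estimator)."""
    values = [v for row in band for v in row]
    lo, hi = min(values), max(values)
    if hi == lo:
        return 0.0  # a constant subband carries no information
    counts = [0] * bins
    for v in values:
        idx = min(int((v - lo) / (hi - lo) * bins), bins - 1)
        counts[idx] += 1
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts if c)

def wavelet_entropy_features(img, levels=2):
    """3 detail subbands per level plus the final approximation: 7 features for 2 levels."""
    feats = []
    approx = img
    for _ in range(levels):
        approx, d1, d2, d3 = haar_dwt2(approx)
        feats.extend(subband_entropy(b) for b in (d1, d2, d3))
    feats.append(subband_entropy(approx))
    return feats

def weighted_knn_predict(train_x, train_y, query, k=3, eps=1e-9):
    """k-NN in which each neighbor votes with weight 1/distance (assumed weighting)."""
    neighbors = sorted((math.dist(x, query), y) for x, y in zip(train_x, train_y))
    votes = {}
    for dist, label in neighbors[:k]:
        votes[label] = votes.get(label, 0.0) + 1.0 / (dist + eps)
    return max(votes, key=votes.get)

# Toy usage: a synthetic 8 x 8 "image" and a one-dimensional feature space.
toy_image = [[(r * c) % 7 for c in range(8)] for r in range(8)]
features = wavelet_entropy_features(toy_image)
label = weighted_knn_predict([[0.0], [3.0], [3.1]], ["green", "black", "black"], [0.1])
```

In the toy call, two of the three neighbors are labeled `"black"`, but the single very close `"green"` neighbor outweighs them. This is the behavior that distinguishes a distance-weighted vote from a plain majority vote.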
Acknowledgments
This paper was supported by NSFC (61602250), the Natural Science Foundation of Jiangsu Province (BK20150983), the Open Project Program of the State Key Lab of CAD&CG, Zhejiang University (A1616), the Open Research Fund of Hunan Provincial Key Laboratory of Network Investigational Technology (2016WLZC013), and the Open Fund of Fujian Provincial Key Laboratory of Data Intensive Computing (BD201607).
Ethics declarations
Conflicts of interest
The authors declare no conflict of interest regarding this paper.
Additional information
Xueyan Wu and Jiquan Yang contributed equally to this work.
About this article
Cite this article
Wu, X., Yang, J. & Wang, S. Tea category identification based on optimal wavelet entropy and weighted k-Nearest Neighbors algorithm. Multimed Tools Appl 77, 3745–3759 (2018). https://doi.org/10.1007/s11042-016-3931-z
DOI: https://doi.org/10.1007/s11042-016-3931-z