Original papers
An evaluation of utilizing geometric features for wheat grain classification using X-ray images

https://doi.org/10.1016/j.compag.2017.12.004Get rights and content

Highlights

  • An automatic feature detection system based on image processing is proposed.

  • The relationship of grain morphological traits should be controlled during classification.

  • The classification procedure relies on image analysis and multivariate statistical methods.

Abstract

Nowadays, with the rapid development of digital image processing, there has been a notable increase in elaborating advanced tools for studying the internal structure of objects. This may be very helpful in characterizing certain morphological traits of grains, as well as in quantifying the differences between them. The current research was carried out to study the structure of the traits and to determine their importance in relation to grain classification and identification. To achieve better performance and deeper understanding of their usefulness, the investigation was done by means of both principal component analysis and multivariate factor analysis. Herein, the percentage of variation explained by the first three factors reached a high 89.97%. Thus, the presented methodology supported reliable discrimination of the wheat varieties as regards their shape descriptors. The conducted study confirmed the practical usefulness and effectiveness of the evolved method when applied to the many practical tasks wherein the image analysis commonly employed in multivariate statistical methods is recommended.

Introduction

Recent advances in information technology and in digital image processing have resulted in the development and practical usage of computer-aided techniques in data analysis. The three-dimensional nature of computed tomography scanning allows the same object to be scanned multiple times and provides an opportunity to investigate its particle at any location within a sample. In the last few years, research studies indicate that X-ray computed tomography provides an alternate approach for the nondestructive measuring technique (Papadopoulos et al., 2009, Peth et al., 2010). This is very useful in characterizing the internal structure of objects and in quantifying their geometric features (Charytanowicz, 2014, Charytanowicz and Kulczycki, 2014, Czachor et al., 2015). In our research, the feature extraction method based on the utilization of X-ray images is proposed to measure several grain traits, hence, enabling their classification.

The identification of wheat grain requires some knowledge of their characteristics. High classification accuracy can be obtained by using kernel shape, color, length, and texture (Wiwart et al., 2012, Zapotoczny, 2011). They are considered as major distinctions and can be combined to construct the feature vector, which represents wheat grain. In the previous studies on wheat classification morphology, color, and texture were exploited for wheat variety recognition (Utku, 2000). Indeed, Majumdar and Jayas have suggested several different approaches for classification cereal grains using different types of features and their combinations (Majumdar and Jayas, 2000a, Majumdar and Jayas, 2000b, Majumdar and Jayas, 2000c, Majumdar and Jayas, 2000d).

Various computer-aided systems based on morphological features for the classification have been reported in literature (Guevara-Hernandez and Gomez-Gil, 2011, Niewczas et al., 1995). The majority of different features have involved the identification of grain varieties. The key problem encountered in practice is a very large number of variables. Still, when the dimensionality of the data increases, classification problems become significantly harder. A high number of features can lead to lower classification accuracy. Moreover, the amount of computations required for classification increases exponentially with the growth of data dimension. On the other hand, a reduction of the space dimensionality leads to a better understanding of a model and simplifies the usage of different visualization techniques (Camastra, 2003). A main problem is identifying a representative set of features from which a classification model for a particular task will be constructed. This addresses the problem of feature selection through a correlation based approach.

A logical attitude for categorizing the traits requires the use of multivariate statistical methods such as principal component analysis and factor analysis. These methods basically reduce a high number of variables to several components without a significant loss in classification accuracy. These approaches also enable the user to detect and explain correlations among variables. In addition, dimensionality reduction aims to reveal meaningful structures and guarantees to show the genuine properties of the original data (Janenek et al., 2008, Tian et al., 2010).

Vahid et al. (2011) has employed factor analysis to research the relationship in totality of some quantitative traits with regard to wheat grain yield under end drought stress via factorial split plot and on the basis of completely randomized block design in three replications. In this work, according to factor analysis, through decomposition to main components, four factors altogether explained 83.51% of all variations. The first factor was called the effective factor to yield. The second factor was that of effective traits to spike characteristics. The third factor was called the effective factor to plant height. Finally, the fourth factor was called the effective factor to plant growth. Leilah and Al-Khateeb (2005) have harnessed factor analysis and principal components analysis to study the relationship between wheat grain yield and its components under the drought conditions. Herein, three main factors accounted for 74.4% of the total variability in the dependent structure. The results showed that biological yield, harvest index, weight of grains/spike, spike length and number of spikes/m2 had the highest communality, and, consequently, the high relative contribution in wheat grain yield.

Furthermore, the accomplished studies showed that the digital image processing techniques commonly applied in multivariate statistical analysis give reliable results in recognizing wheat varieties. In the previous paper (Charytanowicz et al., 2010), conducted in the earlier stage of the research study, an effective gradient clustering algorithm was proposed for wheat variety classification. This had resource to the basic geometric features, including the kernel area, kernel perimeter, compactness, kernel length, kernel width, asymmetry coefficient and length of kernel groove, common to particular wheat grain varieties. The presented procedure achieved an accuracy of about 92%. The utility of the investigated methodology in the context of discrimination by way of using the geometric features of wheat grain has been also confirmed by the results of discriminant analysis. In the work (Charytanowicz et al., 2016), the selected combination of geometric features, including the kernel perimeter, compactness, asymmetry coefficient, the ratio between the germ length and the kernel length, the ratio between the germ area and the kernel area, and the ratio between the kernel width and the kernel length, had significant contribution to the discrimination, and such features have permitted discriminant analysis to achieve a recognition rate of 89–96%. Moreover, in the paper (Kulczycki and Łukasik, 2014), the problems of reducing data set dimension and size were investigated. After the reduction process, the number of wheat grains assigned to the right class was very high – achieving almost 90%. Finally, the data set of wheat grain was used to verify classification quality results for the probabilistic neural network with simplified structure (Kowalski and Kusy, in press). The obtained results confirmed the practical usefulness of the proposed methodology.

Thus, this paper demonstrate the utilization of grain geometric traits in wheat variety recognition. The main objective of this work is to determine a basic set of these parameters with respect to wheat grain morphology. Both principal component analysis and multivariate factor analysis are exploited to better comprehend the relations between traits, as well as to identify effective factors in wheat grain classification.

Section snippets

Materials and methods

The research was conducted at the Institute of Agrophysics of the Polish Academy of Sciences in Lublin. For this work, we chose combine harvested wheat grain of three varieties: Canadian, Kama, and Rosa. Herein, the relationship between the different grain traits and their effectiveness in grain classification were studied by way of utilizing two multivariate statistical procedures which can be employed for structure detection: principal component analysis and factor analysis (Basilevsky, 1994,

Results and discussion

The studied material was firstly subjected by image analysis to determine twelve geometric traits. All measurements were made from a total of 288 samples of three wheat varieties: Canadian, Kama, and Rosa, containing 108, 72, and 108 kernels, respectively. Table 1 shows the arithmetic mean, minimum and maximum values, as well as the standard deviation for all estimated wheat traits, both separately for each variety and combined.

The ANOVA results indicated significant differences between

Final remarks and summary

Computer vision-based systems, along with data analysis procedures based on classical statistical methods or computation intelligence can be successfully exploited for classification tasks (Forczmanski and Markiewicz, 2013, Forczmański and Markiewicz, 2016, Hu et al., 1998, Sabanci et al., 2017). The conducted research has shown the usefulness of the main geometric features of three wheat varieties: Kama, Rosa and Canadian in such problems. Thus, in works (Charytanowicz et al., 2010, Kulczycki

Acknowledgments

Our heartfelt thanks go to our colleague Professor Jerzy Niewczas, with whom we commenced the research presented in the previous papers. We are also very grateful for his valuable advice and inspiration.

References (38)

  • Charytanowicz, M., Niewczas, J., Kulczycki, P., Kowalski, P.A., Łukasik, S., Żak, S., 2010. Complete Gradient...
  • Charytanowicz, M., Niewczas, J., Kulczycki, P., Kowalski, P.A., Łukasik, S., 2016. Discrimination of Wheat Grain...
  • H. Czachor et al.

    Impact of long term mineral and organic fertilization on water stability, wettability and porosity of aggregates of two silt loamy soils

    Eur. J. Soil Sci.

    (2015)
  • N.R. Draper et al.

    Applied Regression Analysis

    (1981)
  • Forczmański, P., Markiewicz, A., 2013. Low-level image features for stamps detection and classification. In: 8th...
  • P. Forczmański et al.

    Two-stage approach to extracting visual objects from paper documents

    Mach. Vision Appl.

    (2016)
  • F. Guevara-Hernandez et al.

    A machine vision system for classification of wheat and barley grain kernels

    Spanish J. Agricult. Res.

    (2011)
  • B.-G. Hu et al.

    Application of fuzzy classification technique in computer grading of fish product

    IEEE Trans. Fuzzy Syst.

    (1998)
  • Janecek, A., Gansterer, W.N., Demel, M., Ecker, G.F., 2008. On the relationship between feature selection and...
  • Cited by (19)

    • Optical techniques in non-destructive detection of wheat quality: A review

      2022, Grain and Oil Science and Technology
      Citation Excerpt :

      Kudra et al. [44] clearly showed wheat internal cracks through soft X-ray images, and analyzed the influence of drying conditions on stress cracks from germination rate, germination length and mechanical strength, which provided theoretical support for the qualitative evaluation of the wheat quality and the selection of parameters in the drying process. With the help of X-ray images, Charytanowicz et al. [45] measured several parameters of the wheat kernel, such as kernel perimeter, kernel width, germ length and the ratio of germ area to the kernel, which are fed to the K-nearest neighbor (KNN) classification model to detect wheat with different quality, and the correct rate reached 90%. Germination can also lead to changes in wheat tissue structure, Neethirajan et al. [46] showed that there were white spots in the X-ray image of germinated wheat compared with the normal one.

    • Prediction model for daily reference crop evapotranspiration based on hybrid algorithm and principal components analysis in Southwest China

      2021, Computers and Electronics in Agriculture
      Citation Excerpt :

      Factor analysis is a statistical technology to extract common factors from variable groups. It can find hidden and representative factors from multiple variables, express variables with the same nature as a linear combination of factors, and use fewer factors to reflect most information of original data (Charytanowicz et al., 2018; Zhang et al., 2014). The GBDT (Sathyaseelan et al., 2020) model is a widely used classification and regression method, which takes decision stump or regression tree as a weak classifier.

    • Methodically unified procedures for a conditional approach to outlier detection, clustering, and classification

      2021, Information Sciences
      Citation Excerpt :

      An example application of the kernel conditional density to model identification for the needs of robust control has been presented in the paper [20] for the one-dimensional case and in the article [21] for the multidimensional case. See also [6,7]. The task of detecting atypical elements is one of the fundamental problems of contemporary data analysis [1], especially in the preliminary phase of processing.

    View all citing articles on Scopus
    View full text