Elsevier

Neurocomputing

Volume 175, Part B, 29 January 2016, Pages 911-923
Neurocomputing

A practical framework for automatic food products classification using computer vision and inductive characterization

https://doi.org/10.1016/j.neucom.2015.06.095Get rights and content

Abstract

With the increasingly international regulatory demands for food products import and export, as well as with the increased awareness and sophistication of consumers, the food industry needs accurate, fast and efficient quality inspection means. Each producer seeks to ensure that their products satisfy all consumer׳s expectations and that the appropriate quality level of each product is offered and sold to each different socio-economic consumer group. This paper presents a framework that uses computer vision and inductive characterization with a reduced set of features, along with three cases where this framework has been successfully applied to improve the quality inspection process. Three different basic food products are studied: Hass Avocado, Manila Mango and Corn Tortillas. These products are very important in economical terms for the sheer volume of their production and marketing. Each product has particular characteristics that involve different ways of handling the quality inspection process, but this framework allows addressing common key points that allow automatizing this process. Experimental results of each case shows that the proposed technique is competitive with existing systems and has significantly lower costs in terms of the number of features required for classification.

Introduction

Nowadays, assessing the quality of food products involves mostly using physical methods. These methods can be non-destructive (produce compression, tactile testing) and destructives (v. gr. penetrometers); however, even traditional non-destructive methods may represent a quality dropping, because the produce is usually delicate and can be easily damaged [28]. For these reasons, newer noninvasive techniques have been proposed. Noninvasive methods such as acoustic, spectroscopic and digital image analysis, including multi- and hyper-spectral imaging techniques, have been widely applied to evaluate the food quality. However, acoustic and spectroscopic methods (near-infrared, nuclear magnetic resonance and magnetic resonance imaging) could be relatively expensive and laborious, and a large number of samples, as well as a large laboratory area, could be required; therefore, an interesting alternative to study the superficial features in produces is the computer vision systems that have been developed in the food sector. Physical characteristics, such as size, shape, morphology, color and texture properties can be measured using image processing [17], [33].

Computer vision systems allow food quality evaluation by maintaining accuracy and consistency while eliminating the subjectivity of manual inspections. Image analysis has more advantages than other noninvasive techniques such as being cheap and easy to adapt to obtain online measurements. In addition, it allows achieving high accuracy and a good correlation with visual human inspection. In general, its versatility allows obtaining a widespread number of features from a simple digital image [6]. A comparative introduction, latest developments and applications of computer vision systems in the external quality inspection of fruits and vegetables are presented in [39]. They emphasize that the outer appearance of fruits and vegetables affects their point-of-sale value and consumer׳s buying behavior, thus causing great economic losses. In this comprehensive study, only mango is mentioned.

In the literature, there are only a few ideas in the establishment of a general framework for automatic classification of food products. In [35], where the hardware and software parts are explained separately; however the software steps are only established for volume measurement. A more general scheme is introduced in [7]; nevertheless a general set of features and an unique classifier for a diverse kinds of food products are missing. In other hand, a thorough survey about the use of machine vision system for the quality inspection of food and agricultural products is shown in [30].

Moreda et al. [23] present a review of some two-dimensional computer vision methods applied throughout the past 25 years for determining the shape of horticultural produces. The authors mentioned that the boundary signature is used to determine malformation or misshapenness. They present examples to show that is possible to detect two shape abnormalities in avocado or establish a maturity index for the Mango.

Careful image feature selection plays an important role for a successful classification. A common set of features to analyze food products is shown in Vanloot et al. [38], while some works as [18] extract as much as 2300 features to finally use only a few, 64. Zhang et al. [40] present a fruit classification of 18 different food products (Hass avocado included) with an overall classification accuracy of 89.1%. In order to achieve this result, they originally used 79 features that were in turn reduced via PCA to only 14 features; however the authors do not mention which these 14 items are; additionally, great amounts of preprocessing were necessary, including the use of the split-and merge algorithm used to remove the background of the images. Using a great amount of features represents a system overload and hinders automatic classification systems for real time applications, where very fast classification is needed due to the sheer volume of production. In this work we aim to find a reduced, but suitable set of features that allow automatic classification of different products, establishing a framework as simple as possible, while effective.

For demonstration purposes, we present the feature extraction on three different food products:

  • i.

    Tortilla is a fundamental element in the diet of Mexican people. Since ancient times, tortillas have been hand-made, but as a result of the increasing demand of this product, even from other countries, there has been an increase in the necessity to update production methods and mass-produce tortillas. Thus, their quality must be assessed to comply with industrial production standards as well as import/export regulations (NOM-187-SSA1/SCFI-2002, 2002); however, there is an absence of standards and guides to evaluate the sensory impression of consumers, i.e., color, shape, spots, etc., or some other kind of imperfections resulting from their elaboration process.

    It has been shown [13], [18] that sensory characteristics are of great importance for consumers, however, for small and mid-size producers currently there is no system for quality control that can guarantee a high level of sensory quality; tortillas are evaluated manually so that the quality of the tortilla is not uniform.

  • ii.

    Avocado is a tree that belongs to the Lauracea family and to the Persea genus, where the best-known member is variety Hass (Persea americana Mill.) According to experts, there are about 500 varieties of avocados; however, only three strains are considered appropriate for commercial production: Mexican, Guatemalan and West Indian [1]. The Hass avocado fruit is part of the Guatemalan strain, and it is the most widely harvested and consumed around the world as a result of its good production yields, resistance to handling and transportation, the excellent quality of its pulp, its pleasant texture and its great taste [27].

    Avocado is a high metabolic fruit and completes its ripeness within 12 days at 15 °C after harvesting [5]. In [29] the authors comment that avocado maturity and picking time are determined according to external markers (color and size). However, [16] reported that determining the commercial maturity of the avocado is difficult because the start of ripening is not accompanied by visible external changes. These situations made the task of classification very challenging, indicating that an automatic sorting system would be very useful.

  • iii.

    The mango (Manguifera Indica L.) is one of the more important crops with a high demand in the market both national and international; this is mainly derived from the consumption in fresh, as well as its use of industrial processes for juice production, nectars and jams.

Mango fruit has high nutritional values, nice flavor, taste, etc. and hence is known as king of fruits. However, it faces a number of challenges such as the attack of microorganisms, having short shelf life, mismanagement during transportation and storage [3]. In [36], the authors indicated different postharvest damages due to harvesting fruit at improper maturity, mechanical damage during supply chain, sap burn, spongy tissue, lenticels discolouration, fruit softening, decay, chilling injury and disease and pest damage. The automatic detection of these damages via a machine vision system would help the producers to improve their postharvest treatment reducing its economic losses.

The remainder of the article is organized as follows: in Section 2 our proposed Framework is introduced, the difference between computer and machine vision is discussed. The designed features and a brief digital image analysis are explained in Section 3. The inductive characterization via the BOUNDSTAR algorithm is presented in Section 4. The results of applying our methodology to the previous mentioned food products are shown in Section 5, and finally we draw conclusions and future work in Section 6.

Section snippets

Proposed framework

In this section, based on [7], we present the proposed general methodology for non-destructive classification of diverse food products. Fig. 1 shows a simplified block diagram of the framework; it includes systems and sub-systems for different processes. The big rectangles represent the subsystems while the small ones indicate gathering information. Blocks 1–6 show the typical components of a computer vision system, and block 7 represents a machine vision system. Blocks 1–4 are very common

Feature selection

In this section, we describe the feature selection of the selected food products: tortilla, avocado and mango for their latter classification using inductive algorithms. Careful selection of features is important to provide useful information to the classification algorithm. A good feature set must be compact, non-redundant, and one that allows a classifier to correctly separate the classes of interest.

We consider the following scene constraints: Only one full shape will appear in the image,

Inductive characterization

Since its beginnings, inductive reasoning and learning techniques have had a prominent role in machine learning. In general, these techniques involve the process of learning by example – where a system tries to induce a general rule from a set of observed instances.

Conceptual Clustering [21] is one of the most widely used technique for inductive learning, distinguished from ordinary data clustering because it generates a concept description for each class of objects. From the 1980 decade, when

Experimental results

In this section from the three food products, we present in detail the use of our proposed methodology (see Fig. 1) in the tortilla. For the other two products, the procedure can be applied in a straightforward way.

Conclusions

This paper presents three study cases where digital image analysis and inductive characterization techniques have been successfully applied to improve the quality inspection process. Three different basic food products were studied: Avocado, Manila Mango and Tortillas. Each one of these products has some special and particular features that involve different ways of handling the quality inspection process, but the general technique of inductive characterization with a reduced set of features

Acknowledgments

The authors thank the support of the Mexican Government (CONACYT, SNI, SIP-IPN, COFAA-IPN and BEIFI-IPN) (Grant 60557-R). We thank Oscar Rojas and Julio Bocanegra for their collaboration in experiments and building of the machine vision system.

Hiram Calvo. Obtained his PhD degree in Computer Science (with honors) in 2006 from Centro de Investigación en Computación (CIC) of the Instituto Politécnico Nacional (IPN), Mexico. His thesis was about the Spanish syntax analyzer DILUCT. He was awarded with the Lázaro Cárdenas Prize in 2006; this Prize was handed personally by the President of Mexico. Calvo did a post-doctoral stay at the Nara Institute of Science and Technology, Japan, from 2008 to 2010. Since 2006 he is a lecturer at

References (40)

  • D.G. Pelli et al.

    Measuring contrast sensitivity

    Vis. Res.

    (2013)
  • R. Quevedo et al.

    Determination of senescent spotting in banana (Musa cavendish) using fractal texture Fourier image

    J. Food Eng.

    (2008)
  • D.J. Sampson et al.

    A dual-view computer-vision system for volume and image texture analysis in multiple apple slices drying

    J. Food Eng.

    (2014)
  • J. Siswantoro et al.

    Volume measurement of food product with irregular shape using computer vision and Monte Carlo method: a framework

    Procedia Technol.

    (2013)
  • D. Sivakumar et al.

    Maintaining mango (Mangifera indica L.) fruit quality during the export chain

    Food Res. Int.

    (2011)
  • P. Vanloot et al.

    Artificial vision and chemometrics analyses of olive stones for varietal identification of five French cultivars

    Comput. Electron. Agric.

    (2014)
  • B. Zhang et al.

    Principles, developments and applications of computer vision for external quality inspection of fruits and vegetables: a review

    Food Res. Int.

    (2014)
  • Y. Zhang et al.

    Fruit classification using computer vision and feedforward neural network

    J. Food Eng.

    (2014)
  • L. Almanza

    Bioplaguicida para el control del barrenador grande del hueso del aguacate

    Claridades Agropecu.

    (2011)
  • I. Arzate et al.

    Image processing applied to classification of avocado variety Hass (Persea americanaMill) during the ripening process

    .Food BioprocessTechnol.: Int. J.

    (2011)
  • Cited by (11)

    View all citing articles on Scopus

    Hiram Calvo. Obtained his PhD degree in Computer Science (with honors) in 2006 from Centro de Investigación en Computación (CIC) of the Instituto Politécnico Nacional (IPN), Mexico. His thesis was about the Spanish syntax analyzer DILUCT. He was awarded with the Lázaro Cárdenas Prize in 2006; this Prize was handed personally by the President of Mexico. Calvo did a post-doctoral stay at the Nara Institute of Science and Technology, Japan, from 2008 to 2010. Since 2006 he is a lecturer at CIC-IPN. His research interests are lexical semantics, pattern recognition, similarity measures and statistical language models.

    Salvador Godoy-Calderon holds a Ph.D. in Computer Science from CIC-IPN and a Master׳s degree in C.S. from CINVESTAV-IPN. He is a full time researcher and Head of the Artificial Intelligence Laboratory at CIC-IPN with research interests including pattern recognition, evolutionary computation, strategic and tactical knowledge systems and formal logics.

    Marco A. Moreno-Armendariz received the B.S. degree from La Salle University, Mexico, in 1998, and the M.S. and Ph.D. degrees in automatic control from the Research Center of Advanced Studies, National Polytechnic Institute (CINVESTAV-IPN), Mexico, in 1999 and 2003, respectively. From 2001 to 2006, he was the Chair Professor in the Engineering School, La Salle University, Mexico. In April 2006, he joined the Centro de Investigación en Computación, National Polytechnic Institute, Mexico. His current research interests include neural networks for identification and control, computer vision, robotics, pattern recognition, and the implementation in field-programmable gate arrays (FPGAs) of related algorithms.

    View full text