A Novel Camera Based Approach for Automatic Expiry Date Detection and Recognition on Food Packages

Gong, Liyun; Yu, Miao; Duan, Wenting; Ye, Xujiong; Gudmundsson, Kjartan; Swainson, Mark

doi:10.1007/978-3-319-92007-8_12

Liyun Gong¹⁸,
Miao Yu¹⁸,
Wenting Duan¹⁸,
Xujiong Ye¹⁸,
Kjartan Gudmundsson¹⁹ &
…
Mark Swainson¹⁹

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 519))

Included in the following conference series:

IFIP International Conference on Artificial Intelligence Applications and Innovations

3291 Accesses
7 Altmetric

Abstract

There is abundant of information on food packages, which include the food name, the expiry date and the ingredients. These information, especially the expiry date needs to be coded correctly before the products can be released into the market/supply chains. Failure of printing the correct expiry date can lead to both the health issues to the public and financial issues for recalling product back and even reimbursement. In this paper, we develop an automatic system that can achieve the expiry date region detection and recognition in an efficient and effective way. A deep neural network (DNN) based approach is firstly applied to find the expiry date region on the food package. The date characters are then extracted and recognized through the image processing and machine learning methods from the expiry date region. The system is the first camera based automatic system for recognizing expiry date on food packages. And the results tested on different types of food packages show that the system can achieve good performance on both detection and recognition of the expiry date.

You have full access to this open access chapter, Download conference paper PDF

A novel unified deep neural networks methodology for use by date recognition in retail food package image

Article Open access 01 September 2020

Design and Development of an Automated Snack Maker with CNN-Based Quality Monitoring

Recognition and Quantity Estimation of Pastry Images Using Pre-training Deep Convolutional Networks

Keywords

1 Introduction

In the European Union food production is the largest manufacturing sector where it accounts for 13.3% of the total EU-28 manufacturing sector with a reported turnover of 945 billion [1]. Whilst food availability is a primary concern in developing nations and food quality (value) is a focal point in more affluent societies, food safety is a requirement that is common across all food supply chains. Food safety in the sector is typically underpinned by food science and technology and assured by a combination of operational control systems and procedures including Good Manufacturing Practice (GMP) and Hazard Analysis Critical Control Point (HACCP) [2].

The food product information printed on the food package is a vital for the food safety. Pre-packaged food product information which are incorrectly labelled, especially the expiry date results in product recalls as the fault/issue could cause a food safety incident such as food poisoning due to the consumption of product which is past its actual safe Use-by date. These recalls are usually at very high financial and reputational cost to food manufacturers.

The reasons/root causes for issues or mistakes resulting in label faults on food packaging are many and varied. They include human error and equipment faults. For example, a label printer on a production line can break down and the line carries on running. The faulty packaging therefore needs to be identified and the production line stopped. A common current process line Use-by check approach is to use a human operator to read and verify the packaging label. This check is conducted by either manually picking a pack from the line for inspection or verifying it through an image captured of the pack. However, these methods create mundane and repetitive tasks and therefore place the operator in an error-prone working environment.

Another common approach to control date codes is to use Optical Character Verification (OCV). This involves a supervisory system holding the correct expiry date string and transferring it to both the printer and the vision system. The latter will then verify its read and actions are taken depending on the result. However, OCV systems rely on consistency in expiry date format, packaging and camera view angle. This consistency tends to be hard to achieve in the food and drink manufacturing environment and therefore there is a great need for a more robust solution.

In this work, we have developed an automatic system based on a camera, which can efficiently and effectively recognize the expiry date information printed on different types of food packages. Food packages with wrong expiry date printed on will be picked up. This system will enable far greater control over the accuracy and legibility of critical ‘Use-by’/‘Best Before’ dates and also key traceability information in food and drink manufacturing operations, resulting in significantly increased food safety and compliance with related legislation.

To develop such a system, the first step is to identify expiry date regions as the region of interest (ROI) on a recorded food package image. And the expiry date recognition task is then performed within the ROI instead of the whole image. In this way, the computational costs can be saved to a large extent. One straightforward method to determine ROI is applying the text detection method for detecting text regions as ROI on a food package. For text detection, a number of traditional image processing based techniques have been applied, examples include Stroke Width Transform (SWT) based approach [3] and Maximally Stable Extremal Regions (MSER) based approach [4]. With the deep learning techniques having become mainstream in the image processing, computer vision and machine learning communities, different types of deep neural networks have been applied for the text detection [5, 6] with better results being obtained.

However, if the food package contains too much other text information in addition to the expiry date (that is the usual situation on the food package), the obtained ROI will still be tremendous. Instead of the text region detection for the ROI identification, in our work we apply a deep neural network approach for directly identifying the expiry date region as ROI. The fully convolutional network (FCN) in [5], which is originally developed for text detection, is fine-tuned by our dateset for expiry date region detection. By adopting such an approach, only the expiry date region can be extracted while other texts on the food package are excluded. The most precise ROI is directly obtained and computational costs can then be further reduced by performing the recognition on only the ROI of the expiry date region.

Based on the extracted ROI, the date characters blobs in the ROI can be directly extracted. Related shape features are then extracted for classification by an efficient nearest neighbour method. In our experiment, we have tested our system for both expiry date region detection and classification on different types of food packages in different captured image formats (colour/grayscale), with good results being obtained.

2 Method

In this section, we present the methodology for expiry date recognition on the food package, which is divided into two parts: expiry region identification and recognition. The block diagram of the proposed methodology is shown in Fig. 1. Details of every block are presented as follows.

2.1 Date Code Region Identification

For effectively identifying the expiry date region on the food package which contains different types of pictures/texts contents with different colours, a deep neural network based approach is applied. The deep neural network structure is a fully convolutional network (FCN) as described in [5], which was originally developed for detecting texts. The network is fine-tuned on our food package dataset for detecting the date expiry region.

The FCN structure is shown in Fig. 2, which is decomposed into three parts: feature extractor stem, feature-merging branch and output layer.

The stem part is a PVANet [7], with interleaving convolution and pooling layers. Four levels of feature maps, denoted as f_i are extracted from the original input image, whose sizes are $ \frac{1}{32},\,\frac{1}{16},\,\frac{1}{8}\, $ and $ \frac{1}{4} $ of the original input image. Features from different scale levels meet the requirements of detecting text regions with different sizes.

In the feature-merging branch, features are merged in the following strategy:

$$ g_{i} = \left\{ {\begin{array}{*{20}c} {unpool\left( {h_{i} } \right) if\, i \le 3 } \\ {conv_{3 \times 3} \left( {h_{i} } \right) if \,i = 4} \\ \end{array} } \right. $$

$$ h_{i} = \left\{ {\begin{array}{*{20}c} {f_{i} } & {if\, i = 1} \\ {conv_{3 \times 3} \left( {conv_{1 \times 1} \left( {\left[ {g_{i - 1} ;f_{i} } \right]} \right)} \right)} & {if \,i = 4} \\ \end{array} } \right. $$

(1)

where g_i is the merge based as in [5] and h_i is the merged feature map. And the operator [;]. represents concatenation along the channel axis. In each merging stage, the feature map from the last stage is first fed to an unpooling layer to double its size, and then concatenated with the current feature map. A conv_1×1 bottleneck cuts down the number of channels to reduce computation, followed by a conv_3×3 that fuses the information to finally produce the output of this merging stage. Following the last merging stage, a conv_3×3 layer produces the final feature map of the merging branch and feed it to the output layer.

The final output layer contains several conv_1×1 operations to project 32 channels of feature maps into 1 channel of score map F_s, which gives the likelihood that a pixel belong to the expiry date region as well as a multi-channel geometry map F_g, which could be either rotated box (RBOX) or quadrangle (QUAD) representing different geometries. RBOX geometry map contains a 4-channel map representing 4 distances from every pixel location to the top, right, bottom, left boundaries of a rectangle enclosing the candidate expiry date region, as well as a 1-channel map representing the angle of the related rectangle. QUAD geometry map is a 8-channel map, which contains the coordinate shift from four corner vertices of a quadrangle (representing candidate expiry date region) to every pixel position.

FCN Training and Testing. For obtaining the network parameters, firstly, a loss function is defined as:

$$ L = L_{s} + \lambda L_{g} $$

(2)

where L_s and L_g represent losses for score and geometry maps respectively, while λ is a balancing parameter.

The term L_s is defined as:

$$ L_{s} = - \beta Y^{*} log\hat{Y} - (1 - \beta )(1 - Y^{*} )log(1 - \hat{Y}) $$

(3)

where $ \hat{Y} $ and Y ^∗ represent the predicted and groundtruth score maps respectively. β is a balancing parameter. While the L_g is defined as scale-invariant IoU loss for RBOX geometry map and scale-normalized smoothed-L1 for the QUAD one as [5]. Based on the defined loss function, the network is trained end-to-end using ADAM optimizer until performance stops improving.

To determine the final expiry date region, first a threshold is set to find positions at which score map values are larger than it. The geometries associated with those positions on the geometry map will then be merged by the locality aware Non-Maximum Suppression (NMS) to determine the final expiry date region, which can achieve lower computational costs compared with the basic NMS algorithm. Under the assumption that the geometries from nearby pixels tend to be highly correlated, the locality-aware NMS is proposed to merge the geometries row by row. And while merging geometries in the same row, the geometry currently encountered will be merged with the last merged one. In this way, the computational costs could be reduced from O(n2) of the original NMS to O(n), where n is the number of candidate geometries. Figure 3 shows the results of different parts the expiry date region detection procedure.

2.2 Expiry Date Recognition

Expiry date will then be recognized based on the identified region by Tesseract OCR [10]. The Maximally Stable External Regions (MSER) algorithm will firstly be applied, to make a binarization of the extracted date code region with characters being differentiated from the background (Fig. 4 (b)). Component connected analysis [9] is then made to find blobs representing different characters, with small noisy blobs being filtered out (Fig. 4 (c)).

As in [8], for each candidate blob, the boundary will be extracted as in Fig. 4 (d) (here the Canny edge extraction operator is applied) and the corresponding shape features, such as topological and polynomial approximation can be extracted for characters classification. In this work, a simple but effective nearest neighbour (NN) approach is applied for the classification. The features of every blob will be compared with prototypes representing different characters. A blob will be classified as the character for which the related distance is the smallest.

3 Experimental Results

The proposed system is trained and tested on different types of food packages, with representative examples being shown in Fig. 5. We have collected 800 images from stores, among which 70% (560 images) are used for training and 30% (240 images) are used for testing.

3.1 Expiry Date Region Detection Evaluation

The FCN as mentioned in the previous section is fine tuned for identifying the expiry date region on the food package. We have manually masked the ground truth expiry date regions in the training dataset for tuning the FCN, to transfer it from a text detection network to a date code region detection one. We train the network in the GPU-supported tensorflow environment, with two PASCAL GPUs. The input image patches are resized to 512 × 512. Mini-batch training is applied with the batch size is 14 per GPU and the learning rate is set to be 0.0001 in the ADAM optimizer.

Figure 6 shows the comparison results between FCN before and after fine tuning. We can see that after fine-tuning, the original text detection network transfers from a text detection network to a expiry date detection one. The transferred network is tested on captured images of different food packages, with both colour and grayscale formats. Related results are presented in Fig. 7, we can see that the date code regions on different food packages can be successfully identified. For a qualitative analysis, we test the developed FCN on the aforementioned testing dataset containing 240 images. The testing results show that 236 out of 240 images, the date region is correctly identified with 4 out of 240 are missing. A total detection rate of 98% is obtained.

3.2 Expiry Date Recognition Evaluation

Based on the detected expiry date region, the characters within it are extracted and classified. We have applied the Tesseract OCR [10] for classification, which has implemented the characters extraction, feature extraction and classification steps as mentioned in the Sect. 2.2. Some initial results are presented that in Fig. 8. It is shown that the characters on the extracted expiry date region can be successfully recognized; however, some classification mistakes may happen when characters are blurred.

4 Conclusions

In this work, we have developed a novel food package expiry date recognition system based on the camera. A FCN deep neural network approach is applied to detect the expiry date region. Based on the detection results, the date character blobs will be extracted. Related features will be extracted and classified to particular characters. Such a system will potentially advance the assurance of food quality and safety. The experimental results show that the proposed method can achieve very good performance in identifying the expiry date region and classifying characters correctly when they are clear. However, if the characters are blurred, misclassifications can be made and that will be the future researches which will be investigated.

References

Eurostat: Manufacturing statistics - NACE Rev. 2 (2014). http://ec.europa.eu/eurostat/statistics-explained/index.php/
WHO FAO (2009). http://www.fao.org/docrep/012/a1552e/a1552e00.htm
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA (2010)
Google Scholar
Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., Girod, B.: Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In: 18th IEEE International Conference on Image Processing (ICIP), Brussels, Belgium (2011)
Google Scholar
Zhou, X., Yao, C., Wen, H., Wang, Y., Zhou, S., He, W., Liang, J.: EAST: an efficient and accurate scene text detector. In: International Conference on Computer Vision and Pattern Recognition, Honolulu, Hawaii, USA (2017)
Google Scholar
Zhang, Z., Shen, W., Yao, C., Bai, X.: Symmetry-based text line detection in natural scenes. In: International Conference on Computer Vision and Pattern Recognition, Boston, Massachusetts, USA (2015)
Google Scholar
Kim, K., Hong, S., Roh, B., Cheon, Y., Park, M.: PVANET: deep but lightweight neural networks for realtime object detection. In: arXiv preprint arXiv:1608.08021 (2016)
Smith, R.: An overview of the tesseract OCR engine. In: Ninth International Conference on Document Analysis and Recognition, Parana, Brazil (2007)
Google Scholar
Gonzalez, R.: Digital Image Processing. Third Edition, Pearson Education (2008)
Google Scholar
Tesseract-ocr. https://github.com/tesseract-ocr/tesseract

Download references

Author information

Authors and Affiliations

School of Computer Science, University of Lincoln, Lincoln, LN6 7TS, UK
Liyun Gong, Miao Yu, Wenting Duan & Xujiong Ye
National Centre for Food Manufacturing, University of Lincoln, Holbeach, PE12 7PT, UK
Kjartan Gudmundsson & Mark Swainson

Authors

Liyun Gong
View author publications
You can also search for this author in PubMed Google Scholar
Miao Yu
View author publications
You can also search for this author in PubMed Google Scholar
Wenting Duan
View author publications
You can also search for this author in PubMed Google Scholar
Xujiong Ye
View author publications
You can also search for this author in PubMed Google Scholar
Kjartan Gudmundsson
View author publications
You can also search for this author in PubMed Google Scholar
Mark Swainson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liyun Gong .

Editor information

Editors and Affiliations

School of Engineering, Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
University of Piraeus, Piraeus, Greece
Ilias Maglogiannis
University of Thessaly, Lamia, Greece
Vassilis Plagianakos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gong, L., Yu, M., Duan, W., Ye, X., Gudmundsson, K., Swainson, M. (2018). A Novel Camera Based Approach for Automatic Expiry Date Detection and Recognition on Food Packages. In: Iliadis, L., Maglogiannis, I., Plagianakos, V. (eds) Artificial Intelligence Applications and Innovations. AIAI 2018. IFIP Advances in Information and Communication Technology, vol 519. Springer, Cham. https://doi.org/10.1007/978-3-319-92007-8_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-92007-8_12
Published: 22 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92006-1
Online ISBN: 978-3-319-92007-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)