Skip to main content
Log in

Towards end-to-end container code recognition

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Container code recognition can improve the efficiency and economy of the management system in the port. However, the task is different and complex due to the degradation of image quality caused by uneven illumination, background variation, smear, inaccurate character extraction, and so on. Current processing methods on container images usually provide the framework or modules on specific tasks, such as region detection and character classification, which are hard to implement or to be combined into a whole process. In this paper, we propose a fast end-to-end method of automatic recognition of container code that fills the gap by locating the region and detecting characters as well as making the classification. This allows the three tasks to work collaboratively by pipeline, which is critical to identify the container code. For evaluation, we collect around six thousand container images, including all kinds of circumstances from the local port. Compared with a few other methods and two-step approaches consisting of state-of-the-art character detector and character classifier, our system achieves some competitive results. Finally, the proposed system is verified on this dataset and the overall accuracy reaches 97.30%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Notes

  1. Nanjing Port is located in Nanjing, Jiangsu Province, China, and is the largest inland port in the world (depending on how you classify the ports in the Yangtze Delta), with throughput reaching 191 million tons of cargo in 2012.

References

  1. Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: IEEE Conference on computer vision and pattern recognition, pp 2863–2870

  2. Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: IEEE International conference on computer vision, pp 1–8

  3. Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: IEEE Conference on computer vision and pattern recognition, vol 2, pp 76–85

  4. Compare M, Baraldi P, Zio E (2020) Challenges to iot-enabled predictive maintenance for industry 4.0. IEEE Internet of Things Journal 7(5):4585–4597

    Article  Google Scholar 

  5. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Conference on computer vision and pattern recognition, vol 1, pp 886–893

  6. De Campos TE, Babu BR, Varma M et al (2009) Character recognition in natural images. VISAPP 7

  7. Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation. In: AAAI

  8. Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: IEEE Conference on computer vision and pattern recognition, pp 2963–2970

  9. Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2009) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645

    Article  Google Scholar 

  10. Felzenszwalb PF, Huttenlocher DP (2005) Pictorial structures for object recognition. Int J Comput Vis 61(1):55–79

    Article  Google Scholar 

  11. Feng BY, Ren M, Zhang XY, Suen CY (2014) Automatic recognition of serial numbers in bank notes. Pattern Recogn 47(8):2621–2634

    Article  Google Scholar 

  12. Fu Y, Chen X, Gao H (2009) A new connected component analysis algorithm based on max-tree. In: IEEE International conference on dependable, autonomic and secure computing, IEEE, pp 843–844

  13. He D, Yang X, Liang C, Zhou Z, Ororbi AG, Kifer D, Lee Giles C (2017) Multi-scale fcn with cascaded instance aware segmentation for arbitrary oriented word spotting in the wild. In: IEEE Conference on computer vision and pattern recognition, pp 3519–3528

  14. He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: IEEE International conference on computer vision, pp 3047–3055

  15. He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: IEEE Conference on computer vision and pattern recognition, pp 5020–5029

  16. He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: IEEE International conference on computer vision, pp 745–753

  17. Li C, Liu S, Xia Q, Wang H, Chen H (2019) Automatic container code localization and recognition via an efficient code detector and sequence recognition. In: IEEE/ASME International conference on advanced intelligent mechatronics, pp 532–537. https://doi.org/10.1109/AIM.2019.8868819

  18. Li L, Ghasemi A (2019) Iot-enabled machine learning for an algorithmic spectrum decision process. IEEE Internet of Things Journal 6(2):1911–1919

    Article  Google Scholar 

  19. Liao M, Shi B, Bai X (2018) Textboxes++: A single-shot oriented scene text detector. IEEE Trans Image Process 27(8):3676–3690

    Article  MathSciNet  Google Scholar 

  20. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision, Springer, pp 21–37

  21. Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: IEEE Conference on computer vision and pattern recognition, pp 5676–5685

  22. Liu Y, Jin L (2017) Deep matching prior network: Toward tighter multi-oriented text detection. In: IEEE Conference on computer vision and pattern recognition, pp 1962–1969

  23. Lyu P, Yao C, Wu W, Yan S, Bai X (2018) Multi-oriented scene text detection via corner localization and region segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 7553–7563

  24. Ozuysal M, Fua P, Lepetit V (2007) Fast keypoint recognition in ten lines of code. In: IEEE Conference on computer vision and pattern recognition, pp 9–16

  25. Shi B, Bai X, Belongie S (2017) Detecting oriented text in natural images by linking segments. In: IEEE Conference on computer vision and pattern recognition, pp 2550–2558

  26. Shi B, Yang M, Wang X, Lyu P, Yao C, Bai X (2018) Aster: an attentional scene text recognizer with flexible rectification. IEEE Trans Pattern Anal Mach Intell 12(8):1243–1256

    Google Scholar 

  27. Shotton J, Johnson M, Cipolla R (2008) Semantic texton forests for image categorization and segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 1–8

  28. Viola P, Jones M, et al. (2001) Rapid object detection using a boosted cascade of simple features. IEEE Conference on Computer Vision and Pattern Recognition 1:511–518

    Google Scholar 

  29. Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: International conference on computer vision, IEEE, pp 1457–1464

  30. Wang K, Belongie S (2010) Word spotting in the wild. In: European conference on computer vision, Springer, pp 591–604

  31. Wu W, Liu Z, Chen M, Yang X, He X (2012) An automated vision system for container-code recognition. Expert Syst Appl 39(3):2842–2855

    Article  Google Scholar 

  32. Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on computer vision and pattern recognition, IEEE, pp 1083–1090

  33. Yu T, Wang X, Shami A (2019) Uav-enabled spatial data sampling in large-scale iot systems using denoising autoencoder neural network. IEEE Internet of Things Journal 6(2):1856–1865

    Article  Google Scholar 

  34. Zhang Z, Zhang C, Shen W, Yao C, Liu W, Bai X (2016) Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on computer vision and pattern recognition, pp 4159–4167

  35. Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W, Liang J (2017) East: An efficient and accurate scene text detector. In: IEEE Conference on computer vision and pattern recognition, pp 5551–5560

Download references

Acknowledgements

We specially thanks the Nanjing Port for providing us the datasets and technique support. This work is supported in part by the Key Program of the National Natural Science Foundation of China (61932013), the Natural Science Foundation of Jiangsu Province of China (BK20200739), the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (20KJB520003) and the Research Foundation of Jiangsu for “333 high level talents training project” (BRA2020065). This work is also supported by the China Postdoctoral Science Foundation (2021M691655) and the Postdoctoral Science Foundation of Jiangsu Province of China (2021K172B). This article is also sponsored by NUPTSF (NY219149, NY220189). Yanchao Li is also supported by Henan Key Laboratory of Food Safety Data Intelligence, ZZULI (KF2020YB01).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yanchao Li.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Li, H. & Gao, G. Towards end-to-end container code recognition. Multimed Tools Appl 81, 15901–15918 (2022). https://doi.org/10.1007/s11042-022-12477-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12477-z

Keywords

Navigation