Towards end-to-end container code recognition

Li, Yanchao; Li, Hao; Gao, Guangwei

doi:10.1007/s11042-022-12477-z

Towards end-to-end container code recognition

Published: 02 March 2022

Volume 81, pages 15901–15918, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

239 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Container code recognition can improve the efficiency and economy of the management system in the port. However, the task is different and complex due to the degradation of image quality caused by uneven illumination, background variation, smear, inaccurate character extraction, and so on. Current processing methods on container images usually provide the framework or modules on specific tasks, such as region detection and character classification, which are hard to implement or to be combined into a whole process. In this paper, we propose a fast end-to-end method of automatic recognition of container code that fills the gap by locating the region and detecting characters as well as making the classification. This allows the three tasks to work collaboratively by pipeline, which is critical to identify the container code. For evaluation, we collect around six thousand container images, including all kinds of circumstances from the local port. Compared with a few other methods and two-step approaches consisting of state-of-the-art character detector and character classifier, our system achieves some competitive results. Finally, the proposed system is verified on this dataset and the overall accuracy reaches 97.30%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

Automatic Container Code Recognition Using MultiDeep Pipeline

Build an Effective System for Container Code Recognition

Performance Evaluation of Container Identification Detection Algorithm

Notes

Nanjing Port is located in Nanjing, Jiangsu Province, China, and is the largest inland port in the world (depending on how you classify the ports in the Yangtze Delta), with throughput reaching 191 million tons of cargo in 2012.

References

Baek Y, Lee B, Han D, Yun S, Lee H (2019) Character region awareness for text detection. In: IEEE Conference on computer vision and pattern recognition, pp 2863–2870
Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: IEEE International conference on computer vision, pp 1–8
Chen X, Yuille AL (2004) Detecting and reading text in natural scenes. In: IEEE Conference on computer vision and pattern recognition, vol 2, pp 76–85
Compare M, Baraldi P, Zio E (2020) Challenges to iot-enabled predictive maintenance for industry 4.0. IEEE Internet of Things Journal 7(5):4585–4597
Article Google Scholar
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Conference on computer vision and pattern recognition, vol 1, pp 886–893
De Campos TE, Babu BR, Varma M et al (2009) Character recognition in natural images. VISAPP 7
Deng D, Liu H, Li X, Cai D (2018) Pixellink: Detecting scene text via instance segmentation. In: AAAI
Epshtein B, Ofek E, Wexler Y (2010) Detecting text in natural scenes with stroke width transform. In: IEEE Conference on computer vision and pattern recognition, pp 2963–2970
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2009) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Article Google Scholar
Felzenszwalb PF, Huttenlocher DP (2005) Pictorial structures for object recognition. Int J Comput Vis 61(1):55–79
Article Google Scholar
Feng BY, Ren M, Zhang XY, Suen CY (2014) Automatic recognition of serial numbers in bank notes. Pattern Recogn 47(8):2621–2634
Article Google Scholar
Fu Y, Chen X, Gao H (2009) A new connected component analysis algorithm based on max-tree. In: IEEE International conference on dependable, autonomic and secure computing, IEEE, pp 843–844
He D, Yang X, Liang C, Zhou Z, Ororbi AG, Kifer D, Lee Giles C (2017) Multi-scale fcn with cascaded instance aware segmentation for arbitrary oriented word spotting in the wild. In: IEEE Conference on computer vision and pattern recognition, pp 3519–3528
He P, Huang W, He T, Zhu Q, Qiao Y, Li X (2017) Single shot text detector with regional attention. In: IEEE International conference on computer vision, pp 3047–3055
He T, Tian Z, Huang W, Shen C, Qiao Y, Sun C (2018) An end-to-end textspotter with explicit alignment and attention. In: IEEE Conference on computer vision and pattern recognition, pp 5020–5029
He W, Zhang XY, Yin F, Liu CL (2017) Deep direct regression for multi-oriented scene text detection. In: IEEE International conference on computer vision, pp 745–753
Li C, Liu S, Xia Q, Wang H, Chen H (2019) Automatic container code localization and recognition via an efficient code detector and sequence recognition. In: IEEE/ASME International conference on advanced intelligent mechatronics, pp 532–537. https://doi.org/10.1109/AIM.2019.8868819
Li L, Ghasemi A (2019) Iot-enabled machine learning for an algorithmic spectrum decision process. IEEE Internet of Things Journal 6(2):1911–1919
Article Google Scholar
Liao M, Shi B, Bai X (2018) Textboxes++: A single-shot oriented scene text detector. IEEE Trans Image Process 27(8):3676–3690
Article MathSciNet Google Scholar
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: Single shot multibox detector. In: European conference on computer vision, Springer, pp 21–37
Liu X, Liang D, Yan S, Chen D, Qiao Y, Yan J (2018) Fots: Fast oriented text spotting with a unified network. In: IEEE Conference on computer vision and pattern recognition, pp 5676–5685
Liu Y, Jin L (2017) Deep matching prior network: Toward tighter multi-oriented text detection. In: IEEE Conference on computer vision and pattern recognition, pp 1962–1969
Lyu P, Yao C, Wu W, Yan S, Bai X (2018) Multi-oriented scene text detection via corner localization and region segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 7553–7563
Ozuysal M, Fua P, Lepetit V (2007) Fast keypoint recognition in ten lines of code. In: IEEE Conference on computer vision and pattern recognition, pp 9–16
Shi B, Bai X, Belongie S (2017) Detecting oriented text in natural images by linking segments. In: IEEE Conference on computer vision and pattern recognition, pp 2550–2558
Shi B, Yang M, Wang X, Lyu P, Yao C, Bai X (2018) Aster: an attentional scene text recognizer with flexible rectification. IEEE Trans Pattern Anal Mach Intell 12(8):1243–1256
Google Scholar
Shotton J, Johnson M, Cipolla R (2008) Semantic texton forests for image categorization and segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 1–8
Viola P, Jones M, et al. (2001) Rapid object detection using a boosted cascade of simple features. IEEE Conference on Computer Vision and Pattern Recognition 1:511–518
Google Scholar
Wang K, Babenko B, Belongie S (2011) End-to-end scene text recognition. In: International conference on computer vision, IEEE, pp 1457–1464
Wang K, Belongie S (2010) Word spotting in the wild. In: European conference on computer vision, Springer, pp 591–604
Wu W, Liu Z, Chen M, Yang X, He X (2012) An automated vision system for container-code recognition. Expert Syst Appl 39(3):2842–2855
Article Google Scholar
Yao C, Bai X, Liu W, Ma Y, Tu Z (2012) Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on computer vision and pattern recognition, IEEE, pp 1083–1090
Yu T, Wang X, Shami A (2019) Uav-enabled spatial data sampling in large-scale iot systems using denoising autoencoder neural network. IEEE Internet of Things Journal 6(2):1856–1865
Article Google Scholar
Zhang Z, Zhang C, Shen W, Yao C, Liu W, Bai X (2016) Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on computer vision and pattern recognition, pp 4159–4167
Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W, Liang J (2017) East: An efficient and accurate scene text detector. In: IEEE Conference on computer vision and pattern recognition, pp 5551–5560

Download references

Acknowledgements

We specially thanks the Nanjing Port for providing us the datasets and technique support. This work is supported in part by the Key Program of the National Natural Science Foundation of China (61932013), the Natural Science Foundation of Jiangsu Province of China (BK20200739), the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (20KJB520003) and the Research Foundation of Jiangsu for “333 high level talents training project” (BRA2020065). This work is also supported by the China Postdoctoral Science Foundation (2021M691655) and the Postdoctoral Science Foundation of Jiangsu Province of China (2021K172B). This article is also sponsored by NUPTSF (NY219149, NY220189). Yanchao Li is also supported by Henan Key Laboratory of Food Safety Data Intelligence, ZZULI (KF2020YB01).

Author information

Authors and Affiliations

School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, China
Yanchao Li & Guangwei Gao
School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
Hao Li

Authors

Yanchao Li
View author publications
You can also search for this author in PubMed Google Scholar
Hao Li
View author publications
You can also search for this author in PubMed Google Scholar
Guangwei Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yanchao Li.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Y., Li, H. & Gao, G. Towards end-to-end container code recognition. Multimed Tools Appl 81, 15901–15918 (2022). https://doi.org/10.1007/s11042-022-12477-z

Download citation

Received: 02 December 2020
Revised: 22 June 2021
Accepted: 25 January 2022
Published: 02 March 2022
Issue Date: May 2022
DOI: https://doi.org/10.1007/s11042-022-12477-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Towards end-to-end container code recognition

Abstract

Access this article

Similar content being viewed by others

Automatic Container Code Recognition Using MultiDeep Pipeline

Build an Effective System for Container Code Recognition

Performance Evaluation of Container Identification Detection Algorithm

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Towards end-to-end container code recognition

Abstract

Access this article

Similar content being viewed by others

Automatic Container Code Recognition Using MultiDeep Pipeline

Build an Effective System for Container Code Recognition

Performance Evaluation of Container Identification Detection Algorithm

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation