Abstract
Container code recognition can improve the efficiency and reduce the cost of port management systems. However, the task is difficult and complex because image quality is degraded by uneven illumination, background variation, smear, and similar factors, and because character extraction is often inaccurate. Existing methods for container images usually provide frameworks or modules for specific tasks, such as region detection and character classification, which are hard to implement or to combine into a complete process. In this paper, we propose a fast end-to-end method for automatic container code recognition that fills this gap by locating the code region, detecting characters, and classifying them. The three tasks work collaboratively in a pipeline, which is critical for identifying the container code. For evaluation, we collected around six thousand container images from the local port, covering a wide range of conditions. Compared with several other methods and with two-step approaches that combine a state-of-the-art character detector with a character classifier, our system achieves competitive results. On this dataset, the overall accuracy of the proposed system reaches 97.30%.
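The data flow between the three tasks named in the abstract (region location, character detection, character classification) can be pictured with a minimal sketch. It is not the authors' implementation: every name below (`CodePipeline`, `toy_locate`, `toy_detect`, `toy_classify`) is hypothetical, the stages are toy stand-ins rather than trained models, and the left-to-right ordering assumes a single horizontal line of characters.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# All names here are hypothetical illustrations, not taken from the paper.
Box = Tuple[int, int, int, int]   # (x, y, width, height) in pixels
Image = Sequence[Sequence[int]]   # grayscale image as rows of pixel values


@dataclass
class CodePipeline:
    """Chains the three stages: locate region, detect characters, classify."""
    locate_region: Callable[[Image], Box]
    detect_characters: Callable[[Image, Box], List[Box]]
    classify_character: Callable[[Image, Box], str]

    def recognize(self, image: Image) -> str:
        region = self.locate_region(image)
        char_boxes = self.detect_characters(image, region)
        # Assumption: one horizontal line of text, read left to right.
        char_boxes = sorted(char_boxes, key=lambda box: box[0])
        return "".join(self.classify_character(image, box) for box in char_boxes)


# Toy stand-ins so the sketch runs: each "pixel" stores a character code.
def toy_locate(image: Image) -> Box:
    return (0, 0, len(image[0]), len(image))


def toy_detect(image: Image, region: Box) -> List[Box]:
    x, y, w, h = region
    # Return boxes in reverse order to show that recognize() re-sorts them.
    return [(x + i, y, 1, h) for i in reversed(range(w))]


def toy_classify(image: Image, box: Box) -> str:
    x, y, _, _ = box
    return chr(image[y][x])


toy_image = [[ord(c) for c in "ABCU1234567"]]
pipeline = CodePipeline(toy_locate, toy_detect, toy_classify)
result = pipeline.recognize(toy_image)
print(result)
```

In a real system each callable would wrap a learned model. The sketch only shows how the output of one stage feeds the next, which is the property the abstract describes as the three tasks working collaboratively.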
Notes
Nanjing Port is located in Nanjing, Jiangsu Province, China, and is the largest inland port in the world (depending on how you classify the ports in the Yangtze Delta), with throughput reaching 191 million tons of cargo in 2012.
Acknowledgements
We especially thank Nanjing Port for providing the datasets and technical support. This work is supported in part by the Key Program of the National Natural Science Foundation of China (61932013), the Natural Science Foundation of Jiangsu Province of China (BK20200739), the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (20KJB520003) and the Research Foundation of Jiangsu for “333 high level talents training project” (BRA2020065). This work is also supported by the China Postdoctoral Science Foundation (2021M691655) and the Postdoctoral Science Foundation of Jiangsu Province of China (2021K172B). This article is also sponsored by NUPTSF (NY219149, NY220189). Yanchao Li is also supported by Henan Key Laboratory of Food Safety Data Intelligence, ZZULI (KF2020YB01).
Cite this article
Li, Y., Li, H. & Gao, G. Towards end-to-end container code recognition. Multimed Tools Appl 81, 15901–15918 (2022). https://doi.org/10.1007/s11042-022-12477-z
DOI: https://doi.org/10.1007/s11042-022-12477-z