Complex Glyph Enhancement for License Plate Generation

Chen, Yu-Xiang; Liu, Qi; Chen, Song-Lu; Zhou, Fang; Liu, Yan; Chen, Feng; Yin, Xu-Cheng

doi:10.1007/978-3-031-46305-1_25

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14355)

Included in the conference series: International Conference on Image and Graphics

Abstract

The complex glyphs on license plates usually follow a long-tail distribution, leading to poor recognition performance on the tail classes. Supplementing the training data with generated license plates is an effective solution to this issue. However, for complex glyphs, previous methods are prone to generating incomplete structures and blurry strokes. The first reason is that complex glyphs occupy only a small portion of the license plate and therefore contribute little to the overall loss. The second is that, because of their complex structure and dense strokes, these glyphs are prone to being generated inaccurately. To solve these problems, we first propose a divide-and-conquer method that generates complex and simple glyphs separately and then fuses them into a complete license plate, thus strengthening the contribution of complex glyphs to the loss computation. Second, we increase the generation resolution of complex glyphs so that the model can learn dense structures and fine strokes. To limit computational cost, the remaining simple glyphs are generated at low resolution. Extensive experiments demonstrate that our method significantly enhances the realism of complex glyphs, and that the generated images boost recognition performance by 3% on SYSU. Additionally, we provide a dataset of 30,000 generated Chinese license plates with a uniform distribution of Chinese characters to promote research (https://github.com/ICIG2023-91/GCLPD).
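
The loss-dilution argument and the fuse step can be illustrated with a small NumPy sketch. This is not the authors' implementation: the plate size (32 x 128), the glyph box (the leftmost 32 x 16 region), the per-pixel L1 loss, and fusion by average pooling are all assumptions chosen for illustration, and the names `l1_loss` and `fuse` are hypothetical.

```python
import numpy as np

def l1_loss(pred, target):
    """Mean absolute error over all pixels (an assumed stand-in loss)."""
    return float(np.mean(np.abs(pred - target)))

def fuse(simple_plate, complex_glyph, box):
    """Paste a high-resolution glyph into a low-resolution plate.

    box = (top, left, height, width) in plate pixels. The glyph is
    average-pooled down to the box size; this assumes its shape is an
    integer multiple of the box shape.
    """
    top, left, h, w = box
    gh, gw = complex_glyph.shape
    fy, fx = gh // h, gw // w
    pooled = (
        complex_glyph[: h * fy, : w * fx]
        .reshape(h, fy, w, fx)
        .mean(axis=(1, 3))
    )
    fused = simple_plate.copy()
    fused[top : top + h, left : left + w] = pooled
    return fused

# Hypothetical layout: the complex glyph fills the leftmost 32 x 16 pixels.
PLATE_H, PLATE_W = 32, 128
BOX = (0, 0, 32, 16)

# Every pixel of the complex glyph is wrong by 1.0; the rest is perfect.
target = np.zeros((PLATE_H, PLATE_W))
pred = target.copy()
pred[0:32, 0:16] = 1.0

# Whole-plate loss: the glyph error is averaged over all 4096 pixels.
plate_loss = l1_loss(pred, target)
# Glyph-only loss: the same error is averaged over the glyph's 512 pixels.
glyph_loss = l1_loss(pred[0:32, 0:16], target[0:32, 0:16])

# Generate the complex glyph at 2x resolution, then fuse it into the plate.
high_res_glyph = np.ones((64, 32))
fused = fuse(np.zeros((PLATE_H, PLATE_W)), high_res_glyph, BOX)

print(plate_loss, glyph_loss, fused.shape)
```

Under these assumptions the same glyph error yields a loss eight times larger when the glyph is scored on its own, which is the kind of effect the divide-and-conquer split is meant to exploit.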

Acknowledgement

This research is supported by the National Key Research and Development Program of China (2020AAA0109700) and the National Natural Science Foundation of China (62076024, 62006018, U22B2055).

Author information

Authors and Affiliations

University of Science and Technology Beijing, Beijing, China
Yu-Xiang Chen, Qi Liu, Song-Lu Chen, Fang Zhou, Yan Liu & Xu-Cheng Yin
EEasy Technology Company Ltd., Zhuhai, China
Feng Chen

Corresponding author

Correspondence to Fang Zhou.

Editor information

Editors and Affiliations

Dalian University of Technology, Dalian, China
Huchuan Lu
University of Sydney, Sydney, NSW, Australia
Wanli Ouyang
Shenzhen University, Shenzhen, China
Hui Huang
Tsinghua University, Beijing, China
Jiwen Lu
Dalian University of Technology, Dalian, China
Risheng Liu
Institute of Automation, CAS, Beijing, China
Jing Dong
University of Technology Sydney, Sydney, NSW, Australia
Min Xu

About this paper

Cite this paper

Chen, YX. et al. (2023). Complex Glyph Enhancement for License Plate Generation. In: Lu, H., et al. Image and Graphics. ICIG 2023. Lecture Notes in Computer Science, vol 14355. Springer, Cham. https://doi.org/10.1007/978-3-031-46305-1_25

DOI: https://doi.org/10.1007/978-3-031-46305-1_25
Published: 29 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46304-4
Online ISBN: 978-3-031-46305-1
eBook Packages: Computer Science, Computer Science (R0)
