Urdu Handwritten Ligature Generation Using Generative Adversarial Networks (GANs)

Sharif, Marium; Ul-Hasan, Adnan; Shafait, Faisal

doi:10.1007/978-3-031-21648-0_29

Urdu Handwritten Ligature Generation Using Generative Adversarial Networks (GANs)

Conference paper
First Online: 25 November 2022

1038 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13639))

Abstract

Deep learning has significantly improved handwriting text recognition, esp. for Latin scripts. Arabic scripts including Urdu is a family of complex scripts and they pose difficult challenges for deep learning architectures. Data availability is a significant obstacle in developing Urdu handwriting recognition systems. Since gathering data is a costly and challenging task, there is a need to increase training data using novel approaches. One possible solution is to make a model that can generate similar yet different samples from the existing data samples. In this paper, we propose such models based on Generative Adversarial Networks (GANs) that have the ability to synthesize realistic samples similar to the original dataset. Our generator is class conditioned to produce Urdu samples of varying characters that differ in style. Visual and quantitative analysis convey that generated samples are of realistic nature and can be used to increase datasets. Synthesized samples integrated with the existing training set is shown to increase the performance of a handwriting recognition system.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Amazon textract: intelligently extract text and data with OCR (2019)
Google Scholar
Cloud vision API: detect text in images (2019)
Google Scholar
Ahmed, S.B., Naz, S., Swati, S., Razzak, I., Umar, A.I., Khan, A.A.: UCOM offline dataset-an Urdu handwritten dataset generation. Int. Arab J. Inf. Technol. (IAJIT) 14(2) (2017)
Google Scholar
Alonso, E., Moysset, B., Messina, R.: Adversarial generation of handwritten text images conditioned on sequences. In: 2019 International Conference on Document Analysis and Recognition (ICDAR), pp. 481–486. IEEE (2019)
Google Scholar
Arjovsky, M., Chintala, S., Bottou, L.: Wasserstein generative adversarial networks. In: International Conference on Machine Learning, pp. 214–223. PMLR (2017)
Google Scholar
Pierre, B.: Autoencoders, unsupervised learning, and deep architectures. In: Proceedings of ICML Workshop on Unsupervised and Transfer Learning, pp. 37–49. JMLR Workshop and Conference Proceedings (2012)
Google Scholar
Chang, B., Zhang, Q., Pan, S., Meng, L.: Generating handwritten Chinese characters using cyclegan. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 199–207. IEEE (2018)
Google Scholar
Dehghan, M., Faez, K., Ahmadi, M., Shridhar, M.: Handwritten Farsi (Arabic) word recognition: a holistic approach using discrete hmm. Pattern Recogn. 34(5), 1057–1065 (2001)
Article MATH Google Scholar
Farooqui, F.F., Hassan, M., Younis, M.S., Siddhu, M.K.: Offline hand written Urdu word spotting using random data generation. IEEE Access 8, 131119–131136 (2020)
Google Scholar
Fogel, S., Averbuch-Elor, H., Cohen, S., Mazor, S., Litman, R.: Scrabblegan: semi-supervised varying length handwritten text generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4324–4333 (2020)
Google Scholar
Goodfellow, I., et al.: Generative adversarial nets. Adv. Neural Inf. Process. Syst. 27 (2014)
Google Scholar
Gulrajani, I., Ahmed, F., Arjovsky, M., Dumoulin, V., Courville, A.C.: Improved training of wasserstein gans. Adv. Neural Inf. Process. Syst. 30 (2017)
Google Scholar
Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., Hochreiter, S.: Gans trained by a two time-scale update rule converge to a local nash equilibrium. Adv. Neural Inf. Process. Syst. 30 (2017)
Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, pp. 448–456. PMLR (2015)
Google Scholar
Khrulkov, V., Oseledets, I.: Geometry score: a method for comparing generative adversarial networks. arXiv preprint arXiv:1802.02664 (2018)
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015)
Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., Chen, X.: Improved techniques for training gans. Adv. Neural Inf. Process. Syst. 29 (2016)
Google Scholar
Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J. Big Data 6(1), 1–48 (2019)
Article Google Scholar
Zhong, Z., Jin, L., Xie, Z.: High performance offline handwritten Chinese character recognition using googlenet and directional feature maps. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 846–850. IEEE (2015)
Google Scholar
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical Engineering and Computer Sciences, National University of Sciences and Technology, Islamabad, Pakistan
Marium Sharif & Faisal Shafait
Deep Learning Lab, National Center of Artificial Intelligence, National University of Sciences and Technology, Islamabad, Pakistan
Adnan Ul-Hasan & Faisal Shafait

Authors

Marium Sharif
View author publications
You can also search for this author in PubMed Google Scholar
Adnan Ul-Hasan
View author publications
You can also search for this author in PubMed Google Scholar
Faisal Shafait
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marium Sharif .

Editor information

Editors and Affiliations

Walmart Inc., Hoboken, NJ, USA
Utkarsh Porwal
Universitat Autònoma de Barcelona, Barcelona, Spain
Alicia Fornés
National University of Sciences and Technology (NUST), Islamabad, Pakistan
Faisal Shafait

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sharif, M., Ul-Hasan, A., Shafait, F. (2022). Urdu Handwritten Ligature Generation Using Generative Adversarial Networks (GANs). In: Porwal, U., Fornés, A., Shafait, F. (eds) Frontiers in Handwriting Recognition. ICFHR 2022. Lecture Notes in Computer Science, vol 13639. Springer, Cham. https://doi.org/10.1007/978-3-031-21648-0_29

Download citation

DOI: https://doi.org/10.1007/978-3-031-21648-0_29
Published: 25 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21647-3
Online ISBN: 978-3-031-21648-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics