Abstract
In order to improve the performance of woodblock printing Mongolian words recognition, a method based on cycle-consistent generative adversarial network (CycleGAN) has been proposed for data augmentation. A well-trained CycleGAN model can learn image-to-image translation without paired examples. To be specific, the style of machine printing word images can be transformed into the corresponding word images with the style of woodblock printing by utilizing a CycleGAN, and vice versa. In this way, new instances of woodblock printing Mongolian word images are able to be generated by using the two generative models of CycleGAN. Thus, the aim of data augmentation could be attained. Given a dataset of woodblock printing Mongolian word images, experimental results demonstrate that the performance of woodblock printing Mongolian words recognition can be improved through such the data augmentation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Wei, H., Gao, G.: A keyword retrieval system for historical Mongolian document images. Int. J. Doc. Anal. Recognit. (IJDAR) 17(1), 33–45 (2014). https://doi.org/10.1007/s10032-013-0203-6
Gao, G., Su, X., Wei, H., Gong, Y.: Classical Mongolian words recognition in historical document. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, pp. 692–697. IEEE (2011)
Su, X., Gao, G., Wei, H., Bao, F.: Enhancing the mongolian historical document recognition system with multiple knowledge-based strategies. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds.) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science, vol. 9490, pp. 536–544. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26535-3_61
Su, X., Gao, G., Wei, H., Bao, F.: A knowledge-based recognition system for historical Mongolian documents. Int. J. Doc. Anal. Recognit. (IJDAR) 19(3), 221–235 (2016). https://doi.org/10.1007/s10032-016-0267-1
Wei, H., Gao, G.: A holistic recognition approach for woodblock-print Mongolian words based on convolutional neural network. In: Proceedings of the 26th IEEE International Conference on Image Processing, pp. 2726–2730. IEEE (2019)
Kang, Y., Wei, H., Zhang, H., Gao, G.: Woodblock-printing Mongolian words recognition by BI-LSTM with attention mechanism. In: Proceedings of the 15th International Conference on Document Analysis and Recognition, pp. 910–915. IEEE (2019)
Wei, H., Gao, G., Bao, Y.: A method for removing inflectional suffixes in word spotting of Mongolian Kanjur. In: Proceedings of the 11th International Conference on Document Analysis and Recognition, pp. 88–92. IEEE (2011)
Goodfellow, I., et al.: Generative adversarial nets. In: Proceedings of Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Sangkloy, P., Lu, J., Fang, C., Yu, F., Hays, J.: Scribbler: controlling deep image synthesis with sketch and color. In: Proceedings of 2017 IEEE International Conference on Computer Vision and Pattern Recognition, pp. 5400–5409. IEEE (2017)
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of 2017 IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1125–1134. IEEE (2017)
Tian, Y.: Zi2zi: Master Chinese calligraphy with conditional adversarial networks (2017). https://github.com/kaonashi-tyc/zi2zi
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of 2017 IEEE International Conference on Computer Vision, pp. 2223–2232. IEEE (2017)
Li, M., Huang, H., Ma, L., Liu, W., Zhang, T., Jiang, Y.: Unsupervised image-to-image translation with stacked cycle-consistent adversarial networks. In: Proceedings of 2018 European Conference on Computer Vision, pp. 184–199. IEEE (2018)
Zhu, X., Liu, Y., Li, J., Wan, T., Qin, Z.: Emotion classification with data augmentation using generative adversarial networks. In: Phung, D., Tseng, V., Webb, G., Ho, B., Ganji, M., Rashidi, L. (eds.) Advances in Knowledge Discovery and Data Mining. PAKDD 2018. Lecture Notes in Computer Science, vol. 10939, pp. 349–360. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93040-4_28
Shi, Z., Liu, M., Cao, Q., Ren, H., Luo, T.: A data augmentation method based on cycle-consistent adversarial networks for fluorescence encoded microsphere image analysis. Signal Process. 161, 195–202 (2019)
Hammami, M., Friboulet, D., Kechichian, R.: Cycle GAN-based data augmentation for multi-organ detection in CT images via Yolo. In: Proceedings of the 28th International Conference on Image Processing, pp. 390–393. IEEE (2020)
Chang, B., Zhang, Q., Pan, S., Meng, L.: Generating handwritten Chinese characters using CycleGAN. In: Proceedings of 2018 IEEE Winter Conference on Applications of Computer Vision, pp. 199–207. IEEE (2018)
Wu, L., Chen, X., Meng, L., Meng, X.: Multitask adversarial learning for Chinese font style transfer. In: Proceedings of 2020 International Joint Conference on Neural Networks, pp. 1–8. IEEE (2020)
Farooqui, F.F., Hassan, M., Younis, M.S., Siddhu, M.K.: Offline hand written Urdu word spotting using random data generation. IEEE Access 8, 131119–131136 (2020)
Zhang, H., Wei, H., Bao, F., Gao, G.: Segmentation-free printed traditional Mongolian OCR using sequence to sequence with attention model. In: Proceedings of the 14th International Conference on Document Analysis and Recognition, pp. 585–590. IEEE (2017)
Wei, H., Zhang, H., Zhang, J., Liu, K.: Multi-task learning based traditional Mongolian words recognition. In: Proceedings of the 25th International Conference on Pattern Recognition, pp. 1275–1281. IEEE (2021)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Acknowledgments
This study is supported by the Project for Science and Technology of Inner Mongolia Autonomous Region under Grant 2019GG281, the Natural Science Foundation of Inner Mongolia Autonomous Region under Grant 2019ZD14, the Program for Young Talents of Science and Technology in Universities of Inner Mongolia Autonomous Region under Grant NJYT-20-A05, and the Natural Science Foundation of China under Grant 61463038 and 61763034.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Wei, H., Liu, K., Zhang, J., Fan, D. (2021). Data Augmentation Based on CycleGAN for Improving Woodblock-Printing Mongolian Words Recognition. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12824. Springer, Cham. https://doi.org/10.1007/978-3-030-86337-1_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-86337-1_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86336-4
Online ISBN: 978-3-030-86337-1
eBook Packages: Computer ScienceComputer Science (R0)