Abstract
Denoising of Chinese calligraphic tablet images is of great importance in regard to the study of both content and character shapes in these images. Formerly GAN (generative adversarial network) based image denoising methods model the noise in the generator and then perform denoising by CNN (convolutional neural networks) algorithms. These methods still leave room for improvement. In this paper, a triple discriminators equipped GAN for generative denoising is proposed, with the three channels of discriminators enhancing the denoising result by different means. Another noise modeling module based on CycleGAN is used to produce the paired input data. Quantitative index are obtained for these methods; the PSNR and SSIM of our method on publicly available data is 21.84 and 0.93 respectively, which is preferable to BM3D, DnCNN, FormResNet, CycleGAN and our previous method.
Similar content being viewed by others
References
Barham P, Chen J, Chen Z (n.d.) TensorFlow : a system for large-scale machine learning, OSDI’16: proceedings of the 12th USENIX conference on operating systems design and implementation, URL https://doi.org/10.5555/3026877.3026899
Buades A, Coll B, Morel J-M (2011) URL. Non-Local Means Denoising, Computer Vision and Pattern Recognition 1:208–212. https://doi.org/10.5201/ipol.2011.bcm_nlm
Chen Y, Lai YK, Liu YJ (2018) CartoonGAN: generative adversarial networks for photo Cartoonization, IEEE/CVF Conference on Computer Vision and Pattern Recognition. URL https://doi.org/10.1109/CVPR.2018.00986.
Dabov K, Foi A, Katkovnik V, Egiazarian K (2007) Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans. Image Process. 16:2080–2095. https://doi.org/10.1109/TIP.2007.901238
El Helou M, Susstrunk S (2020) Blind Universal Bayesian Image Denoising with Gaussian Noise Level Learning. IEEE Trans Image Process 29:4885–4897. https://doi.org/10.1109/TIP.2020.2976814
Elad M, Aharon M (2006) Image denoising via sparse and redundant representations over learned dictionaries. IEEE Trans Image Process 15:3736–3745. https://ieeexplore.ieee.org/document/4011956. Accessed 13 Nov 2006
Goodfellow I, NIPS (2016) Tutorial: generative adversarial networks, (2016). https://arxiv.org/abs/1701.00160. Accessed 31 Dec 2016
Goodfellow IJ, Pouget-Abadie J, Mirza M (2014) Generative adversarial nets. Neural Inf Process Syst (NIPS 2014):2672–2680. https://doi.org/10.5555/2969033.2969125
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition, Computer Science. URL https://www.computer.org/csdl/proceedings-article/cvpr/2016/8851a770/12OmNxvwoXv
Hou B, Liu Q, Wang H, Wang Y (2020) From W-Net to CDGAN : Bi-temporal Change Detection via Deep Learning Techniques, IEEE Transact Geosci Remote Sens 1–12. URL https://ieeexplore.ieee.org/document/8891676
Huang G, Weinberger KQ (n.d.) Densely Connected Convolutional Networks, IEEE Conference on Computer Vision and Pattern Recognition (CVPR),URL https://ieeexplore.ieee.org/document/8099726
Huang ZK, Li ZH, Huang H, Li ZB, Hou LY (2016) Comparison of different image denoising algorithms for Chinese calligraphy images. Neurocomputing 188:102–112. https://doi.org/10.1016/j.neucom.2014.11.106
Isola P, Efros AA, Ai B, Berkeley UC (2018) Image-to-image translation with conditional adversarial networks, IEEE conference on computer vision and pattern recognition. http://openaccess.thecvf.com/content_cvpr_2017/papers/Isola_Image-To-Image_Translation_With_CVPR_2017_paper.pdf. Accessed 9 Nov 2017
Jain V, Seung HS (2009) Natural image denoising with convolutional networks, Conference on Advances in Neural Information Processing Systems Curran Associates Inc, 769–776. https://doi.org/10.5555/2981780.2981876
Ji G, Wei Q, Wang KL (2020) In-air handwritten Chinese text recognition with temporal convolutional recurrent network. Pattern Recognit
Jia X, Liu S, Feng X, Zhang L (2019) Focnet: A fractional optimal control network for image denoising. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). https://ieeexplore.ieee.org/document/8954104. Accessed 9 Jan 2020
Jiao J, Tu WC, He S, Lau RWH (2017) FormResNet: Formatted residual learning for image restoration. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). https://ieeexplore.ieee.org/document/8014874. Accessed 24 Aug 2017
D.P. Kingma, J.L. Ba, Adam (2014) A method for stochastic optimization, Computer Science 1–15. URL http://www.oalib.com/paper/4068193
C. Luo, L. Jin, Z. Sun, I. Engineering, MORAN: a multi-object rectified attention network for scene text recognition, pattern Recognitiont. (2019). URL https://doi.org/10.1016/j.patcog.2019.01.020
Maas AL, Ng A.Y (2013) Rectifier nonlinearities improve neural network acoustic models, international conference on machine learning 28. https://www.mendeley.com/catalogue/a4a3dd28-b56b-3e0c-ac53-2817625a2215/. Accessed 30 Sept 2013
Mao X-J, Shen C, Yang Y-B (2016) Image Restoration Using Convolutional Auto-encoders with Symmetric Skip Connections, Conference on Neural Information Processing Systems. URL https://dblp.uni-trier.de/rec/journals/corr/MaoSY16a.html
My VD,Manh ND,Phuong LT, Sang-Woong L (2021) HI-GAN: A hierarchical generative adversarial network for blind denoising of real photographs[J]. Inf Sci, 570. Access date: [2021-03-06].
Wang Z, Bovik AC, Sheikh HR (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13:600–612. ieeexplore.ieee.org/document/1284395. Accessed 13 Apr 2004
Wu Y, Shutao L (2021) A novel fusion paradigm for multi-channel image denoising[J]. Inf Fusion 77:62–69
Xia Y, He D, Qin T (2016) Dual learning for machine translation, neural information processing systems. http://papers.nips.cc/paper/6469-dual-learning-for-machine-translation. Accessed 31 Dec 2016
Xie J, Xu L, Chen E (2012) Image denoising and inpainting with deep neural networks. Advances in neural information processing systems. https://doi.org/10.5555/2999134.2999173
Zach C, Eth Z, Klopschitz M, Pollefeys M, Eth Z (2010) Disambiguating visual relations using loop constraints. IEEE computer society conference on computer vision and pattern recognition. URL https://ieeexplore.ieee.org/document/5539801
Zhang K, Zuo W, Chen Y, Meng D, Zhang L (2017) Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Trans Image Process 26(2017):3142–3155. https://doi.org/10.1109/TIP.2017.2662206, 3142, 3155
Zhang K, Zuo W, Zhang L (2018) FFDNet: Toward a fast and flexible solution for CNN-Based image denoising. IEEE Trans Image Process 27:4608–4622. https://doi.org/10.1109/TIP.2018.2839891
Zhang Y, Tian Y, Kong Y (2018) Residual dense network for image super-resolution. In: IEEE conference on computer vision and pattern recognition. http://openaccess.thecvf.com/content_cvpr_2018/html/1329.html. Accessed 27 Dec 2018
Zhang J, Guo M, Fan J (2020) A novel generative adversarial net for calligraphic tablet images denoising. Multimed Tools Appl 79:119–140. https://doi.org/10.1007/s11042-019-08052-8
J.Y. Zhu, T. Park, P. Isola, A.A. Efros (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks, IEEE Int Conf Computer Vis https://doi.org/10.1109/ICCV.2017.244.
Acknowledgements
This work is supported under the support of funds of Shaanxi Department of science and technology 2022QFY01-17, 2022JM-326. We thank Professor Songhua Xu for his kind discussion of the method in the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, J., Shi, J., Li, M. et al. Triple discriminators - equipped GAN for Denoising of Chinese calligraphic tablet images. Multimed Tools Appl 81, 42691–42711 (2022). https://doi.org/10.1007/s11042-022-13478-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13478-8