Abstract
Face sketch-to-photo transformation aims to generate face photo images from face sketches. Although such transformations have progressed significantly with the development of deep learning techniques in recent years, generating face photos with realistic photo styles and rich facial details is still challenging. In this paper, a new realistic face sketch-to-photo transformation method is proposed based on the feature-filtered residual attention network (FRAN), which is able to propagate more precise feature information through the deep network. Specifically, a feature-filtered residual module is constructed by filtering the feature maps in the residual block, so that short-term feature information is filtered. In addition, a decoder-guided attention module is designed to integrate and filter the long-term feature information. Moreover, to synthesize face photo images with more facial details, a Sobel operator-based detail loss is proposed to constrain the network training. Experimental results on public datasets demonstrate that FRAN generates more realistic face photo images than state-of-the-art approaches in terms of visual perception and quality evaluation. Furthermore, the face photo images generated by FRAN achieve higher face recognition accuracy than those generated by the compared methods.
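To make the idea of a Sobel operator-based detail loss concrete, the following is a minimal NumPy sketch, not the authors' implementation. The abstract does not give the exact formulation, so this sketch assumes the loss is the mean absolute difference between the Sobel gradient magnitudes of the generated and ground-truth photos; the function names (`filter3x3`, `sobel_magnitude`, `sobel_detail_loss`), the edge padding, and the grayscale input are all illustrative assumptions.

```python
import numpy as np

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
SOBEL_Y = SOBEL_X.T


def filter3x3(image, kernel):
    """Cross-correlate a 2-D image with a 3x3 kernel, using edge padding."""
    padded = np.pad(image.astype(np.float64), 1, mode="edge")
    height, width = image.shape
    out = np.zeros((height, width), dtype=np.float64)
    # Accumulate the nine shifted, weighted copies of the padded image.
    for dy in range(3):
        for dx in range(3):
            out += kernel[dy, dx] * padded[dy:dy + height, dx:dx + width]
    return out


def sobel_magnitude(image):
    """Gradient magnitude of a grayscale image from the two Sobel responses."""
    gx = filter3x3(image, SOBEL_X)
    gy = filter3x3(image, SOBEL_Y)
    return np.sqrt(gx ** 2 + gy ** 2)


def sobel_detail_loss(generated, target):
    """Assumed form: mean absolute difference between the two Sobel edge maps."""
    diff = sobel_magnitude(generated) - sobel_magnitude(target)
    return float(np.mean(np.abs(diff)))
```

Under this assumed form, the loss is zero when the generated photo has the same edge map as the target and grows as edges and fine details are missing or misplaced. In a training setup it would typically be implemented with a differentiable convolution in a deep learning framework and added to the other loss terms with a weighting factor.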
Acknowledgments
This study has been supported in part by the National Natural Science Foundation of China (62261025, 62262023, 62072218, 61862030), by the Natural Science Foundation of Jiangxi Province (20192ACB20002, 20192ACBL21008), by the Project of the Education Department of Jiangxi Province (GJJ200541), and by the Postdoctoral Research Projects of Jiangxi Province (2020KY44).
About this article
Cite this article
Wan, W., Yang, Y., Huang, S. et al. FRAN: feature-filtered residual attention network for realistic face sketch-to-photo transformation. Appl Intell 53, 15946–15956 (2023). https://doi.org/10.1007/s10489-022-04352-z
DOI: https://doi.org/10.1007/s10489-022-04352-z