Abstract
Learning-based face photo-sketch synthesis has made great progress in the past few years because of the development of the generate adversarial networks (GANs) [1]. However, these existing GAN-based methods mostly yield poor texture and details on the synthesized sketch/photo which leads low perceptual similarity between the synthesized sketch/photo and real sketch/photo. In order to tackle this problem, we first introduce the perceptual loss into our objective loss function which can measure the difference of content and style between synthesized sketch/photo and real sketch/photo in feature level. Second, we propose a feature map based loss termed content feature loss, which are utilized to supervise the generator in our network to make the synthesized sketch/photo have more perceptual quality. To achieve this, we use the pre-trained VGG network to extract the feature maps of the sketch/photo as a feature extractor, and calculate the Euclidean difference between these feature maps and the feature maps from the hidden layers of the generator. Extensive experiments both synthesis quality and recognition ability assessment of the public face photo-sketch database are conducted to show that our method can obtain better results in comparison with existing state-of-the-art methods.
Supported in part by the National Key Research and Development Program of China under Grant 2018AAA0103202, in part by the National Natural Science Foundation of China under Grant 61922066, Grant 61876142, Grant 61671339, Grant 61772402, Grant U1605252, Grant 61976166, and Grant 62036007, in part by the National High-Level Talents Special Support Program of China under Grant CS31117200001, in part by the Fundamental Research Funds for the Central Universities under Grant JB190117, in part by the Xidian University Intellifusion Joint Innovation Laboratory of Artificial Intelligence, and in part by the Innovation Fund of Xidian University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Tang, X., Wang, X.: Face photo recognition using sketch. In Proceedings. International Conference on Image Processing, vol. 1, pp. I. IEEE (2002)
Liu, Q., Tang, X., Jin, H., Lu, H., Ma, S.: A nonlinear approach for face sketch synthesis and recognition. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 1005–1010. IEEE (2005)
Gao, X., Wang, N., Tao, D., Li, X.: Face sketch-photo synthesis and retrieval using sparse representation. IEEE Trans. Circuits Syst. Video Technol. 22(8), 1213–1226 (2012)
Wang, N., Zhu, M., Li, J., Song, B., Li, Z., Fast face sketch synthesis: Data-driven vs. model-driven. Neurocomputing 257, 214–221 (2017)
Wang, N., Gao, X., Li, J.: Random sampling for fast face sketch synthesis. Pattern Recogn. 76, 215–227 (2018)
Zhang, L., Lin, L., Wu, X., Ding, S., Zhang, L.: End-to-end photo-sketch generation via fully convolutional representation learning. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 627–634 (2015)
Wang, N., Zha, W., Li, J., Gao, X.: Back projection: an effective postprocessing method for GAN-based face sketch synthesis. Pattern Recogn. Lett. 107, 59–65 (2018)
Isola, P., Zhu, J.-Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1125–1134 (2017)
Zhu, J.-Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Gatys, L., Ecker, A.S., Bethge, M.: Texture synthesis using convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 262–270 (2015)
Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2414–2423 (2016)
Gong, C., Shi, H., Liu, T., Zhang, C., Yang, J., Tao, D.: Loss decomposition and centroid estimation for positive and unlabeled learning. IEEE Trans. Pattern Anal. Mach. Intell., 1 (2019)
Gong, C., Tao, D., Liu, W., Liu, L., Yang, J.: Label propagation via teaching-to-learn and learning-to-teach. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1452–1465 (2016)
Gong, C., Tao, D., Maybank, S.J., Liu, W., Kang, G., Yang, J.: Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans. Image Proces. 25(7), 3249–3260 (2016)
Wang, L., Sindagi, V., Patel, V.: High-quality facial photo-sketch synthesis using multi-adversarial networks. In: 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pp. 83–90. IEEE (2018)
Zhang, S., Ji, R., Jie, H., Xiaoqiang, L., Li, X.: Face sketch synthesis by multidomain adversarial learning. IEEE Trans. Neural Netw. Learn. Syst. 30(5), 1419–1428 (2018)
Yu, J., Xu, X., Gao, F., Shi, S., Wang, M., Tao, D., Huang, Q.: Toward realistic face photo-sketch synthesis via composition-aided GANs. IEEE Trans. Cybern., 1–13 (2020)
Chen, C., Liu, W., Tan, X., Wong, K.-Y.K.: Semi-supervised learning for face sketch synthesis in the wild. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) ACCV 2018. LNCS, vol. 11361, pp. 216–231. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20887-5_14
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_43
Wang, X., Tang, X.: Face photo-sketch synthesis and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(11), 1955–1967 (2008)
Fan, D.-P., et al.: Scoot: a perceptual metric for facial sketches. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5612–5622 (2019)
Chen, L.-F., Liao, H.-Y.M., Ko, M.-T., Lin, J.-C., Yu, G.-J.: A new LDA-based face recognition system which can solve the small sample size problem. Pattern Recogn. 33(10), 1713–1726 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, K., Wang, N., Gao, X. (2020). Feature Space Based Loss for Face Photo-Sketch Synthesis. In: Peng, Y., et al. Pattern Recognition and Computer Vision. PRCV 2020. Lecture Notes in Computer Science(), vol 12305. Springer, Cham. https://doi.org/10.1007/978-3-030-60633-6_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-60633-6_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60632-9
Online ISBN: 978-3-030-60633-6
eBook Packages: Computer ScienceComputer Science (R0)