Feature Space Based Loss for Face Photo-Sketch Synthesis

Li, Keyu; Wang, Nannan; Gao, Xinbo

doi:10.1007/978-3-030-60633-6_31

Keyu Li¹⁶,
Nannan Wang¹⁶ &
Xinbo Gao^17,18

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12305))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

2570 Accesses

Abstract

Learning-based face photo-sketch synthesis has made great progress in the past few years because of the development of the generate adversarial networks (GANs) [1]. However, these existing GAN-based methods mostly yield poor texture and details on the synthesized sketch/photo which leads low perceptual similarity between the synthesized sketch/photo and real sketch/photo. In order to tackle this problem, we first introduce the perceptual loss into our objective loss function which can measure the difference of content and style between synthesized sketch/photo and real sketch/photo in feature level. Second, we propose a feature map based loss termed content feature loss, which are utilized to supervise the generator in our network to make the synthesized sketch/photo have more perceptual quality. To achieve this, we use the pre-trained VGG network to extract the feature maps of the sketch/photo as a feature extractor, and calculate the Euclidean difference between these feature maps and the feature maps from the hidden layers of the generator. Extensive experiments both synthesis quality and recognition ability assessment of the public face photo-sketch database are conducted to show that our method can obtain better results in comparison with existing state-of-the-art methods.

Supported in part by the National Key Research and Development Program of China under Grant 2018AAA0103202, in part by the National Natural Science Foundation of China under Grant 61922066, Grant 61876142, Grant 61671339, Grant 61772402, Grant U1605252, Grant 61976166, and Grant 62036007, in part by the National High-Level Talents Special Support Program of China under Grant CS31117200001, in part by the Fundamental Research Funds for the Central Universities under Grant JB190117, in part by the Xidian University Intellifusion Joint Innovation Laboratory of Artificial Intelligence, and in part by the Innovation Fund of Xidian University.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, pp. 2672–2680 (2014)
Google Scholar
Tang, X., Wang, X.: Face photo recognition using sketch. In Proceedings. International Conference on Image Processing, vol. 1, pp. I. IEEE (2002)
Google Scholar
Liu, Q., Tang, X., Jin, H., Lu, H., Ma, S.: A nonlinear approach for face sketch synthesis and recognition. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 1005–1010. IEEE (2005)
Google Scholar
Gao, X., Wang, N., Tao, D., Li, X.: Face sketch-photo synthesis and retrieval using sparse representation. IEEE Trans. Circuits Syst. Video Technol. 22(8), 1213–1226 (2012)
Article Google Scholar
Wang, N., Zhu, M., Li, J., Song, B., Li, Z., Fast face sketch synthesis: Data-driven vs. model-driven. Neurocomputing 257, 214–221 (2017)
Article Google Scholar
Wang, N., Gao, X., Li, J.: Random sampling for fast face sketch synthesis. Pattern Recogn. 76, 215–227 (2018)
Article Google Scholar
Zhang, L., Lin, L., Wu, X., Ding, S., Zhang, L.: End-to-end photo-sketch generation via fully convolutional representation learning. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 627–634 (2015)
Google Scholar
Wang, N., Zha, W., Li, J., Gao, X.: Back projection: an effective postprocessing method for GAN-based face sketch synthesis. Pattern Recogn. Lett. 107, 59–65 (2018)
Article Google Scholar
Isola, P., Zhu, J.-Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1125–1134 (2017)
Google Scholar
Zhu, J.-Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Google Scholar
Gatys, L., Ecker, A.S., Bethge, M.: Texture synthesis using convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 262–270 (2015)
Google Scholar
Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2414–2423 (2016)
Google Scholar
Gong, C., Shi, H., Liu, T., Zhang, C., Yang, J., Tao, D.: Loss decomposition and centroid estimation for positive and unlabeled learning. IEEE Trans. Pattern Anal. Mach. Intell., 1 (2019)
Google Scholar
Gong, C., Tao, D., Liu, W., Liu, L., Yang, J.: Label propagation via teaching-to-learn and learning-to-teach. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1452–1465 (2016)
Article Google Scholar
Gong, C., Tao, D., Maybank, S.J., Liu, W., Kang, G., Yang, J.: Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans. Image Proces. 25(7), 3249–3260 (2016)
Google Scholar
Wang, L., Sindagi, V., Patel, V.: High-quality facial photo-sketch synthesis using multi-adversarial networks. In: 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pp. 83–90. IEEE (2018)
Google Scholar
Zhang, S., Ji, R., Jie, H., Xiaoqiang, L., Li, X.: Face sketch synthesis by multidomain adversarial learning. IEEE Trans. Neural Netw. Learn. Syst. 30(5), 1419–1428 (2018)
Article Google Scholar
Yu, J., Xu, X., Gao, F., Shi, S., Wang, M., Tao, D., Huang, Q.: Toward realistic face photo-sketch synthesis via composition-aided GANs. IEEE Trans. Cybern., 1–13 (2020)
Google Scholar
Chen, C., Liu, W., Tan, X., Wong, K.-Y.K.: Semi-supervised learning for face sketch synthesis in the wild. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) ACCV 2018. LNCS, vol. 11361, pp. 216–231. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20887-5_14
Chapter Google Scholar
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_43
Chapter Google Scholar
Wang, X., Tang, X.: Face photo-sketch synthesis and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(11), 1955–1967 (2008)
Article Google Scholar
Fan, D.-P., et al.: Scoot: a perceptual metric for facial sketches. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5612–5622 (2019)
Google Scholar
Chen, L.-F., Liao, H.-Y.M., Ko, M.-T., Lin, J.-C., Yu, G.-J.: A new LDA-based face recognition system which can solve the small sample size problem. Pattern Recogn. 33(10), 1713–1726 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an, 710071, China
Keyu Li & Nannan Wang
State Key Laboratory of Integrated Services Networks, School of Electronic Engineering, Xidian University, Xi’an, 710071, China
Xinbo Gao
Chongqing Key Laboratory of Image Cognition, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
Xinbo Gao

Authors

Keyu Li
View author publications
You can also search for this author in PubMed Google Scholar
Nannan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xinbo Gao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nannan Wang .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Yuxin Peng
Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Dalian University of Technology, Dalian, China
Huchuan Lu
Chinese Academy of Sciences, Beijing, China
Zhenan Sun
Chinese Academy of Sciences, Beijing, China
Chenglin Liu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xilin Chen
Peking University, Beijing, China
Hongbin Zha
Nanjing University of Science and Technology, Nanjing, China
Jian Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, K., Wang, N., Gao, X. (2020). Feature Space Based Loss for Face Photo-Sketch Synthesis. In: Peng, Y., et al. Pattern Recognition and Computer Vision. PRCV 2020. Lecture Notes in Computer Science(), vol 12305. Springer, Cham. https://doi.org/10.1007/978-3-030-60633-6_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-60633-6_31
Published: 11 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60632-9
Online ISBN: 978-3-030-60633-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics