SCDAE: Ethnicity and Gender Alteration on CLF and UTKFace Dataset

Chandaliya, Praveen Kumar; Kumar, Vardhman; Harjani, Mayank; Nain, Neeta

doi:10.1007/978-981-15-4018-9_27

Praveen Kumar Chandaliya⁹,
Vardhman Kumar⁹,
Mayank Harjani⁹ &
…
Neeta Nain⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1148))

Included in the following conference series:

International Conference on Computer Vision and Image Processing

852 Accesses
6 Citations

Abstract

Global face attributes like Gender, Ethnicity, and Age are attracting attention due to their specific explanation of human faces. Mostly prior face attribute alteration works are on large-scale CelebA and LFW dataset. We address more challenging problem called global face attribute alteration on data sets like CLF and UTKFace. Our approach is based on sampling with global condition attribute. It consists of five components Encoder (\(E_{Z}\)), Encoder (\(E{_Y}\)), Sampling (S), Latent Space (ZL), and Decoder (D). The \(E{_Z}\) with S component is responsible to generate structured latent vector Z and \(E_{Y}\) produces condition vector L which we modify according to desired condition, latent vector Z and modified condition vector L are concatenated to make Latent Space ZL to help global face attribute alteration and Decoder D is used to generate modified images. We trained our SCDAE (Sampling and Condition based Deep AutoEncoder) model for gender and ethnicity alteration on CLF and UTKFace dataset. Both qualitative and quantitative experiments show that our approach can alter untouched global attributes and generates more realistic faces in term of person identity and age uniformity which is comparable to human observation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Hinton, G., Salakhutdinov, R.: Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006)
Article MathSciNet Google Scholar
Kingma, D.P., Welling, M.: Auto-encoding variational bayes. In: ICLR (2014)
Google Scholar
Goodfellow, I., et al.: Generative adversarial nets. In: Advances in Neural Information Processing Systems, vol. 27, pp. 2672–2680. Curran Associates Inc. (2014)
Google Scholar
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. In: ICLR (2016)
Google Scholar
Zhu, J., Krähenbühl, P., Shechtman, E., Efros, A.A.: Generative visual manipulation on the natural image manifold. CoRR, vol. abs/1609.03552 (2016)
Google Scholar
Xu, Z., Yang, X., Li, X., Sun, X.: The effectiveness of instance normalization: a strong baseline for single image dehazing. CoRR, vol. abs/1805.03305 (2018)
Google Scholar
Gatys, L.A., Ecker, A.S., Bethge, M.: A neural algorithm of artistic style. CoRR, vol. abs/1508.06576 (2015)
Google Scholar
Shi, Y., Debayan, D., Jain, A.K.: WarpGAN: automatic caricature generation (2018)
Google Scholar
Reed, S.E., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., Lee, H.: Generative adversarial text to image synthesis. CoRR, vol. abs/1605.05396 (2016)
Google Scholar
Mirza, M., Osindero, S.: Conditional generative adversarial nets. CoRR, vol. abs/1411.1784 (2014)
Google Scholar
Liu, Z., Luo, P., Wang, X., Tang, X.: Deep learning face attributes in the wild. In: Proceedings of International Conference on Computer Vision (ICCV) (2015)
Google Scholar
Huang, G.B., Mattar, M., Lee, H., Learned-Miller, E.: Learning to align from scratch. In: NIPS (2012)
Google Scholar
Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Attribute and simile classifiers for face verification. In: IEEE International Conference on Computer Vision ICCV (2009)
Google Scholar
Kumar, N., Belhumeur, P., Nayar, S.: FaceTracer: a search engine for large collections of images with faces. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5305, pp. 340–353. Springer, Berlin (2008). https://doi.org/10.1007/978-3-540-88693-8_25
Chapter Google Scholar
Chandaliya, P.K., Garg, P., Nain, N.: Retrieval of facial images re-rendered with natural aging effect using child facial image and age. In: The 14th International Conference on Signal Image Technology and Internet Based System, Spain, 26–29 November 2018, pp. 457–464 (2018)
Google Scholar
Song, Y., Zhang, Z., Qi, H.: Age progression/regression by conditional adversarial autoencoder. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2017)
Google Scholar
Deb, D., Nain, N., Jain, A.K.: Longitudinal study of child face recognition. In: 2018 International Conference on Biometrics, ICB 2018, Gold Coast, Australia, 20–23 February 2018, pp. 225–232 (2018)
Google Scholar
Zhang, Z., Song, Y., Qi, H.: Age progression/regression by conditional adversarial autoencoder. In: CVPR, pp. 4352–4360. IEEE Computer Society (2017)
Google Scholar
Zheng, X., Guo, Y., Huang, H., Li, Y., He, R.: A survey to deep facial attribute analysis. CoRR, vol. abs/1812.10265 (2018)
Google Scholar
Chen, X., Duan, Y., Houthooft, R., Schulman, J., Sutskever, I., Abbeel, P.: InfoGAN: interpretable representation learning by information maximizing generative adversarial nets. CoRR, vol. abs/1606.03657 (2016)
Google Scholar
Li, M., Zuo, W., Zhang, D.: Deep identity-aware transfer of facial attributes. CoRR, vol. abs/1610.05586 (2016)
Google Scholar
Liu, M., Breuel, T., Kautz, J.: Unsupervised image-to-image translation networks. CoRR, vol. abs/1703.00848 (2017)
Google Scholar
Shen, W., Liu, R.: Learning residual images for face attribute manipulation. CoRR, vol. abs/1612.05363 (2016)
Google Scholar
Wang, Y., Wang, S., Qi, G., Tang, J., Li, B.: Weakly supervised facial attribute manipulation via deep adversarial network. In: WACV, pp. 112–121. IEEE Computer Society (2018)
Google Scholar
Zhang, G., Kan, M., Shan, S., Chen, X.: Generative adversarial network with spatial attention for face attribute editing. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11210, pp. 422–437. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01231-1_26
Chapter Google Scholar
Larsen, A.B.L., Sønderby, S.K., Winther, O.: Autoencoding beyond pixels using a learned similarity metric. CoRR, vol. abs/1512.09300 (2015)
Google Scholar
Yan, X., Yang, J., Sohn, K., Lee, H.: Attribute2Image: conditional image generation from visual attributes. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 776–791. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_47
Chapter Google Scholar
Perarnau, G., van de Weijer, J., Raducanu, B., Álvarez, J.M.: Invertible conditional GANs for image editing. CoRR, vol. abs/1611.06355 (2016)
Google Scholar
Lample, G., Zeghidour, N., Usunier, N., Bordes, A., Denoyer, L., Ranzato, M.: Fader networks: manipulating images by sliding attributes, pp. 5969–5978 (2017)
Google Scholar
Chandaliya, P.K., Nain, N.: Conditional perceptual adversarial variational autoencoder for age progression and regression on children face. In: The 12th IAPR International Conference On Biometrics, Crete Greece, 4–7 June 2019, pp. 200–208 (2019)
Google Scholar
Lu, Y., Tai, Y.-W., Tang, C.-K.: Attribute-guided face generation using conditional cycleGAN. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11216, pp. 293–308. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01258-8_18
Chapter Google Scholar
Choi, Y., Choi, M., Kim, M., Ha, J., Kim, S., Choo, J.: StarGAN: unified generative adversarial networks for multi-domain image-to-image translation. In: 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, 18–22 June 2018, pp. 8789–8797 (2018)
Google Scholar
Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)
Article Google Scholar
Faceplus. https://www.faceplusplus.com

Download references

Acknowledgments

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the TITAN V GPU used for this research.

Author information

Authors and Affiliations

Malaviya National Institute of Technology Jaipur, Jaipur, India
Praveen Kumar Chandaliya, Vardhman Kumar, Mayank Harjani & Neeta Nain

Authors

Praveen Kumar Chandaliya
View author publications
You can also search for this author in PubMed Google Scholar
Vardhman Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Mayank Harjani
View author publications
You can also search for this author in PubMed Google Scholar
Neeta Nain
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Praveen Kumar Chandaliya .

Editor information

Editors and Affiliations

Malaviya National Institute of Technology, Jaipur, Rajasthan, India
Neeta Nain
Malaviya National Institute of Technology, Jaipur, Rajasthan, India
Santosh Kumar Vipparthi
Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Balasubramanian Raman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chandaliya, P.K., Kumar, V., Harjani, M., Nain, N. (2020). SCDAE: Ethnicity and Gender Alteration on CLF and UTKFace Dataset. In: Nain, N., Vipparthi, S., Raman, B. (eds) Computer Vision and Image Processing. CVIP 2019. Communications in Computer and Information Science, vol 1148. Springer, Singapore. https://doi.org/10.1007/978-981-15-4018-9_27

Download citation

DOI: https://doi.org/10.1007/978-981-15-4018-9_27
Published: 29 March 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4017-2
Online ISBN: 978-981-15-4018-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics