Abstract
Recently, GANs have become popular for synthesizing photorealistic facial images with desired facial attributes. Crucial to the success of such networks, however, is the availability of large-scale datasets that are fully-attributed, i.e., datasets in which the Cartesian product of all attribute values is present; otherwise the learning becomes skewed. Such fully-attributed datasets are impractically expensive to collect. Many existing datasets are only partially-attributed and have no subjects in common, so it is important to be able to learn jointly from them. In this paper, we propose a GAN-based facial image generator that can be trained on partially-attributed, disjoint datasets. The key idea is to use a smaller, fully-attributed dataset to bridge the learning. Our generator (i) provides independent control of multiple attributes, and (ii) renders photorealistic facial images with the target attributes.
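The paper's implementation is not reproduced on this page, but the following is a minimal, hypothetical PyTorch sketch of the two claims in the abstract: the generator consumes an identity latent together with one learned embedding per attribute, so each attribute can be varied independently, and training batches would mix the two partially-attributed datasets with the small fully-attributed bridge set. All names, dimensions, and the toy decoder are illustrative assumptions, not the authors' architecture.

```python
# Hypothetical sketch (not the authors' code): a conditional generator whose
# input concatenates an identity latent with one learned embedding per
# attribute, so a single attribute can be changed while the rest stay fixed.
import torch
import torch.nn as nn

class AttributeGenerator(nn.Module):
    def __init__(self, z_dim=128, attr_sizes=(2, 7, 5), attr_dim=16):
        super().__init__()
        # One embedding table per attribute (e.g. expression, pose,
        # illumination); the cardinalities here are made up.
        self.embeds = nn.ModuleList([nn.Embedding(n, attr_dim) for n in attr_sizes])
        in_dim = z_dim + attr_dim * len(attr_sizes)
        # Stand-in MLP decoder; a real model would use a deconvolutional net.
        self.net = nn.Sequential(
            nn.Linear(in_dim, 256), nn.ReLU(),
            nn.Linear(256, 64 * 64 * 3), nn.Tanh(),
        )

    def forward(self, z, attrs):
        # Each attribute embedding is L2-normalized before use, mirroring
        # Note 1 below.
        codes = [nn.functional.normalize(emb(col), dim=1)
                 for emb, col in zip(self.embeds, attrs.T)]
        x = torch.cat([z] + codes, dim=1)
        return self.net(x).view(-1, 3, 64, 64)

gen = AttributeGenerator()
z = torch.randn(4, 128)                # identity latent
attrs = torch.tensor([[0, 3, 1]] * 4)  # one discrete label per attribute
imgs = gen(z, attrs)                   # -> (4, 3, 64, 64)
# Changing a single column of `attrs` alters only that attribute.
```

In a full training loop, adversarial (and typically attribute-supervision) losses would be applied over batches drawn from both partially-attributed datasets plus the bridge set; the bridge set is the only place where all attribute combinations co-occur, which is what lets the disjoint datasets share one generator.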
This research is supported by the National Research Foundation, Prime Minister’s Office, Singapore under its Strategic Capability Research Centres Funding Initiative.
Notes
1. For convenience, we normalize each feature vector in the embedding subspaces of the network.
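Assuming this note refers to standard L2 normalization, it means each embedding vector is rescaled to unit length before being consumed by the rest of the network:

```python
import torch
import torch.nn.functional as F

feats = torch.randn(8, 16)             # a batch of embedding-subspace vectors
unit = F.normalize(feats, p=2, dim=1)  # rescale each row to unit L2 norm
assert torch.allclose(unit.norm(dim=1), torch.ones(8), atol=1e-5)
```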
Cite this paper
Li, J., Wong, Y., Sim, T. (2019). Learning Controllable Face Generator from Disjoint Datasets. In: Vento, M., Percannella, G. (eds) Computer Analysis of Images and Patterns. CAIP 2019. Lecture Notes in Computer Science, vol. 11678. Springer, Cham. https://doi.org/10.1007/978-3-030-29888-3_17