Human-in-the-loop differential subspace search in high-dimensional latent space

Published: 12 August 2020
Abstract

Generative models based on deep neural networks often have a high-dimensional latent space, sometimes with a few hundred dimensions or more, which typically makes it hard for a user to explore directly. We propose differential subspace search to allow efficient iterative user exploration in such a space, without relying on domain- or data-specific assumptions. We develop a general framework to extract low-dimensional subspaces based on a local differential analysis of the generative model, such that a small change within such a subspace produces a sufficiently large change in the resulting data. We do so by applying singular value decomposition to the Jacobian of the generative model and forming a subspace of the desired dimensionality, spanned by a given number of singular vectors that are stochastically selected on the basis of their singular values to maintain ergodicity. We use our framework to present 1D subspaces to the user via a 1D slider interface. Starting from an initial location, the user finds a new candidate in the presented 1D subspace, and the subspace is then updated at the new candidate location. This process is repeated until no further improvement can be made. Numerical simulations show that our method can better optimize synthetic black-box objective functions than the alternatives that we tested. Furthermore, we conducted a user study using complex generative models, and the results show that our method enables more efficient exploration of high-dimensional latent spaces than the alternatives.

Supplemental Material

a85-chiu.mp4 (mp4, 24 MB)

3386569.3392409.mp4: Presentation video (mp4, 812.7 MB)

      • Published in

        ACM Transactions on Graphics, Volume 39, Issue 4
        August 2020
        1732 pages
        ISSN:0730-0301
        EISSN:1557-7368
        DOI:10.1145/3386569
        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States
