Explaining StyleGAN Synthesized Swimmer Images in Low-Dimensional Space

Mansouri Yarahmadi, Ashkan; Breuß, Michael; Khan Mohammadi, Mohsen

doi:10.1007/978-3-031-44237-7_16

Ashkan Mansouri Yarahmadi¹⁵,
Michael Breuß¹⁵ &
Mohsen Khan Mohammadi¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14184))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

403 Accesses

Abstract

In many existing AI methods, the reasons behind the decisions made by a trained model are not easy to explain. This often leads to a black-box design that is not interpretable, which makes it a delicate issue to adopt such methods in an application related to safety. We consider generative adversarial networks that are often used to generate data for further use in deep learning applications where not much data is available. In particular, we deal with the StyleGAN approach for generating synthetic observations of swimmers. This paper provides a pipeline that can clearly explain the synthesized images after projecting them to a lower dimensional space. These understood images can later be chosen to train a swimmer safety observation framework. The main goal of our paper is to achieve a higher level of abstraction by which one can explain the variation of synthesized swimmer images in low dimension space. A standard similarity measure is used to evaluate our pipeline and validate a low intra-class variation of established swimmer clusters representing similar swimming style within a low dimensional space.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alaluf, Y., Patashnik, O., Cohen-Or, D.: ReStyle: a residual-based StyleGAN encoder via iterative refinement (2021)
Google Scholar
Ballard, A., et al.: Energy landscapes for machine learning. Phys. Chem. 19, 12585–12603 (2017)
Google Scholar
Collins, E., Bala, R., Price, B., Süsstrunk, S.: Editing in style: uncovering the local semantics of GANs. arXiv (2020). https://arxiv.org/abs/2004.14367
Goodfellow, I., et al.: Generative adversarial networks. arXiv (2014). https://arxiv.org/abs/1406.2661
Hartigan, J., Wong, M.: A k-means clustering algorithm. JSTOR: Appl. Stat. 28, 100–108 (1979)
MATH Google Scholar
Horoi, S., Huang, J., Wolf, G., Krishnaswamy, S.: Visualizing high-dimensional trajectories on the loss-landscape of ANNs (2021)
Google Scholar
Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006)
Article MathSciNet MATH Google Scholar
Hong, S., et al.: 3D-StyleGAN: a style-based generative adversarial network for generative modeling of three-dimensional medical images. arXiv (2021). https://arxiv.org/abs/2107.09700
Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks. In: 2019 IEEE/CVF Conference On Computer Vision And Pattern Recognition (CVPR), pp. 4396–4405 (2019)
Google Scholar
Kostic, A., et al.: The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. Cell Host Microbe 20, 121 (2016)
Article Google Scholar
Karras, T., Laine, S., Aila, T.: A style-based generator architecture for generative adversarial networks. CoRR. abs/1812.04948 (2018). https://arxiv.org/abs/1812.04948
Karras, T., et al.: Alias-free generative adversarial networks. arXiv (2021). https://arxiv.org/abs/2106.12423
Karras, T., Aila, T., Laine, S., Lehtinen, J.: Progressive growing of GANs for improved quality, stability, and variation. arXiv (2017). https://arxiv.org/abs/1710.10196
Krizhevsky, A.: Learning multiple layers of features from tiny images (2009)
Google Scholar
Github (2022). https://github.com/datitran/raccoon-dataset/blob/master/generate-tfrecord.py
Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., Aila, T.: Analyzing and improving the image quality of StyleGAN. arXiv (2019). https://arxiv.org/abs/1912.04958
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)
Article Google Scholar
Lee, C., Liu, Z., Wu, L., Luo, P.: MaskGAN: towards diverse and interactive facial image manipulation. In: IEEE Conference on Computer Vision And Pattern Recognition (CVPR) (2020)
Google Scholar
Lewis, K., Varadharajan, S., Kemelmacher-Shlizerman, I.: TryOnGAN: body-aware try-on via layered interpolation. arXiv (2021). https://arxiv.org/abs/2101.02285
Linderman, G., Rachh, M., Hoskins, J., Steinerberger, S., Kluger, Y.: Fast interpolation-based t-SNE for improved visualization of single-cell RNA-seq data. Nat. Methods 16, 1 (2019)
Article Google Scholar
Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
MATH Google Scholar
Moon, K., et al.: Visualizing structure and transitions in high-dimensional biological data (2019)
Google Scholar
Nguyen, A., Yosinski, J., Clune, J.: Deep neural networks are easily fooled: high confidence predictions for unrecognizable images. In: 2015 IEEE Conference On Computer Vision And Pattern Recognition (CVPR), pp. 427–436 (2015)
Google Scholar
Saeed, W., Omlin, C.: Explainable AI (XAI): a systematic meta-survey of current challenges and future opportunities. Knowl.-Based Syst. 263, 110273 (2023)
Google Scholar
Szegedy, C., et al.: Intriguing properties of neural networks (2014)
Google Scholar
Skorokhodov, I., Tulyakov, S., Elhoseiny, M.: StyleGAN-V: a continuous video generator with the price, image quality and perks of StyleGAN2. arXiv (2021). https://arxiv.org/abs/2112.14683
Samaria, F., Harter, A.: Parameterisation of a stochastic model for human face identification. In: IEEE Workshop On Applications Of Computer Vision - Proceedings, vol. 22, pp. 138–142 (1995)
Google Scholar
Song, W., Wang, L., Liu, P., Choo, K.: Improved t-SNE based manifold dimensional reduction for remote sensing data processing. Multimed. Tools Appl. 78, 4311–4326 (2019)
Article Google Scholar
Wu, J., Wang, J., Xiao, H., Ling, J.: Visualization of high dimensional turbulence simulation data using t-SNE (2017)
Google Scholar
Zhang, R.: Making convolutional networks shift-invariant again. arXiv (2019). https://arxiv.org/abs/1904.11486

Download references

Acknowledgement

The authors acknowledge partial funding of their work by Bundesministerium für Digitales und Verkehr (BMDV) within the project RescueFly as well as by Bundesministerium für Bildung und Forschung (BMBF) within the project KI@MINT.

Author information

Authors and Affiliations

Institute of Mathematics, Brandenburg Technical University, Platz der Deutschen Einheit 1, 03046, Cottbus, Germany
Ashkan Mansouri Yarahmadi, Michael Breuß & Mohsen Khan Mohammadi

Authors

Ashkan Mansouri Yarahmadi
View author publications
You can also search for this author in PubMed Google Scholar
Michael Breuß
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen Khan Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohsen Khan Mohammadi .

Editor information

Editors and Affiliations

Cyprus University of Technology, Limassol, Cyprus
Nicolas Tsapatsoulis
Cyprus University of Technology/CYENS Center of Excellence, Limassol, Cyprus
Andreas Lanitis
The University of New Mexico, Albuquerque, NM, USA
Marios Pattichis
University of Cyprus/CYENS Center of Excellence, Nicosia, Cyprus
Constantinos Pattichis
University of Cyprus/KIOS Center of Excellence, Nicosia, Cyprus
Christos Kyrkou
Cyprus University of Technology, Limassol, Cyprus
Efthyvoulos Kyriacou
Cyprus University of Technology/CYENS Center of Excellence, Limassol, Cyprus
Zenonas Theodosiou
CYENS Center of Excellence, Nicosia, Cyprus
Andreas Panayides

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mansouri Yarahmadi, A., Breuß, M., Khan Mohammadi, M. (2023). Explaining StyleGAN Synthesized Swimmer Images in Low-Dimensional Space. In: Tsapatsoulis, N., et al. Computer Analysis of Images and Patterns. CAIP 2023. Lecture Notes in Computer Science, vol 14184. Springer, Cham. https://doi.org/10.1007/978-3-031-44237-7_16

Download citation

DOI: https://doi.org/10.1007/978-3-031-44237-7_16
Published: 20 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44236-0
Online ISBN: 978-3-031-44237-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Explaining StyleGAN Synthesized Swimmer Images in Low-Dimensional Space