skip to main content
10.1145/3635638.3635663acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmlmiConference Proceedingsconference-collections
research-article

Swarm GAN: Stabilizing Training of Generative Adversarial Networks via Swarm Intelligence

Published: 16 January 2024 Publication History

Abstract

Generative adversarial networks (GANs) have seen significant research interest over the past decade, yet core issues of training instability and mode collapse persist. This work proposes SwarmGAN, a novel GAN framework incorporating swarm intelligence to address these limitations. Specifically, swarm intelligence exhibits properties well-suited to enhance GAN training: emergent complex behaviors arising from simple individual agents, decentralized adaptability to instantaneous data and hyperparameters, and robustness through simple iterative interactions. SwarmGAN incorporates a particle swarm optimization algorithm to guide generator and discriminator updates. Convolutional neural network architectures and gradient penalties further ensure baseline generation quality and diversity. Extensive experiments over diverse image datasets demonstrate the effectiveness of SwarmGAN. Quantitative evaluations using Fréchet Inception Distance, Inception Score, Peak Signal-to-Noise Ratio, and Structural Similarity Index Score validate performance improvements across stability, sample quality, and convergence speed. The proposed integration of swarm intelligence into adversarial networks shows promising capability to address long-standing GAN challenges.

References

[1]
Martin Arjovsky, Soumith Chintala, and Léon Bottou. 2017. Wasserstein generative adversarial networks. In International conference on machine learning. PMLR, 214–223.
[2]
Andrew Brock, Jeff Donahue, and Karen Simonyan. 2018. Large scale GAN training for high fidelity natural image synthesis. arXiv preprint arXiv:1809.11096 (2018).
[3]
Xi Chen, Yan Duan, Rein Houthooft, John Schulman, Ilya Sutskever, and Pieter Abbeel. 2016. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.). Vol. 29. Curran Associates, Inc.https://proceedings.neurips.cc/paper_files/paper/2016/file/7c9d0b1f96aebd7b5eca8c3edaa19ebb-Paper.pdf
[4]
Gustavo H. de Rosa and João P. Papa. 2021. A survey on text generation using generative adversarial networks. Pattern Recognition 119 (2021), 108098. https://doi.org/10.1016/j.patcog.2021.108098
[5]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. Advances in neural information processing systems 27 (2014).
[6]
Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, and Aaron C Courville. 2017. Improved training of wasserstein gans. Advances in neural information processing systems 30 (2017).
[7]
Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, and Youngjung Uh. 2021. Exploiting Spatial Dimensions of Latent in GAN for Real-Time Image Editing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 852–861.
[8]
Qunwei Li, Bhavya Kailkhura, Rushil Anirudh, Yi Zhou, Yingbin Liang, and Pramod Varshney. 2018. MR-GAN: Manifold Regularized Generative Adversarial Networks. arxiv:1811.10427 [cs.LG]
[9]
Mehdi Mirza and Simon Osindero. 2014. Conditional Generative Adversarial Nets. arxiv:1411.1784 [cs.LG]
[10]
Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. 2018. Spectral Normalization for Generative Adversarial Networks. arxiv:1802.05957 [cs.LG]
[11]
Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015).
[12]
Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, Xi Chen, and Xi Chen. 2016. Improved Techniques for Training GANs. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.). Vol. 29. Curran Associates, Inc.https://proceedings.neurips.cc/paper_files/paper/2016/file/8a3363abe792db2d8761d6403605aeb7-Paper.pdf
[13]
Siphesihle P Sithungu and Elizabeth M Ehlers. 2022. GAAINet: A Generative Adversarial Artificial Immune Network Model for Intrusion Detection in Industrial IoT Systems. Journal of Advances in Information Technology Vol 13, 5 (2022).
[14]
Chaoyue Wang, Chang Xu, Xin Yao, and Dacheng Tao. 2019. Evolutionary generative adversarial networks. IEEE Transactions on Evolutionary Computation 23, 6 (2019), 921–934.
[15]
Tengfei Wang, Yong Zhang, Yanbo Fan, Jue Wang, and Qifeng Chen. 2022. High-Fidelity GAN Inversion for Image Attribute Editing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 11379–11388.
[16]
Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient. Proceedings of the AAAI Conference on Artificial Intelligence 31, 1 (Feb. 2017). https://doi.org/10.1609/aaai.v31i1.10804
[17]
Han Zhang, Ian Goodfellow, Dimitris Metaxas, and Augustus Odena. 2019. Self-Attention Generative Adversarial Networks. In Proceedings of the 36th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 7354–7363. https://proceedings.mlr.press/v97/zhang19d.html
[18]
Long Zhang and Lin Zhao. 2021. High-quality face image generation using particle swarm optimization-based generative adversarial networks. Future Generation Computer Systems 122 (2021), 98–104.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
MLMI '23: Proceedings of the 6th International Conference on Machine Learning and Machine Intelligence
October 2023
196 pages
ISBN:9798400709456
DOI:10.1145/3635638
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 January 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Generative Adversarial Networks
  2. Swarm Intelligence
  3. machine learning
  4. unsupervised learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

MLMI 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 63
    Total Downloads
  • Downloads (Last 12 months)59
  • Downloads (Last 6 weeks)8
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media