skip to main content
10.1145/3653081.3653092acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiotaaiConference Proceedingsconference-collections
research-article

Sketch2RealGAN: A Conditional GAN-based Method for Generating Clothing Images from Sketches

Published:03 May 2024Publication History

ABSTRACT

This algorithm is an innovative algorithm that can automatically generate clothing images from clothing sketches. In order to improve the generation performance, the algorithm uses feature extraction network to extract semantic information from the original sketch, and uses semantic feature encoder to encode these semantic information into feature tensors. High quality clothing image generation is realized by input both the original sketch and the feature tensor into the conditional generation adversarial network. In addition, a two-stage generation algorithm is proposed to generate clothing images from original sketches. The algorithm uses the same model structure in both stages, and finally realizes the clothing image generation by gradually generating the intermediate image.

References

  1. Mirza M, Osindero S. Conditional generative adversarial nets [J]. ar**v preprint ar**v:1411.1784, 2014.Google ScholarGoogle Scholar
  2. Radford A, Metz L, Chintala S. Unsupervised representation learning with deep convolutional generative adversarial networks [J]. ar**v preprint ar**v:1511.06434, 2015.Google ScholarGoogle Scholar
  3. Arjovsky M, Chintala S, Bottou L. Wasserstein generative adversarial networks [C]//International conference on machine learning, 2017: 214-223.Google ScholarGoogle Scholar
  4. Gulrajani I, Ahmed F, Arjovsky M, Improved training of wasserstein gans [J]. Advances in neural information processing systems, 2017, 30.Google ScholarGoogle Scholar
  5. Wu J, Huang Z, Thoma J, Wasserstein divergence for gans [C]//Proceedings of the European conference on computer vision (ECCV), 2018: 653-668.Google ScholarGoogle Scholar
  6. Karras T, Aila T, Laine S, Progressive growing of gans for improved quality, stability, and variation [J]. ar**v preprint ar**v:1710.10196, 2017.Google ScholarGoogle Scholar
  7. Karras T, Laine S, Aila T. A style-based generator architecture for generative adversarial networks [C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019: 4401-4410.Google ScholarGoogle Scholar
  8. Karras T, Laine S, Aittala M, Analyzing and improving the image quality of stylegan [C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020: 8110-8119.Google ScholarGoogle Scholar
  9. Isola P, Zhu J-Y, Zhou T, Image-to-image translation with conditional adversarial networks [C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2017: 1125-1134.Google ScholarGoogle Scholar
  10. Wang T-C, Liu M-Y, Zhu J-Y, High-resolution image synthesis and semantic manipulation with conditional gans [C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 8798-8807.Google ScholarGoogle Scholar
  11. Zhu J-Y, Park T, Isola P, Unpaired image-to-image translation using cycle-consistent adversarial networks [C]//Proceedings of the IEEE international conference on computer vision, 2017: 2223-2232.Google ScholarGoogle Scholar
  12. Liu M Y, Breuel T, Kautz J. Unsupervised image-to-image translation networks [J]. Advances in neural information processing systems, 2017, 30.Google ScholarGoogle Scholar
  13. Liu M Y, Tuzel O. Coupled generative adversarial networks [J]. Advances in neural information processing systems, 2016, 29.Google ScholarGoogle Scholar
  14. Huang X, Liu M-Y, Belongie S, Multimodal unsupervised image-to-image translation [C]//Proceedings of the European conference on computer vision (ECCV), 2018: 172-189.Google ScholarGoogle Scholar
  15. Choi Y, Choi M, Kim M, Stargan: Unified generative adversarial networks for multi-domain image-to-image translation [C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 8789-8797.Google ScholarGoogle Scholar
  16. Chen W, Hays J. Sketchygan: Towards diverse and realistic sketch to image synthesis [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 9416-9425.Google ScholarGoogle Scholar
  17. Sandler M, Howard A, Zhu M, Mobilenetv2: Inverted residuals and linear bottlenecks [C]//Proceedings of the IEEE conference on computer vision and pattern recognition, 2018: 4510-4520.Google ScholarGoogle Scholar
  18. Deng J, Dong W, Socher R, Imagenet: A large-scale hierarchical image database [C]//2009 IEEE conference on computer vision and pattern recognition, 2009: 248-255.Google ScholarGoogle Scholar
  19. Park T, Liu M-Y, Wang T-C, Semantic image synthesis with spatially-adaptive normalization [C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019: 2337-2346.Google ScholarGoogle Scholar
  20. Lei H, Chen S, Wang M, A new algorithm for sketch-based fashion image retrieval based on cross-domain transformation [J]. Wireless Communications and Mobile Computing, 2021, 2021: 1-14.Google ScholarGoogle Scholar
  21. Lee H-Y, Tseng H-Y, Huang J-B, Diverse image-to-image translation via disentangled representations [C]//Proceedings of the European conference on computer vision (ECCV), 2018: 35-51.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    IoTAAI '23: Proceedings of the 2023 5th International Conference on Internet of Things, Automation and Artificial Intelligence
    November 2023
    902 pages
    ISBN:9798400716485
    DOI:10.1145/3653081

    Copyright © 2023 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 3 May 2024

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)2

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format