Abstract
High-fidelity image generation has been an active research topic in recent years. It serves as a benchmark for the performance of image decoders. Autoregressive image models generate small images successfully, but scaling them to larger images has been a problem. The challenges include encoding the vast previous context and learning a distribution that preserves both global semantic coherence and exactness of detail. These issues have been addressed through the subscale pixel network (SPN) and multidimensional upscaling. To improve accuracy further, this work presents a hierarchical image generation model. It disentangles background, object shape and appearance to hierarchically generate images of fine-grained object categories. To achieve this, information theory is used to associate each factor with a latent code, and the relationships between the codes are conditioned to induce the hierarchy. The features learned by the hierarchical model are used to cluster real images. Experimental results on the ImageNet and CelebAHQ datasets at different image sizes show that the hierarchical model outperforms the benchmarks. The generated images show better fidelity for large-scale samples.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Chaudhuri, A., Ghosh, S.K. (2020). Hierarchical Approach Towards High Fidelity Image Generation. In: Silhavy, R. (eds) Artificial Intelligence and Bioinspired Computational Methods. CSOC 2020. Advances in Intelligent Systems and Computing, vol 1225. Springer, Cham. https://doi.org/10.1007/978-3-030-51971-1_20
DOI: https://doi.org/10.1007/978-3-030-51971-1_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51970-4
Online ISBN: 978-3-030-51971-1
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)