Skip to main content

Structure-Aware Photorealistic Style Transfer Using Ghost Bottlenecks

  • Conference paper
  • First Online:
Pattern Recognition and Artificial Intelligence (ICPRAI 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13363))

  • 1678 Accesses

Abstract

Photorealistic style transfer synthesizes a new image from a pair of content and style images. The transfer should express the visual patterns in the former while preserving the content details following the latter. However, most existing methods may generate an image that suffers disrupted content details and unexpected visual cues from the style image; hence, they do not satisfy the photorealism. We tackle this issue with a new style transfer architecture that effectively unifies a content encoder, an Xception style encoder, a Ghost Bottlenecks subnet, and a decoder. In our framework, the style features extracted from the Xception module balance well with the content features obtained from an encoder; the Ghost Bottlenecks subnet then integrates these features and feeds them into a decoder to produce the resulting style transferred image. Experimental results demonstrate that our model surmounts the structure distortion problem to satisfy photorealistic style transfer and hence obtains impressive visual effects.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/anishathalye/neural-style.

  2. 2.

    https://github.com/naoto0804/pytorch-AdaIN.

  3. 3.

    https://github.com/JianqiangRen/AAMS.

  4. 4.

    https://github.com/mingsun-tse/collaborative-distillation.

  5. 5.

    https://github.com/Aaditya-Singh/SAFIN.

References

  1. Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)

    Google Scholar 

  2. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: International Conference on Learning Representations (2015)

    Google Scholar 

  3. Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: GhostNet: more features from cheap operations. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

    Google Scholar 

  4. Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

    Google Scholar 

  5. Heeger, D.J., Bergen, J.R.: Pyramid-based texture analysis/synthesis. In: Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques, pp. 229–238 (1995)

    Google Scholar 

  6. Efros, A.A., Freeman, W.T.: Image quilting for texture synthesis and transfer. In: Proceedings of the 28nd Annual Conference on Computer Graphics and Interactive Techniques, pp. 341–346 (2001)

    Google Scholar 

  7. Ulyanov, D., Vedaldi, A., Lempitsky, V.: Improved texture networks: maximizing quality and diversity in feed-forward stylization and texture synthesis. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

    Google Scholar 

  8. Chen, T.Q., Schmidt, M.: Fast Patch-Based Style Transfer of Arbitrary Style. arXiv preprint arXiv:1612.04337 (2016)

  9. Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: European Conference on Computer Vision (ECCV) (2016)

    Google Scholar 

  10. Li, Y., Fang, C., Yang, J., Wang, Z., Lu, X., Yang, M.-H.: Universal style transfer via feature transforms. In: Proceedings of the 31st International Conference on Neural Information Processing Systems (NeurIPS) (2017)

    Google Scholar 

  11. Huang, X., Belongie, S.: Arbitrary style transfer in real-time with adaptive instance normalization. In: IEEE International Conference on Computer Vision (ICCV) (2017)

    Google Scholar 

  12. Li, S., Xu, X., Nie, L., Chua, T.-S.: Laplacian-steered neural style transfer. In: Proceedings of the 25th ACM international conference on Multimedia (2017)

    Google Scholar 

  13. Cheng, M.-M., Liu, X.-C., Wang, J., Lu, S.-P., Lai, Y.-K., Rosin, P.L.: Structure-preserving neural style transfer. IEEE Trans. Image Process. 29, 909–920 (2020)

    Article  MATH  Google Scholar 

  14. Ranzato, M., Huang, F.-J., Boureau, Y.-L., LeCun, Y.: Unsupervised learning of invariant feature hierarchies with applications to object recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2007)

    Google Scholar 

  15. Kingma, D.P., Welling, M.: Auto-encoding variational Bayes. In: International Conference on Learning Representations (2014)

    Google Scholar 

  16. Goodfellow, I.J., et al.: Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS), vol. 2, pp. 2672–2680 (2014)

    Google Scholar 

  17. Khan, A., Sohail, A., Zahoora, U., Qureshi, A.S.: A survey of the recent architectures of deep convolutional neural networks. In: Artificial Intelligence Review (2020)

    Google Scholar 

  18. Ulyanov, D., Vedaldi, A., Lempitsky, V.: Instance Normalization: The Missing Ingredient for Fast Stylization. arXiv preprint arXiv:1607.08022 (2016)

  19. Li, X., Liu, S., Kautz, J., Yang, M.-H.: Learning linear transformations for fast image and video style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)

    Google Scholar 

  20. Huang, X., Liu, M.-Y., Belongie, S., Kautz, J.: Multimodal unsupervised image-to-image translation. In: European Conference on Computer Vision (ECCV) (2018)

    Google Scholar 

  21. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on International Conference on Machine Learning (ICML), vol. 37, pp. 448–456 (2015)

    Google Scholar 

  22. Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: European Conference on Computer Vision (ECCV) (2014)

    Google Scholar 

  23. Nichol, K.: Painter by numbers, wikiart. https://www.kaggle.com/c/painter-by-numbers

  24. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) (2015)

    Google Scholar 

  25. Yao, Y., Ren, J., Xie, X., Liu, W., Liu, Y.-J., Wang, J.: Attention-aware multi-stroke style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2019)

    Google Scholar 

  26. Wang, H., Li, Y., Wang, Y., Hu, H., Yang, M.: Collaborative distillation for ultra-resolution universal style transfer. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

    Google Scholar 

  27. Singh, A., Hingane, S., Gong, X., Wang, Z.: SAFIN: arbitrary style transfer with self-attentive factorized instance normalization. In: IEEE International Conference on Multimedia and Expo (ICME) (2021)

    Google Scholar 

  28. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. In: IEEE Transactions on Image Processing, vol. 13, pp. 600–612 (2004)

    Google Scholar 

  29. Canny, J.: A computational approach to edge detection. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 679–698 (1986)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ngoc-Thao Nguyen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bui, NT., Nguyen, NT., Cao, XN. (2022). Structure-Aware Photorealistic Style Transfer Using Ghost Bottlenecks. In: El Yacoubi, M., Granger, E., Yuen, P.C., Pal, U., Vincent, N. (eds) Pattern Recognition and Artificial Intelligence. ICPRAI 2022. Lecture Notes in Computer Science, vol 13363. Springer, Cham. https://doi.org/10.1007/978-3-031-09037-0_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-09037-0_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-09036-3

  • Online ISBN: 978-3-031-09037-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics