research-article

Content-aware generative modeling of graphic design layouts

Published: 12 July 2019

Abstract

Layout is fundamental to graphic designs. For visual attractiveness and efficient communication of messages and ideas, graphic design layouts often have great variation, driven by the contents to be presented. In this paper, we study the problem of content-aware graphic design layout generation. We propose a deep generative model for graphic design layouts that is able to synthesize layout designs based on the visual and textual semantics of user inputs. Unlike previous approaches that are oblivious to the input contents and rely on heuristic criteria, our model captures the effect of visual and textual contents on layouts, and implicitly learns complex layout structure variations from data without the use of any heuristic rules. To train our model, we build a large-scale magazine layout dataset with fine-grained layout annotations and keyword labeling. Experimental results show that our model can synthesize high-quality layouts based on the visual semantics of input images and keyword-based summary of input text. We also demonstrate that our model internally learns powerful features that capture the subtle interaction between contents and layouts, which are useful for layout-aware design retrieval.

Published in

ACM Transactions on Graphics, Volume 38, Issue 4
August 2019
1480 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/3306346

Copyright © 2019 ACM

Publisher

Association for Computing Machinery, New York, NY, United States
