research-article

Content-aware generative modeling of graphic design layouts

Authors:
Xinru Zheng

City University of Hong Kong

City University of Hong Kong
View Profile

,
Xiaotian Qiao

City University of Hong Kong

City University of Hong Kong
View Profile

,
Ying Cao

City University of Hong Kong

City University of Hong Kong
View Profile

,
Rynson W. H. Lau

City University of Hong Kong

City University of Hong Kong
View Profile

Authors Info & Claims

ACM Transactions on Graphics Volume 38 Issue 4Article No.: 133pp 1–15https://doi.org/10.1145/3306346.3322971

Published:12 July 2019Publication History

ACM Transactions on Graphics

Abstract

Layout is fundamental to graphic designs. For visual attractiveness and efficient communication of messages and ideas, graphic design layouts often have great variation, driven by the contents to be presented. In this paper, we study the problem of content-aware graphic design layout generation. We propose a deep generative model for graphic design layouts that is able to synthesize layout designs based on the visual and textual semantics of user inputs. Unlike previous approaches that are oblivious to the input contents and rely on heuristic criteria, our model captures the effect of visual and textual contents on layouts, and implicitly learns complex layout structure variations from data without the use of any heuristic rules. To train our model, we build a large-scale magazine layout dataset with fine-grained layout annotations and keyword labeling. Experimental results show that our model can synthesize high-quality layouts based on the visual semantics of input images and keyword-based summary of input text. We also demonstrate that our model internally learns powerful features that capture the subtle interaction between contents and layouts, which are useful for layout-aware design retrieval.

Supplemental Material

Available for Download

zip

a133-zheng.zip (14.1 MB)

Supplemental material

References

Apostolos Antonacopoulos, David Bridson, Christos Papadopoulos, and Stefan Pletschacher. 2009. A realistic dataset for performance evaluation of document layout analysis. In Proc. ICDAR. 296--300. Google ScholarDigital Library
Michael W Berry and Jacob Kogan. 2010. Text Mining: Applications and Theory. John Wiley & Sons.Google ScholarCross Ref
Andrew Brock, Theodore Lim, JM Ritchie, and Nick Weston. 2017. Neural photo editing with introspective adversarial networks. In Proc. ICLR.Google Scholar
Zoya Bylinskii, Nam Wook Kim, Peter O'Donovan, Sami Alsheikh, Spandan Madan, Hanspeter Pfister, Fredo Durand, Bryan Russell, and Aaron Hertzmann. 2017. Learning visual importance for graphic designs and data visualizations. In Proc. ACM UIST. 57--69. Google ScholarDigital Library
Ying Cao, Antoni Chan, and Rynson Lau. 2012. Automatic stylistic manga layout. ACM TOG 31, 6 (2012). Google ScholarDigital Library
Ying Cao, Rynson Lau, and Antoni Chan. 2014. Look Over Here: Attention-Directing Composition of Manga Elements. ACM TOG 33, 4 (2014). Google ScholarDigital Library
Jaime Carbonell and Jade Goldstein. 1998. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proc. ACM SIGIR. 335--336. Google ScholarDigital Library
Niranjan Damera-Venkata, Josép Bento, and Eamonn O'Brien-Strain. 2011. Probabilistic document model for automated document composition. In Proc. ACM DocEng. 3--12. Google ScholarDigital Library
Jeff Donahue, Philipp Krähenbühl, and Trevor Darrell. 2017. Adversarial feature learning. In Proc. ICLR.Google Scholar
Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Alex Lamb, Martin Arjovsky, Olivier Mastropietro, and Aaron Courville. 2017. Adversarially learned inference. In Proc. ICLR.Google Scholar
SM Ali Eslami, Nicolas Heess, Christopher Williams, and John Winn. 2014. The shape Boltzmann machine: A strong model of object shape. IJCV 107, 2 (2014), 155--176. Google ScholarDigital Library
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Proc. NIPS. 2672--2680. Google ScholarDigital Library
Agrim Gupta, Justin Johnson, Li Fei-Fei, Silvio Savarese, and Alexandre Alahi. 2018. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In Proc. IEEE CVPR.Google ScholarCross Ref
Bernardo Heynemann, Cezar Espinola, and Fabio M. Costa. 2015. Detection Algorithms. http://thumbor.readthedocs.io/en/latest/detection_algorithms.html.Google Scholar
Geoffrey Hinton, Simon Osindero, and YeeWhye Teh. 2006. A fast learning algorithm for deep belief nets. Neural Computation 18, 7 (2006), 1527--1554. Google ScholarDigital Library
Nathan Hurst, Wilmot Li, and Kim Marriott. 2009. Review of automatic document formatting. In Proc. ACM DocEng. 99--108. Google ScholarDigital Library
Phillip Isola, JunYan Zhu, Tinghui Zhou, and Alexei Efros. 2017. Image-to-image translation with conditional adversarial networks. In Proc. CVPR.Google ScholarCross Ref
Charles Jacobs, Wilmot Li, Evan Schrier, David Bargeron, and David Salesin. 2003. Adaptive grid-based document layout. ACM TOG 22, 3 (2003), 838--847. Google ScholarDigital Library
Tom Kelly, Paul Guerrero, Anthony Steed, Peter Wonka, and Niloy J Mitra. 2018. FrankenGAN: Guided Detail Synthesis for Building Mass-Models Using Style-Synchonized GANs. ACM TOG (2018). Google ScholarDigital Library
Diederik Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In Proc. ICLR.Google Scholar
Diederik Kingma and Max Welling. 2014. Auto-encoding variational bayes. In Proc. ICLR.Google Scholar
Ranjitha Kumar, Jerry Talton, Salman Ahmad, and Scott Klemmer. 2011. Bricolage: example-based retargeting for Web design. In Proc. ACM CHI. 2197--2206. Google ScholarDigital Library
Jun Li, Kai Xu, Siddhartha Chaudhuri, Ersin Yumer, Hao Zhang, and Leonidas Guibas. 2017. GRASS: Generative Recursive Autoencoders for Shape Structures. ACM TOG 36, 4 (2017). Google ScholarDigital Library
Jianan Li, Tingfa Xu, Jianming Zhang, Aaron Hertzmann, and Jimei Yang. 2019. LayoutGAN: Generating Graphic Layouts with Wireframe Discriminator. In Proc. ICLR.Google Scholar
Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proc. CVPR. 3431--3440.Google ScholarCross Ref
Xudong Mao, Qing Li, Haoran Xie, Raymond YK Lau, Zhen Wang, and Stephen Paul Smolley. 2017. Least squares generative adversarial networks. In Proc. ICCV. 2813--2821.Google ScholarCross Ref
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Proc. NIPS. 3111--3119. Google ScholarDigital Library
Peter O'Donovan, Aseem Agarwala, and Aaron Hertzmann. 2014. Learning Layouts for Single-Page Graphic Designs. IEEE TVCG 20, 8 (2014), 1200--1213. Google ScholarDigital Library
Peter O'Donovan, Aseem Agarwala, and Aaron Hertzmann. 2015. DesignScape: Design with Interactive Layout Suggestions. In Proc. ACM CHI. 1221--1224. Google ScholarDigital Library
Xufang Pang, Ying Cao, Rynson Lau, and Antoni Chan. 2016. Directing user attention via visual flow on web designs. ACM TOG 35, 6 (2016). Google ScholarDigital Library
ZA Prust. 2010. Graphic Communications. Goodheart-Wilcox Publisher.Google Scholar
Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv:1511.06434 (2015).Google Scholar
Danilo Jimenez Rezende, Shakir Mohamed, and Daan Wierstra. 2014. Stochastic backpropagation and approximate inference in deep generative models. In Proc. ICML. Google ScholarDigital Library
Neha Saleem. 2015. The key characteristics of a fashion magazine. http://nehasaleemmedia.weebly.com/blog/the-key-characteristics-of-a-fashion-magazine.Google Scholar
Evan Schrier, Mira Dontcheva, Charles Jacobs, Geraldine Wade, and David Salesin. 2008. Adaptive layout for dynamically aggregated documents. In Proc. IUI. 99--108. Google ScholarDigital Library
Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In Proc. ICLR.Google Scholar
David Smith. 2014. Magazine Design Tips: Key Elements. https://www.envision-creative.com/magazine-design-tips-key-elements/.Google Scholar
Adobe Spark. 2018. https://spark.adobe.com/.Google Scholar
Mary Stribley. 2015. 10 Rules of Composition All Designers Live By. https://designschool.canva.com/blog/visual-design-composition/.Google Scholar
Kashyap Todi, Daryl Weir, and Antti Oulasvirta. 2016. Sketchplore: Sketch and explore with a layout optimiser. In Proc. ACM DIS. 543--555. Google ScholarDigital Library
Leon Todoran, Marcel Worring, and Arnold WM Smeulders. 2005. The UvA color document dataset. International Journal on Document Analysis and Recognition 7, 4 (2005), 228--240. Google ScholarDigital Library
Zhirong Wu, Shuran Song, Aditya Khosla, Fisher Yu, Linguang Zhang, Xiaoou Tang, and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proc. IEEE CVPR.Google Scholar
Xuyong Yang, Tao Mei, Ying-Qing Xu, Yong Rui, and Shipeng Li. 2016. Automatic generation of visual-textual presentation layout. ACM TOMM 12, 2 (2016), 33. Google ScholarDigital Library
Charles Ying. 2014. Automating Layouts Bring Flipboard's Magazine Style To Web And Windows. https://techcrunch.com/2014/03/23/layout-in-flipboard-for-web-and-windows/?ncid=rss.Google Scholar
Nanxuan Zhao, Ying Cao, and Rynson Lau. 2018. What Characterizes Personalities of Graphic Designs? ACM TOG 37, 4 (2018), 1--15. Google ScholarDigital Library
Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, and Alexei Efros. 2016. Generative visual manipulation on the natural image manifold. In Proc. ECCV. 597--613.Google ScholarCross Ref

Index Terms

Content-aware generative modeling of graphic design layouts
1. Computer systems organization
  1. Architectures
    1. Other architectures
      1. Neural networks
2. Computing methodologies
  1. Computer graphics
  2. Machine learning
    1. Machine learning approaches

Recommendations

Continuous and Gradual Style Changes of Graphic Designs with Generative Model
IUI '21: Proceedings of the 26th International Conference on Intelligent User Interfaces

Creating a high-quality layout design from scratch is difficult for novices. Therefore, novices often consult the works of other skilled designers for ideas regarding layout designs. Researchers have previously investigated methods to support the layout ...
Read More
Comparing the usability of grid-bag and constraint-based layouts
OzCHI '12: Proceedings of the 24th Australian Computer-Human Interaction Conference

While the usability of GUI design methods has been studied in general, the usability of layout specification methods is largely unexplored.

In this paper we provide an empirical comparison of two popular GUI layout models, grid-bag layout and constraint-...
Read More
Content-aware layout
CHI EA '07: CHI '07 Extended Abstracts on Human Factors in Computing Systems

We describe content-aware layout (CAL), a technique that automatically arranges windows on a user.s desktop. Unlike conventional window managers that automatically cascade or tile each window without regard to its content, CAL uses information about the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Graphics Volume 38, Issue 4
August 2019
1480 pages
ISSN:0730-0301
EISSN:1557-7368
DOI:10.1145/3306346
Editor:
Olga Sorkine-Hornung
ETH Zurich
Issue’s Table of Contents
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 July 2019
Published in tog Volume 38, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
content-aware
deep generative networks
graphic design
layout
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 110
  Total Citations
  View Citations
- 1,960
  Total Downloads
- Downloads (Last 12 months)337
- Downloads (Last 6 weeks)39
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Content-aware generative modeling of graphic design layouts

ACM Transactions on Graphics

Abstract

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

Continuous and Gradual Style Changes of Graphic Designs with Generative Model

Comparing the usability of grid-bag and constraint-based layouts

Content-aware layout