Research Article

Attribute2Font: creating fonts you want from attributes

Published: 12 August 2020

Abstract

Font design is still considered the exclusive province of professional designers, whose creativity existing software systems cannot match. Nevertheless, most commercial font products are in fact designed manually to meet specific requirements on glyph attributes, such as italic, serif, cursive, width, and angularity. Inspired by this observation, we propose a novel model, Attribute2Font, which automatically creates fonts by synthesizing visually pleasing glyph images according to user-specified attributes and their corresponding values. To the best of our knowledge, ours is the first model in the literature capable of generating glyph images in new font styles, rather than retrieving existing fonts, from given values of specified font attributes. Specifically, Attribute2Font is trained to perform font style transfer between any two fonts conditioned on their attribute values. After training, the model can generate glyph images in accordance with an arbitrary set of font attribute values. Furthermore, a novel unit named the Attribute Attention Module is designed to make the generated glyph images better embody the prominent font attributes. Because annotations of font attribute values are extremely expensive to obtain, a semi-supervised learning scheme is also introduced to exploit a large number of unlabeled fonts. Experimental results demonstrate that our model achieves impressive performance on many tasks, such as creating glyph images in new font styles, editing existing fonts, and interpolating among different fonts.
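To make the conditioning idea concrete, here is a minimal NumPy sketch of attribute-conditioned channel reweighting in the spirit of the Attribute Attention Module, along with the attribute-vector blending that underlies font interpolation. The projection matrix `W`, the sigmoid gating, and all shapes below are illustrative assumptions, not the paper's actual architecture:

```python
import numpy as np

def attribute_attention(features: np.ndarray,
                        attr_values: np.ndarray,
                        W: np.ndarray) -> np.ndarray:
    """Reweight feature-map channels by a projection of font attribute values.

    features    -- (C, H, W) glyph feature map
    attr_values -- (A,) user-specified attribute values, e.g. in [0, 1]
    W           -- (C, A) hypothetical learned projection matrix
    """
    logits = W @ attr_values                 # one logit per channel, shape (C,)
    gates = 1.0 / (1.0 + np.exp(-logits))    # sigmoid gate in (0, 1) per channel
    return features * gates[:, None, None]   # broadcast gates over H and W

def interpolate_attributes(a: np.ndarray, b: np.ndarray, t: float) -> np.ndarray:
    """Linearly blend two fonts' attribute vectors (t = 0 gives a, t = 1 gives b)."""
    return (1.0 - t) * a + t * b
```

Under this view, font editing and interpolation reduce to changing or blending the attribute vector fed to the generator, with the attention gates steering which feature channels respond.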

Supplemental Material

• MP4 file: presentation video, with transcript
• ZIP file: supplemental files




      Published In

ACM Transactions on Graphics, Volume 39, Issue 4
August 2020, 1732 pages
ISSN: 0730-0301
EISSN: 1557-7368
DOI: 10.1145/3386569

      Publisher

      Association for Computing Machinery

      New York, NY, United States



      Author Tags

      1. deep generative models
      2. font design
      3. image synthesis
      4. style transfer
      5. type design


      Funding Sources

      • National Natural Science Foundation of China
      • Beijing Nova Program of Science and Technology

Article Metrics

• Downloads (last 12 months): 118
• Downloads (last 6 weeks): 9

Reflects downloads up to 20 Jan 2025

      Cited By

• (2024) "Typeface Generation through Style Descriptions With Generative Models." In Proceedings of the 19th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in Industry, 1-12. DOI: 10.1145/3703619.3706043. Online: 1 Dec 2024.
• (2024) "DreamFont3D: Personalized Text-to-3D Artistic Font Generation." In ACM SIGGRAPH 2024 Conference Papers, 1-11. DOI: 10.1145/3641519.3657476. Online: 13 Jul 2024.
• (2024) "TypeDance: Creating Semantic Typographic Logos from Image through Personalized Generation." In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems, 1-18. DOI: 10.1145/3613904.3642185. Online: 11 May 2024.
• (2024) "Beyond pixels: text-guided deep insights into graphic design image aesthetics." Journal of Electronic Imaging 33(5). DOI: 10.1117/1.JEI.33.5.053059. Online: 1 Sep 2024.
• (2024) "FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications." Computer Graphics Forum 43(2). DOI: 10.1111/cgf.15043. Online: 30 Apr 2024.
• (2024) "Evaluating digital creativity support for children." International Journal of Child-Computer Interaction 38(C). DOI: 10.1016/j.ijcci.2023.100603. Online: 27 Feb 2024.
• (2024) "Controllable image synthesis methods, applications and challenges: a comprehensive survey." Artificial Intelligence Review 57(12). DOI: 10.1007/s10462-024-10987-w. Online: 18 Oct 2024.
• (2024) "Dual-mechanism surface tension model for SPH-based simulation." The Visual Computer 40(7), 4765-4776. DOI: 10.1007/s00371-024-03474-4. Online: 27 May 2024.
• (2024) "Font Style Interpolation with Diffusion Models." In Document Analysis and Recognition - ICDAR 2024, 86-103. DOI: 10.1007/978-3-031-70536-6_6. Online: 3 Sep 2024.
• (2024) "Impression-CLIP: Contrastive Shape-Impression Embedding for Fonts." In Document Analysis and Recognition - ICDAR 2024, 70-85. DOI: 10.1007/978-3-031-70536-6_5. Online: 30 Aug 2024.
