Image Generation for Printed Character by Representation Learning

Gu, Kangzheng; Bai, Jiansong; Zhang, Qichen; Peng, Junjie; Zhang, Wenqiang

doi:10.1007/978-3-030-00764-5_60

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11166)

Included in the following conference series:

Pacific Rim Conference on Multimedia

Abstract

With the development of convolutional neural networks, generative models can now synthesize high-quality images. However, most of these models are limited in generalization and extensibility, and generating images with multiple specified features remains difficult. This paper therefore introduces an expandable approach to generating images with multiple features. We use our model to generate images that contain a single character with a specified font and position, by learning the representations of different features from existing images and combining these representations. Several structures are proposed to increase training efficiency and extensibility. Finally, we report experiments that show the performance of our model.

Author information

Authors and Affiliations

Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai, People’s Republic of China
Kangzheng Gu & Wenqiang Zhang
Department of Art and Design, Fudan University, Shanghai, People’s Republic of China
Jiansong Bai
School of Sociology and Political Science, Shanghai University, Shanghai, People’s Republic of China
Qichen Zhang
School of Computer Science, Shanghai University, Shanghai, People’s Republic of China
Junjie Peng

Corresponding author

Correspondence to Wenqiang Zhang.

Editor information

Editors and Affiliations

Hefei University of Technology, Hefei, China
Richang Hong
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
University of Tokyo, Tokyo, Japan
Toshihiko Yamasaki
Hefei University of Technology, Hefei, China
Meng Wang
City University of Hong Kong, Hong Kong, Hong Kong
Chong-Wah Ngo


About this paper

Cite this paper

Gu, K., Bai, J., Zhang, Q., Peng, J., Zhang, W. (2018). Image Generation for Printed Character by Representation Learning. In: Hong, R., Cheng, WH., Yamasaki, T., Wang, M., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2018. PCM 2018. Lecture Notes in Computer Science, vol 11166. Springer, Cham. https://doi.org/10.1007/978-3-030-00764-5_60

DOI: https://doi.org/10.1007/978-3-030-00764-5_60
Published: 18 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00763-8
Online ISBN: 978-3-030-00764-5
eBook Packages: Computer Science, Computer Science (R0)
