DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search

Wang, Wenguang; Lian, Zhouhui; Tang, Yingmin; Xiao, Jianguo

doi:10.1007/978-3-030-37731-1_29

Conference paper
First Online: 24 December 2019

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 11961)

Abstract

Glyphs in many writing systems (e.g., Chinese) are composed of a sequence of strokes written in a specific order. Interpreting glyph structure (i.e., stroke extraction) is one of the most important processing steps in many tasks, including aesthetic quality evaluation, handwriting synthesis, and character recognition. However, existing methods that rely heavily on accurate shape matching are not only time-consuming but also unsatisfactory in stroke extraction performance. In this paper, we propose a novel method based on semantic segmentation and tabu search to interpret the structure of Chinese glyphs. Specifically, we first employ an improved Fully Convolutional Network (FCN), DeepStroke, to extract strokes, and then use tabu search to obtain the order in which these strokes are drawn. We also build the Chinese Character Stroke Segmentation Dataset (CCSSD), consisting of 67,630 character images divided evenly across 10 different font styles. This dataset provides a benchmark for both stroke extraction and semantic segmentation tasks. Experimental results demonstrate the effectiveness and efficiency of our method and validate its superiority over the state of the art.
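
As a rough illustration of the second stage, the sketch below shows a generic tabu search over stroke orderings. It is not the procedure from the paper: the abstract does not state the objective or the neighborhood, so the cost function `order_cost` (total distance between consecutive stroke centroids), the swap neighborhood, and the parameters `iterations` and `tenure` are all assumptions made for the example.

```python
import itertools
import math


def order_cost(order, centroids):
    """Hypothetical cost: total travel between consecutive stroke centroids."""
    return sum(math.dist(centroids[a], centroids[b])
               for a, b in zip(order, order[1:]))


def tabu_search_order(centroids, iterations=200, tenure=5):
    """Search for a low-cost stroke ordering with a basic tabu search.

    centroids: list of (x, y) points, one per extracted stroke (assumed input).
    Returns (best_order, best_cost).
    """
    n = len(centroids)
    current = list(range(n))
    best, best_cost = current[:], order_cost(current, centroids)
    tabu = {}  # swap move (i, j) -> last iteration at which it is still tabu

    for it in range(iterations):
        candidate, candidate_cost, candidate_move = None, math.inf, None
        # Neighborhood: all orderings reachable by swapping two positions.
        for i, j in itertools.combinations(range(n), 2):
            neighbor = current[:]
            neighbor[i], neighbor[j] = neighbor[j], neighbor[i]
            cost = order_cost(neighbor, centroids)
            is_tabu = tabu.get((i, j), -1) >= it
            # Aspiration: a tabu move is allowed only if it beats the best so far.
            if is_tabu and cost >= best_cost:
                continue
            if cost < candidate_cost:
                candidate, candidate_cost, candidate_move = neighbor, cost, (i, j)
        if candidate is None:
            break  # every move is tabu and none improves on the best
        # Move to the best admissible neighbor, even if it is worse than current.
        current = candidate
        tabu[candidate_move] = it + tenure
        if candidate_cost < best_cost:
            best, best_cost = candidate[:], candidate_cost
    return best, best_cost
```

Accepting the best non-tabu neighbor even when it is worse than the current ordering is what lets the search leave local minima; the tabu list then keeps it from immediately undoing that move.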

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant Nos. 61672056 and 61672043) and the Key Laboratory of Science, Technology and Standard in Press Industry (Key Laboratory of Intelligent Press Media Technology).

Author information

Authors and Affiliations

Wangxuan Institute of Computer Technology, Peking University, Beijing, China
Wenguang Wang, Zhouhui Lian, Yingmin Tang & Jianguo Xiao
Center For Chinese Font Design and Research, Peking University, Beijing, China
Wenguang Wang, Zhouhui Lian, Yingmin Tang & Jianguo Xiao

Corresponding author

Correspondence to Zhouhui Lian.

Editor information

Editors and Affiliations

Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Yong Man Ro
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Junmo Kim
National Cheng Kung University, Tainan City, Taiwan
Wei-Ta Chu
Tsinghua University, Beijing, China
Peng Cui
Korea Advanced Institute of Science and Technology, Daejeon, Korea (Republic of)
Jung-Woo Choi
National Tsing Hua University, Hsinchu, Taiwan
Min-Chun Hu
Ghent University, Ghent, Belgium
Wesley De Neve

About this paper

Cite this paper

Wang, W., Lian, Z., Tang, Y., Xiao, J. (2020). DeepStroke: Understanding Glyph Structure with Semantic Segmentation and Tabu Search. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science, vol 11961. Springer, Cham. https://doi.org/10.1007/978-3-030-37731-1_29

DOI: https://doi.org/10.1007/978-3-030-37731-1_29
Published: 24 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37730-4
Online ISBN: 978-3-030-37731-1
eBook Packages: Computer Science, Computer Science (R0)