Part-Level Sketch Segmentation and Labeling Using Dual-CNN

Zhu, Xianyi; Xiao, Yi; Zheng, Yan

doi:10.1007/978-3-030-04167-0_34

Part-Level Sketch Segmentation and Labeling Using Dual-CNN

Xianyi Zhu¹⁶,
Yi Xiao¹⁶ &
Yan Zheng¹⁷

Conference paper
First Online: 17 November 2018

3761 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11301))

Abstract

Part-level sketch segmentation and labeling refers to segment an object sketch to semantic component parts. It is a hard task since sketches carry much fewer features than natural images. Inspired by the neural networks used in sketch classification, which shows the performance of the network is significantly affected by the kernel size, we propose a dual-convolutional neural network (CNN) method to tackle automatic sketch segmentation and labeling. The dual-CNN model contains two CNNs, one with large-size convolutional kernels to process long sketches, the other with small-size kernels to work on short ones. Both CNNs have three convolutional layers and three fully connection layers. Except for the first convolutional layer, the rest configurations of these two CNNs are same. To further enhance the performance of the method, we model position and orientation as a triple-channel input of our networks by fusing the minimal oriented rectangle bounding boxes (MORBB) of stroke and its host sketch as masks. Extensive experimental results verify our method and demonstrate that our approach outperforms state of the art.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Belongie, S.J., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24(4), 509–522 (2002)
Article Google Scholar
Chung, J., Gülçehre, Ç., Cho, K., Bengio, Y.: Gated feedback recurrent neural networks. In: Bach, F.R., Blei, D.M. (eds.) ICML 2015. PMLR, vol. 37, pp. 2067–2075. MIT Press, Cambridge (2015)
Google Scholar
Eitz, M., Hays, J., Alexa, M.: How do humans sketch objects? ACM Trans. Graph. 31(4), 44:1–44:10 (2012)
Google Scholar
Freeman, H., Shapira, R.: Determining the minimum-area encasing rectangle for an arbitrary closed curve. Commun. ACM 18(7), 409–413 (1975)
Article MathSciNet Google Scholar
Furusawa, C., Fukusato, T., Okada, N., Hirai, T., Morishima, S.: Quasi 3D rotation for hand-drawn characters. In: SIGGRAPH 2014, Posters Proceedings, p. 12:1. ACM Press, New York (2014)
Google Scholar
Galea, C., Farrugia, R.A.: Forensic face photo-sketch recognition using a deep learning-based architecture. IEEE Sig. Process. Lett. 24(11), 1586–1590 (2017)
Article Google Scholar
He, J., Wu, X., Jiang, Y., Zhao, B., Peng, Q.: Sketch recognition with deep visual-sequential fusion model. In: Liu, Q., et al. (eds.) ACM Multimedia 2017, pp. 448–456. ACM Press, New York (2017)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR 2016, pp. 770–778. IEEE Press, New York (2016)
Google Scholar
Huang, Z., Fu, H., Lau, R.W.: Data-driven segmentation and labeling of freehand sketches. ACM Trans. Graph. 33(6), 175:1–175:10 (2014)
Article Google Scholar
Jia, Q., Yu, M., Fan, X., Li, H.: Sequential dual deep learning with shape and texture features for sketch recognition. CoRR abs/1708.02716 (2017). http://arxiv.org/abs/1708.02716
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)
Article Google Scholar
Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Brodley, C.E., Danyluk, A.P. (eds.) ICML 2001, pp. 282–289. Morgan Kaufmann, San Francisco (2001)
Google Scholar
Léon, B.: Large-scale machine learning with stochastic gradient descent. In: Lechevallier, Y., Saporta, G. (eds.) COMPSTAT 2010, pp. 177–186. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-7908-2604-3_16
Chapter Google Scholar
Li, Y., Hospedales, T.M., Song, Y., Gong, S.: Free-hand sketch recognition by multi-kernel feature learning. Comput. Vis. Image Underst. 137, 1–11 (2015)
Article Google Scholar
Liao, P., Chen, T., Chung, P.: A fast algorithm for multilevel thresholding. J. Inf. Sci. Eng. 17(5), 713–727 (2001)
Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. In: ICCV 1999, pp. 1150–1157. IEEE Press, New York (1999)
Google Scholar
Mao, C., Qin, S.F., Wright, D.K.: A sketch-based gesture interface for rough 3D stick figure animation. In: Jorge, J.A.P., Igarashi, T. (eds.) Sketch Based Interfaces and Modeling 2005, pp. 175–183. Eurographics Association, Geneva (2005)
Google Scholar
Mikolov, T., Karafiát, M., Burget, L., Cernocký, J., Khudanpur, S.: Recurrent neural network based language model. In: Kobayashi, T., Hirose, K., Nakamura, S. (eds.) INTERSPEECH 2010, pp. 1045–1048. ISCA Press, Singapore (2010)
Google Scholar
Noris, G., et al.: Smart scribbles for sketch segmentation. Comput. Graph. Forum 31(8), 2516–2527 (2012)
Article Google Scholar
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 490–503. Springer, Heidelberg (2006). https://doi.org/10.1007/11744085_38
Chapter Google Scholar
Olsen, L., Samavati, F.F., Sousa, M.C., Jorge, J.A.: Sketch-based modeling: a survey. Comput. Graph. 33(1), 85–103 (2009)
Article Google Scholar
van den Oord, A., Kalchbrenner, N., Kavukcuoglu, K.: Pixel recurrent neural networks. In: Balcan, M., Weinberger, K.Q. (eds.) ICML 2016. PMLR, vol. 48, pp. 1747–1756. MIT Press, Cambridge (2016)
Google Scholar
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)
Article Google Scholar
Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.J.: Image classification with the fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)
Article MathSciNet Google Scholar
Sangkloy, P., Burnell, N., Ham, C., Hays, J.: The sketchy database: learning to retrieve badly drawn bunnies. ACM Trans. Graph. 35(4), 119:1–119:12 (2016)
Article Google Scholar
Sarvadevabhatla, R.K., Kundu, J., Babu, R.V.: Enabling my robot to play pictionary: recurrent neural networks for sketch recognition. In: Hanjalic, A., et al. (eds.) ACM Multimedia 2016, pp. 247–251. ACM Press, New York (2016)
Google Scholar
Schneider, R.G., Tuytelaars, T.: Sketch classification and classification-driven analysis using fisher vectors. ACM Trans. Graph. 33(6), 174:1–174:9 (2014)
Article Google Scholar
Schneider, R.G., Tuytelaars, T.: Example-based sketch segmentation and labeling using CRFs. ACM Trans. Graph. 35(5), 151:1–151:9 (2016)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR abs/1409.1556 (2014). http://arxiv.org/abs/1409.1556
Sun, Z., Wang, C., Zhang, L., Zhang, L.: Free hand-drawn sketch segmentation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7572, pp. 626–639. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33718-5_45
Chapter Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T.S., Gong, Y.: Locality-constrained linear coding for image classification. In: CVPR 2010, pp. 3360–3367. IEEE Press, New York (2010)
Google Scholar
Xie, X., et al.: Sketch-to-design: context-based part assembly. Comput. Graph. Forum 32(8), 233–245 (2013)
Article Google Scholar
Yu, Q., Yang, Y., Liu, F., Song, Y., Xiang, T., Hospedales, T.M.: Sketch-a-net: a deep neural network that beats humans. Int. J. Comput. Vis. 122(3), 411–425 (2017)
Article MathSciNet Google Scholar
Yu, Q., Yang, Y., Song, Y., Xiang, T., Hospedales, T.M.: Sketch-a-net that beats humans. In: Xie, X., Jones, M.W., Tam, G.K.L. (eds.) BMVC 2015, pp. 7.1–7.12. BMVA Press, London (2015)
Google Scholar

Download references

Acknowledgements

The work is supported by the National Key Research & Development Program of China (Grant Num.:2018YFB0203904), NSFC from PRC (Grant Num.:61872137, 61502158, 61803150), Hunan NSF (Grant Num.: 2017JJ3042, 2018JJ3067), and China Postdoctoral Foundation (Grant Num.: 2016M590740).

Author information

Authors and Affiliations

College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, People’s Republic of China
Xianyi Zhu & Yi Xiao
College of Electrical and Information Engineering, Hunan University, Changsha, 410082, People’s Republic of China
Yan Zheng

Authors

Xianyi Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yi Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yi Xiao .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhu, X., Xiao, Y., Zheng, Y. (2018). Part-Level Sketch Segmentation and Labeling Using Dual-CNN. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11301. Springer, Cham. https://doi.org/10.1007/978-3-030-04167-0_34

Download citation

DOI: https://doi.org/10.1007/978-3-030-04167-0_34
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04166-3
Online ISBN: 978-3-030-04167-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics