Synthesizing style-preserving cartoons via non-negative style factorization

Liang, Zhang; Xiao, Jun; Zhuang, Yue-ting

doi:10.1631/jzus.C1100202

Synthesizing style-preserving cartoons via non-negative style factorization

Published: 06 March 2012

Volume 13, pages 196–207, (2012)
Cite this article

Journal of Zhejiang University SCIENCE C Aims and scope Submit manuscript

Zhang Liang¹,
Jun Xiao¹ &
Yue-ting Zhuang¹

112 Accesses
Explore all metrics

Abstract

We present a complete framework for synthesizing style-preserving 2D cartoons by learning from traditional Chinese cartoons. In contrast to reusing-based approaches which rely on rearranging or retrieving existing cartoon sequences, we aim to generate stylized cartoons with the idea of style factorization. Specifically, starting with 2D skeleton features of cartoon characters extracted by an improved rotoscoping system, we present a non-negative style factorization (NNSF) algorithm to obtain style basis and weights and simultaneously preserve class separability. Thus, factorized style basis can be combined with heterogeneous weights to re-synthesize style-preserving features, and then these features are used as the driving source in the character reshaping process via our proposed subkey-driving strategy. Extensive experiments and examples demonstrate the effectiveness of the proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Style Transfer Based on Style Primitive Discovery

Computational Decomposition of Style for Controllable and Enhanced Style Transfer

CalliGAN: Unpaired Mutli-chirography Chinese Calligraphy Image Translation

References

Agarwala, A., Hertzmann, A., Salesin, D.H., Seitz, S.M., 2004. Keyframe-based tracking for rotoscoping and animation. ACM Trans. Graph., 23(3):584–591. [doi:10.1145/1015706.1015764]
Article Google Scholar
Aharon, M., Elad, M., Bruckstein, A., 2006. K-SVD: an algorithm for designing of overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process., 54(11):4311–4322. [doi:10.1109/TSP.2006.881199]
Article Google Scholar
Alexa, M., Cohen-Or, D., Levin, D., 2000. As-Rigid-as-Possible Shape Interpolation. Proc. SIGGRAPH, p.157–164. [doi:10.1145/344779.344859]
Brand, M., Hertzmann, A., 2000. Style Machines. Proc. SIGGRAPH, p.183–192. [doi:10.1145/344779.344865]
Bregler, C., Loeb, L., Chuang, E., Deshpande, H., 2002. Turning to the masters: motion capturing cartoons. ACM Trans. Graph., 21(3):399–407. [doi:10.1145/566654.566595]
Article Google Scholar
Chenney, S., Pingel, M., Iverson, R., Szymanski, M., 2002. Simulating Cartoon Style Animation. Proc. NPAR, p.133–138. [doi:10.1145/508530.508553]
Freifeld, O., Weiss, A., Zuffi, S., Black, M.J., 2010. Contour People: a Parameterized Model of 2D Articulated Human Shape. Proc. CVPR, p.639–646. [doi:10.1109/CVPR.2010.5540154]
Guan, P., Freifeld, O., Black, M., 2010. A 2D Human Body Model Dressed in Eigen Clothing. Proc. ECCV, p.285–298.
Hoch, M., Litwinowicz, P.C., 1996. A semi-automatic system for edge tracking with snakes. Vis. Comput., 12(2):75–83. [doi:10.1007/s003710050049]
Google Scholar
Hornung, A., Dekkers, E., Kobbelt, L., 2007. Character animation from 2D pictures and 3D motion data. ACM Trans. Graph., 26(1):1–es. [doi:10.1145/1189762.1189763]
Article Google Scholar
Hoyer, P.O., 2002. Non-negative Sparse Coding. Proc. 12th IEEE Workshop on Neural Networks for Signal Processing, p.557–565. [doi:10.1109/NNSP.2002.1030067]
Hsu, E., Pulli, K., Popović, J., 2005. Style translation for human motion. ACM Trans. Graph., 24(3):1082–1089. [doi:10.1145/1073204.1073315]
Article Google Scholar
Igarashi, T., Moscovich, T., Hughes, J.F., 2005. As-rigid-as-possible shape manipulation. ACM Trans. Graph., 24(3): 1134–1141. [doi:10.1145/1073204.1073323]
Article Google Scholar
Jonker, R., Volgenant, A., 1987. A shortest augmenting path algorithm for dense and sparse linear assignment problems. Computing, 38(4):325–340. [doi:10.1007/BF02278710]
Article MathSciNet MATH Google Scholar
Kuo, P., Makris, D., Megherbi, N., Nebel, J.C., 2008. Integration of local image cues for probabilistic 2D pose recovery. LNCS, 5359:214–223. [doi:10.1007/978-3-540-89646-3_21]
Google Scholar
Kwon, J., Lee, I.K., 2008. Exaggerating character motions using sub-joint hierarchy. Comput. Graph. Forum, 27(6): 1677–1686. [doi:10.1111/j.1467-8659.2008.01177.x]
Article MathSciNet MATH Google Scholar
Lau, M., Chai, J., Xu, Y.Q., Shum, H.Y., 2009. Face poser: interactive modeling of 3D facial expressions using facial priors. ACM Trans. Graph., 29(1):1–17. [doi:10.1145/1640443.1640446]
Article Google Scholar
Lee, D.D., Seung, H.S., 2001. Algorithms for Non-negative Matrix Factorization. Proc. NIPS, 13:556–562.
Google Scholar
Li, Y., Gleicher, M., Xu, Y.Q., Shum, H.Y., 2003. Stylizing Motion with Drawings. Proc. SCA, p.309–319.
Ma, X., Le, B.H., Deng, Z., 2009. Style Learning and Transferring for Facial Animation Editing. Proc. SCA, p.123–132. [doi:10.1145/1599470.1599486]
Moeslund, T.B., Hilton, A., Krüger, V., 2006. A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Understand., 104(2–3):90–126. [doi:10.1016/j.cviu.2006.08.002]
Article Google Scholar
Pullen, K., Bregler, C., 2002. Motion capture assisted animation: texturing and synthesis. ACM Trans. Graph., 21(3): 501–508. [doi:10.1145/566654.566608]
Article Google Scholar
Rogez, G., Orrite-Uruñuela, C., Martínez-del-Rincón, J., 2008. A spatio-temporal 2D-models framework for human pose recovery in monocular sequences. Pattern Recogn., 41(9):2926–2944. [doi:10.1016/j.patcog.2008.02.012]
Article MATH Google Scholar
Schaefer, S., McPhail, T., Warren, J., 2006. Image deformation using moving least squares. ACM Trans. Graph., 25(3): 533–540. [doi:10.1145/1141911.1141920]
Article Google Scholar
Sýkora, D., Sedlacek, D., Jinchao, S., Dingliana, J., Collins, S., 2010. Adding depth to cartoons using sparse depth (in)equalities. Comput. Graph. Forum, 29(2):615–623. [doi:10.1111/j.1467-8659.2009.01631.x]
Article Google Scholar
Tenenbaum, J.B., Freeman, W.T., 2000. Separating style and content with bilinear models. Neur. Comput., 12(6): 1247–1283. [doi:10.1162/089976600300015349]
Article Google Scholar
Tenenbaum, J.B., Silva, V., Langford, J.C., 2000. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319–2323. [doi:10.1126/science.290.5500.2319]
Article Google Scholar
Torresani, L., Hackney, P., Bregler, C., 2007. Learning Motion Style Synthesis from Perceptual Observations. Proc. NIPS, 19:1393–1400.
Google Scholar
Wang, H., Li, H., 2002. Cartoon Motion Capture by Shape Matching. Proc. Conf. on Computer Graphics and Applications, p.454–456.
Wang, J., Drucker, S.M., Agrawala, M., Cohen, M.F., 2006. The cartoon animation filter. ACM Trans. Graph., 25(3): 1169–1173. [doi:10.1145/1141911.1142010]
Article Google Scholar
Wang, J.M., Fleet, D.J., Hertzmann, A., 2007. Multifactor Gaussian Process Models for Style-Content Separation. Proc. ICML, p.975–982. [doi:10.1145/1273496.1273619]
Weng, Y., Xu, W., Wu, Y., Zhou, K., Guo, B., 2006. 2D shape deformation using nonlinear least squares optimization. Vis. Comput., 22(9–11):653–660. [doi:10.1007/s00371-006-0054-y]
Article Google Scholar
Yan, H.B., Hu, S., Martin, R.R., Yang, Y.L., 2008. Shape deformation using a skeleton to drive simplex transformations. IEEE Trans. Visual. Comput. Graph., 14(3):693–706. [doi:10.1109/TVCG.2008.28]
Article Google Scholar
Yang, Y., Zhuang, Y., Xu, D., Pan, Y., Tao, D., Maybank, S., 2009. Retrieval Based Interactive Cartoon Synthesis via Unsupervised Bi-distance Metric Learning. Proc. Conf. on Multimedia, p.311–320. [doi:10.1145/1631272.1631316]
Zhou, S., Fu, H., Liu, L., Cohen-Or, D., Han, X., 2010. Parametric Reshaping of Human Bodies in Images. Proc. SIGGRAPH, p.1–10. [doi:10.1145/1778765.1778863]

Download references

Author information

Authors and Affiliations

Institute of Artificial Intelligence, School of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China
Zhang Liang, Jun Xiao & Yue-ting Zhuang

Authors

Zhang Liang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Yue-ting Zhuang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jun Xiao.

Additional information

Project supported by the National Basic Research Program (973) of China (No. 2012CB316400), the National Natural Science Foundation of China (No. 60903134), and the Natural Science Foundation of Zhejiang Province, China (No. Y1101129)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liang, Z., Xiao, J. & Zhuang, Yt. Synthesizing style-preserving cartoons via non-negative style factorization. J. Zhejiang Univ. - Sci. C 13, 196–207 (2012). https://doi.org/10.1631/jzus.C1100202

Download citation

Received: 08 July 2011
Accepted: 07 November 2011
Published: 06 March 2012
Issue Date: March 2012
DOI: https://doi.org/10.1631/jzus.C1100202

Key words

CLC number

TP391

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Synthesizing style-preserving cartoons via non-negative style factorization

Abstract

Access this article

Similar content being viewed by others

Style Transfer Based on Style Primitive Discovery

Computational Decomposition of Style for Controllable and Enhanced Style Transfer

CalliGAN: Unpaired Mutli-chirography Chinese Calligraphy Image Translation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

CLC number

Navigation

Synthesizing style-preserving cartoons via non-negative style factorization

Abstract

Access this article

Similar content being viewed by others

Style Transfer Based on Style Primitive Discovery

Computational Decomposition of Style for Controllable and Enhanced Style Transfer

CalliGAN: Unpaired Mutli-chirography Chinese Calligraphy Image Translation

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

CLC number

Search

Navigation