Human Parsing via Shape Boltzmann Machine Networks

Wang, Qiurui; Yuan, Chun; Huang, Feiyue; Wang, Chengjie

doi:10.1007/978-3-319-24075-6_63

Qiurui Wang^18,19,
Chun Yuan¹⁹,
Feiyue Huang²⁰ &
…
Chengjie Wang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9314))

Included in the following conference series:

Pacific Rim Conference on Multimedia

1866 Accesses

Abstract

Human parsing is a challenging task because it is difficult to obtain accurate results of each part of human body. Precious Boltzmann Machine based methods reach good results on segmentation but are poor expression on human parts. In this paper, an approach is presented that exploits Shape Boltzmann Machine networks to improve the accuracy of human body parsing. The proposed Curve Correction method refines the final segmentation results. Experimental results show that the proposed method achieves good performance in body parsing, measured by Average Pixel Accuracy (aPA) against state-of-the-art methods on Penn-Fudan Pedestrians dataset and Pedestrian Parsing Surveillance Scenes dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

ResNet-Based Multiscale U-Net for Human Parsing

Graph-Boosted Attentive Network for Semantic Body Parsing

Human body segmentation based on shape constraint

Article 14 March 2017

References

Fowlkes, C.C., Bo, Y.: Shape-based pedestrian parsing. In: IEEE Conference on Computer Vision and Pattern Recognition (2011)
Google Scholar
Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3D human pose annotations. In: IEEE International Conference on Computer Vision (2009)
Google Scholar
Lin, L., Yang, W., Luo, P.: Clothing co-parsing by joint image segmentation and labeling. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
Luis, K., Ortiz, E., Berg, T.L., Yamaguchi, K., Hadi, M.: Parsing clothing in fashion photographs. In: IEEE Conference on Computer Vision and Pattern Recognition (2012)
Google Scholar
Rauschert, I., Collins, R.T.: A generative model for simultaneous estimation of human body shape and pixel-level segmentation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 704–717. Springer, Heidelberg (2012)
Chapter Google Scholar
Williams, C., Ali Eslami, S.M.: A generative model for parts-based object segmentation. In: Advances in Neural Information Processing Systems, pp. 272–281 (2012)
Google Scholar
Williams, C.K.I., Winn, J., Eslami, S.M.A., Heess, N.: The shape boltzmann machine: a strong model of object shape. In: IEEE Conference on Computer Vision and Pattern Recognition (2012)
Google Scholar
Wang, X., Tang, X., Luo, P., Tian, Y.: Switchable deep network for pedestrian detection. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
Salakhutdinov, R., Fidler, S., Zhu, Y., Urtasun, R.: Segdeepm: exploiting segmentation and context in deep neural networks for object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (2015)
Google Scholar
Girshick, R., Malik, J., Hariharan, B., Arbelez, P.: Hypercolumns for object segmentation and fine-grained localization, eprint (2014). arXiv:1411.5752
Darrell, T., Malik, J., Girshick, R., Donahue, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
Wang, L.-M., Shi, J., Song, G., Shen, I.-F.: Object detection combining recognition and segmentation. In: Yagi, Y., Kang, S.B., Kweon, I.S., Zha, H. (eds.) ACCV 2007, Part I. LNCS, vol. 4843, pp. 189–199. Springer, Heidelberg (2007)
Chapter Google Scholar
Tang, X., Wang, X.: Pedestrian parsing via deep decompositional network. In: IEEE International Conference on Computer Vision (2013)
Google Scholar
Simon, M., Yang, J., Safar, Y.: Max-margin boltzmann machines for object segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (2014)
Google Scholar
Fowlkes, C., Malik, J., Arbelez, P., Maire, M.: Contour detection and hierarchical image segmentation. In: IEEE Transaction on Software Engineering (2011)
Google Scholar
Puzicha, J., Belongie, S., Malik, J.: Shape matching and object recognition using shape contexts. In: IEEE International Conference on Computer Science and Information Technology (2002)
Google Scholar
Liu, S., Guo, X., Lin, L., Cao, X., Zhang, H.: Sym-fish: a symmetry-aware flip invariant sketch histogram shape descriptor. In: IEEE International Conference on Computer Vision (2013)
Google Scholar
Nowozin, S., Kim, S., Yoo, C.D., Kohli, P.: Image segmentation using higher-order correlation clustering. IEEE Trans. Pattern Anal. Mach. Intell. 36(9), 1761–1774 (2014)
Article Google Scholar
Balan, A.O., Sigal, L., Black, M.J.: Humaneva: synchronized video and motion capture dataset for evaluation of articulated human motion. In: IEEE International Conference on Computer Vision (2006)
Google Scholar

Download references

Acknowledgements

This work is supported by the National High Technology Development Plan (863 Plan) under Grant No. 2011AA01A205, the National Significant Science and Technology Projects of China under Grant No. 2013ZX01039001-002-003, the NSFC project under Grant No. U1433112 and No. 61170253. We also thank the support from the academic program of Tencent Inc.

Author information

Authors and Affiliations

Department of Computer Science, Tsinghua University, Beijing, China
Qiurui Wang
Graduate School at Shenzhen, Tsinghua University, Shenzhen, China
Qiurui Wang & Chun Yuan
BestImage Team, Tencent, Shanghai, China
Feiyue Huang & Chengjie Wang

Authors

Qiurui Wang
View author publications
You can also search for this author in PubMed Google Scholar
Chun Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Feiyue Huang
View author publications
You can also search for this author in PubMed Google Scholar
Chengjie Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chun Yuan .

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology, Gwangju, Korea (Republic of)
Yo-Sung Ho
Chinese Academy of Sciences, Institute of Automation, Beijing, China
Jitao Sang
ICU, IVY Lab, KAIST, Daejeon, Korea (Republic of)
Yong Man Ro
KAIST, Daejeon, Korea (Republic of)
Junmo Kim
College of Computer Science, Zhejiang University, Hangzhou, China
Fei Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Q., Yuan, C., Huang, F., Wang, C. (2015). Human Parsing via Shape Boltzmann Machine Networks. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_63

Download citation

DOI: https://doi.org/10.1007/978-3-319-24075-6_63
Published: 22 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24074-9
Online ISBN: 978-3-319-24075-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics