Recent Developments on 2D Pose Estimation From Monocular Images

Bąk, Artur; Kulbacki, Marek; Segen, Jakub; Świątkowski, Dawid; Wereszczyński, Kamil

doi:10.1007/978-3-662-49390-8_43

Artur Bąk⁸,
Marek Kulbacki⁸,
Jakub Segen⁸,
Dawid Świątkowski⁸ &
…
Kamil Wereszczyński^8,9

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9622))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

1560 Accesses

Abstract

Human pose estimation from monocular images is one of the most significant aspects of modern computer vision tasks and its application demand is still increasing in such areas as automatic images indexing or human activity recognition from video. Among many approaches applied in these areas the one based on pose estimation gives, beyond all doubts, one of the most powerful representation of human on the picture in sense of sparsity and semantics. In this paper we provide a detailed survey of the most efficient methods in 2D pose estimation domain as well as the test results of selected methods on the LSP dataset, which is commonly used by state-of-the-art works.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. IJCV 61(1), 55–79 (2005)
Article Google Scholar
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of parts. In: CVPR, pp. 1385–1392 (2011)
Google Scholar
Wang, F., Li, Y.: Beyond physical connections: tree models in human pose estimation. In: CVPR, pp. 596–603 (2013)
Google Scholar
Tian, Y., Zitnick, C.L., Narasimhan, S.G.: Exploring the spatial hierarchy of mixture models for human pose estimation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 256–269. Springer, Heidelberg (2012)
Chapter Google Scholar
Johnson, S., Everingham, M.: Learning effective human pose estimation from inaccurate annotation. In: CVPR, pp. 1465–1472 (2011)
Google Scholar
Bourdev, L., Malik., J.: Poselets: body part detectors trained using 3D human pose annotations. In: ICCV, pp. 1365–1372 (2009)
Google Scholar
Gkioxari, G., Hariharan, B., Girshick, R., Malik, J.: Using k-poselets for detecting people and localizing their keypoints. In: CVPR, pp. 3582–3589 (2014)
Google Scholar
Pishchulin, L., Andriluka., M., Gehler, P., Schiele, B.: Poselet conditioned pictorial structures. In: CVPR, pp. 588–595 (2013)
Google Scholar
Andriluka, M., Pishchulin, L., Gehler, P., Schiele, B.: 2D human pose estimation: new benchmark and state of the art analysis. In: CVPR, pp. 3686–3693 (2014)
Google Scholar
Liu, Z., Zhu, J., Bu, J., Chen, C.: A survey of human pose estimation: the body parts parsing based methods. JVCI 32, 10–19 (2015)
Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. PAMI 32(9), 1627–1645 (2010)
Article Google Scholar
Maji, S., Malik, J.: Object detection using a max-margin hough tranform. In: CVPR, pp. 1038–1045 (2009)
Google Scholar
Ferrari, V., Marin-Jimenez, M., Zisserman, A.: Progressive search space reduction for human pose estimation. In: CVPR, pp. 1–8 (2008)
Google Scholar
Wang, H., Klaser, A., Schmid, C., Liu, C.-L.: Action recognition by dense trajectories. In: CVPR, pp. 3169–3176 (2011)
Google Scholar
Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. IEEE Trans. Comput. 22(1), 67–92 (1973)
Article Google Scholar
Eichner, M., Ferrari, V.: Better appearance models for pictorial structures. In: BMVC, pp. 1–11 (2009)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893 (2005)
Google Scholar
Johnson, S., Everingham, M.: Clustered pose and nonlinear appearance models for human pose estimation. In: BMVC, pp. 1–11 (2010)
Google Scholar
Sapp, B., Taskar, B.: MODEC: multimodal decomposable models for human pose estimation. In: CVPR, pp. 3674–3681 (2013)
Google Scholar
Gkioxari, G., Arbelaez, P., Bourdev, L., Malik, J.: Articulated pose estimation using discriminative armlet classifiers. In: CVPR, pp. 3342–3349 (2013)
Google Scholar
Cherian, A., Mairal, J., Alahari, K., Schmid, C.: Mixing body-part sequences for human pose estimation. In: CVPR, pp. 2361–2368 (2014)
Google Scholar
Wang, C., Wang, Y., Yuille, A.: An approach to pose-based action recognition. In: CVPR, pp. 915–922 (2013)
Google Scholar
Nie, B., Xiong, C., Zhu, S.-C.: Joint action recognition and pose estimation from video. In: CVPR, pp. 1293–1301 (2015)
Google Scholar
Fan, X., Zheng, K., Lin, Y.: Combining local appearance and holistic view: dual-source deep neural networks for human pose estimation. In: CVPR, pp. 1347–1355 (2015)
Google Scholar
Ouyang, W., Chu, X., Wang, X.: Multi-source deep learning for human pose estimation. In: CVPR, pp. 2337–2344 (2014)
Google Scholar
Toshev, A., Szegedy, C.: DeepPose: human pose estimation via deep neural networks. In: CVPR, pp. 1653–1660 (2014)
Google Scholar
Tompson, J., Jain, A., LeCun, Y., Bregler, C.: Joint training of a convolutional network and a graphical model for human pose estimation. In: NIPS, pp. 1799–1807 (2014)
Google Scholar
Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. 39(1), 1–38 (1977)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

This work has been supported by the National Centre for Research and Development (project UOD-DEM-1-183/001 “Intelligent video analysis system for behavior and event recognition in surveillance networks”).

Author information

Authors and Affiliations

Polish-Japanese Academy of Information Technology, Koszykowa 86, 02-008, Warszawa, Poland
Artur Bąk, Marek Kulbacki, Jakub Segen, Dawid Świątkowski & Kamil Wereszczyński
Institute of Informatics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland
Kamil Wereszczyński

Authors

Artur Bąk
View author publications
You can also search for this author in PubMed Google Scholar
Marek Kulbacki
View author publications
You can also search for this author in PubMed Google Scholar
Jakub Segen
View author publications
You can also search for this author in PubMed Google Scholar
Dawid Świątkowski
View author publications
You can also search for this author in PubMed Google Scholar
Kamil Wereszczyński
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marek Kulbacki .

Editor information

Editors and Affiliations

Wrocław University of Technology, Wrocław, Poland
Ngoc Thanh Nguyen
Wrocław University of Technology, Wrocław, Poland
Bogdan Trawiński
Iwate Prefectural University, Takizawa, Japan
Hamido Fujita
National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bąk, A., Kulbacki, M., Segen, J., Świątkowski, D., Wereszczyński, K. (2016). Recent Developments on 2D Pose Estimation From Monocular Images. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, TP. (eds) Intelligent Information and Database Systems. ACIIDS 2016. Lecture Notes in Computer Science(), vol 9622. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-49390-8_43

Download citation

DOI: https://doi.org/10.1007/978-3-662-49390-8_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-49389-2
Online ISBN: 978-3-662-49390-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics