Abstract
Tracking human poses in video is a challenging problem with numerous applications. The task is particularly difficult in realistic scenes because of several intrinsic and extrinsic factors, including complicated and fast movements, occlusions, and lighting changes. We propose an online learning approach for tracking human poses using a latent structured Support Vector Machine (SVM). The first frame of a video is used for training: users initialize the body parts, and the tracking models are learned with the latent structured SVM. The models are then updated at each subsequent frame of the video sequence. To handle occlusion, we formulate a Prize-Collecting Steiner Tree (PCST) problem and use a branch-and-cut algorithm to refine the detection of body parts. Experiments on several challenging videos demonstrate that the proposed method outperforms two state-of-the-art methods.
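For orientation, the generic latent structural SVM objective can be written as below. This is a sketch of the commonly used formulation, not necessarily the exact objective of this paper: the abstract does not say how the feature map, the loss, or the latent variables are instantiated. All symbols are assumptions introduced for illustration: $w$ is the model weight vector, $x_i$ an input (e.g. a frame), $y_i$ its structured label (e.g. a pose), $h$ a latent variable, $\Phi$ a joint feature map, $\Delta$ a task loss, and $C$ a regularization trade-off.

```latex
\min_{w}\;
  \frac{1}{2}\lVert w \rVert^{2}
  + C \sum_{i=1}^{n}
    \Big[
      \max_{(\hat{y},\hat{h}) \in \mathcal{Y}\times\mathcal{H}}
        \big( w^{\top}\Phi(x_i,\hat{y},\hat{h}) + \Delta(y_i,\hat{y},\hat{h}) \big)
      \;-\;
      \max_{h \in \mathcal{H}} w^{\top}\Phi(x_i,y_i,h)
    \Big]
```

The first inner maximization is loss-augmented inference over labels and latent variables; the second completes the latent variables for the ground-truth label. In an online setting such as the one described above, one would expect the sum to be revisited as each new frame arrives, although the abstract does not specify the update rule.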
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Hua, KL., Sari, I.N., Yeh, MC. (2017). Human Pose Tracking Using Online Latent Structured Support Vector Machine. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science, vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_51
DOI: https://doi.org/10.1007/978-3-319-51811-4_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51810-7
Online ISBN: 978-3-319-51811-4
eBook Packages: Computer Science, Computer Science (R0)