Gaussian Process Latent Variable Models for Human Pose Estimation

Ek, Carl Henrik; Torr, Philip H. S.; Lawrence, Neil D.

doi:10.1007/978-3-540-78155-4_12

Gaussian Process Latent Variable Models for Human Pose Estimation

Carl Henrik Ek¹,
Philip H. S. Torr¹ &
Neil D. Lawrence²

Conference paper

1561 Accesses
36 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4892))

Abstract

We describe a method for recovering 3D human body pose from silhouettes. Our model is based on learning a latent space using the Gaussian Process Latent Variable Model (GP-LVM) [1] encapsulating both pose and silhouette features Our method is generative, this allows us to model the ambiguities of a silhouette representation in a principled way. We learn a dynamical model over the latent space which allows us to disambiguate between ambiguous silhouettes by temporal consistency. The model has only two free parameters and has several advantages over both regression approaches and other generative methods. In addition to the application shown in this paper the suggested model is easily extended to multiple observation spaces without constraints on type.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lawrence, N.D.: Probabilistic non-linear principal component analysis with gaussian process latent variable models. Journal of Machine Learning Research 6, 1783–1816 (2005)
MathSciNet Google Scholar
Agarwal, A., Triggs, B.: Recovering 3d human pose from monocular images. IEEE Trans. Pattern Anal. Mach. Intell. 28(1), 44–58 (2006)
Article Google Scholar
Grauman, K., Shakhnarovich, G., Darrell, T.: Inferring 3d structure with a statistical image-based shape model. In: ICCV 2003, pp. 641–648 (2003)
Google Scholar
Kehl, R., Bray, M., Gool, L.J.V.: Full body tracking from multiple views using stochastic sampling. In: CVPR(2), pp. 129–136 (2005)
Google Scholar
Sminchisescu, C., Kanaujia, A., Li, Z., Metaxas, D.N.: Discriminative density propagation for 3d human motion estimation. In: CVPR (1), pp. 390–397 (2005)
Google Scholar
Sminchisescu, C., Telea, A.: Human pose estimation from silhouettes - a consistent approach using distance level sets. In: WSCG, pp. 413–420 (2002)
Google Scholar
Sidenbladh, H., Black, M.J., Fleet, D.J.: Stochastic tracking of 3d human figures using 2d image motion. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 702–718. Springer, Heidelberg (2000)
Chapter Google Scholar
de Campos, T.E., Murray, D.W.: Regression-based hand pose estimation from multiple cameras. In: CVPR(1), pp. 782–789 (2006)
Google Scholar
Sun, Y., Bray, M., Thayananthan, A., Yuan, B., Torr, P.: Regression-based human motion capture from voxel data. In: BMVC (2006)
Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape context: A new descriptor for shape matching and object recognition. In: NIPS, pp. 831–837 (2000)
Google Scholar
Mori, G., Belongie, S.J., Malik, J.: Efficient shape matching using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 27(11), 1832–1837 (2005)
Article Google Scholar
Rasmussen, C.E., Williams, C.K.: Gaussian Processes for Machine Learning. MIT Press, Cambridge (2006)
Google Scholar
Lawrence, N.D.: Gaussian process latent variable models for visualisation of high dimensional data. In: NIPS (2003)
Google Scholar
Lawrence, N.D., Candela, J.Q.: Local distance preservation in the gp-lvm through back constraints. In: ICML, pp. 513–520 (2006)
Google Scholar
Wang, J., Fleet, D.J., Hertzmann, A.: Gaussian process dynamical models. In: NIPS (2005)
Google Scholar
Shon, A.P., Grochow, K., Hertzmann, A., Rao, R.P.N.: Learning shared latent structure for image synthesis and robotic imitation. In: NIPS (2005)
Google Scholar
Viterbi, A.J.: Error bounds for convolutional codes and an asymptotical optimum decoding algorithm. IEEE Transactions on Information Theory (1967)
Google Scholar
Shakhnarovich, G., Viola, P.A., Darrell, T.: Fast pose estimation with parameter-sensitive hashing. In: ICCV, pp. 750–759 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, Oxford Brookes University, United Kingdom
Carl Henrik Ek & Philip H. S. Torr
School of Computer Science, University of Manchester, United Kingdom
Neil D. Lawrence

Authors

Carl Henrik Ek
View author publications
You can also search for this author in PubMed Google Scholar
Philip H. S. Torr
View author publications
You can also search for this author in PubMed Google Scholar
Neil D. Lawrence
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Andrei Popescu-Belis Steve Renals Hervé Bourlard

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ek, C.H., Torr, P.H.S., Lawrence, N.D. (2008). Gaussian Process Latent Variable Models for Human Pose Estimation. In: Popescu-Belis, A., Renals, S., Bourlard, H. (eds) Machine Learning for Multimodal Interaction. MLMI 2007. Lecture Notes in Computer Science, vol 4892. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78155-4_12

Download citation

DOI: https://doi.org/10.1007/978-3-540-78155-4_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78154-7
Online ISBN: 978-3-540-78155-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics