Learning a Person-Independent Representation for Precise 3D Pose Estimation

Yan, Shuicheng; Zhang, Zhenqiu; Fu, Yun; Hu, Yuxiao; Tu, Jilin; Huang, Thomas

doi:10.1007/978-3-540-68585-2_28

Shuicheng Yan¹,
Zhenqiu Zhang¹,
Yun Fu¹,
Yuxiao Hu¹,
Jilin Tu¹ &
…
Thomas Huang¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4625))

Included in the following conference series:

1282 Accesses
7 Citations

Abstract

Precise 3D pose estimation plays a significant role in developing human-computer interfaces and practical face recognition systems. This task is challenging due to the personality in pose variation for a certain subject. In this work, the pose data space is considered as a union of the submanifolds which characterize different subjects, instead of a single continuous manifold as conventionally regarded. A novel manifold embedding algorithm dually supervised by subjects and poses, called Synchronized Submanifold Embedding (SSE), is proposed for person-independent precise pose estimation. First, the submanifold of a certain subject is approximated as a set of simplexes constructed using neighboring samples. Then, these simplexized submanifolds from different subjects are embedded by synchronizing the locally propagated poses within the simplexes and at the same time maximizing the intra-submanifold variances. Finally, the pose of a new datum is estimated as the median of the poses for the nearest neighbors in the dimensionality reduced feature space. The experiments on the 3D pose estimation database, CHIL data for CLEAR07 evaluation demonstrate the effectiveness of our proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ba, S., Dobez, J.: A Probabilistic Framework for Joint Head Tracking and Pose Estimation. In: Proceedings of International Conference on Pattern Recognition, vol. 4, pp. 264–267 (2004)
Google Scholar
Belkin, M., Niyogi, P.: Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering. Advances in Neural Information Processing System, 585–591 (2001)
Google Scholar
Bregler, C., Omohundro, S.: Nonlinear image interpolation using manifold learning. Advances in Neural Information Processing Systems, 973–980 (1995)
Google Scholar
Brown, L., Tian, Y.: Comparative study of coarse head pose estimation. In: Proceedings of IEEE Workshop on Motion and Video Computing, pp. 125–130 (2002)
Google Scholar
Chen, L., Zhang, L., Hu, Y., Li, M., Zhang, H.: Head pose estimation using fisher manifold learning. In: Proceedings of IEEE International Workshop on Analysis and Modeling of Faces and Gestures, pp. 203–207 (2003)
Google Scholar
Cootes, T., Edwards, G., Taylor, C.: Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(6), 681–685 (2001)
Article Google Scholar
Fu, Y., Huang, T.: Graph embedded analysis for head pose estimation. In: Proceddings of the 7th International Conference on Automatic Face and Gesture Recognition, pp. 3–8 (2006)
Google Scholar
Fukunnaga, K.: Introduction to Statistical Pattern Recognition, 2nd edn. Academic Press, London (1991)
Google Scholar
He, X., Yan, S., Hu, Y., Niyogi, P., Zhang, H.: Face Recognition Using Laplacianfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(3), 328–340 (2005)
Article Google Scholar
Hu, N., Huang, W., Ranganath, S.: Head pose estimation by non-linear embedding and mapping. In: Proceedings of IEEE International Conference on Image Processing, pp. 342–345 (2005)
Google Scholar
Jolliffe, I.: Principal Component Analysis. Springer, Heidelberg (1986)
Google Scholar
Lanitis, A., Draganova, C., Christodoulou, C.: Comparing different classifiers for automatic age estimation. IEEE Transactions on Systems, Man and Cybernetics, Part B 34, 621–628 (2004)
Article Google Scholar
Li, S., Fu, Q., Gu, L., Scholkopf, B., Cheng, Y., Zhang, H.: Kernel machine based learning for multi-view face detection and pose estimation. In: Proceedings of the International Conference on Computer Vision, vol. 2, pp. 674–679 (2001)
Google Scholar
Li, S., Lu, X., Hou, X., Peng, X., Cheng, Q.: Learning multiview face subspaces and facial pose estimation using independent component analysis. IEEE Transactions on Image Processing 14(6), 705–712 (2005)
Article Google Scholar
Munkres, J.: Elements of Algebraic Topology. Perseus Press (1993)
Google Scholar
Raytchev, B., Yoda, I., Sakaue, K.: Head pose estimation by nonlinear manifold learning. In: Proceedings of the 17th International Conference on Pattern Recognition, vol. 4, pp. 1051–4651 (2004)
Google Scholar
Ritte, D., Kouropteva, O., Okun, O., Pietikainen, M., Duin, R.: Supervised locally linear embedding. In: Proceedings of Artificial Neural Networks and Neural Information, pp. 333–341 (2003)
Google Scholar
Roweis, S., Saul, L.: Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science 290(22), 2323–2326 (2000)
Article Google Scholar
Saul, L., Roweis, S.: Think Globally, Fit Locally: Unsupervised Learning of Low Dimensional Manifolds. Journal of Machine Learning Research 4, 119–155 (2003)
Article MathSciNet Google Scholar
Tenenbaum, J., Silva, V., Langford, J.: A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290(22), 2319–2323 (2000)
Article Google Scholar
Tu, J., Fu, Y., Hu, Y., Huang, T.: Evaluation of Head Pose Estimation For Studio Data. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 281–290. Springer, Heidelberg (2007)
Chapter Google Scholar
Turk, M., Pentland, A.: Eigenfaces for recognition. Journal of Cognitive Neuroscience 13, 71–86 (1991)
Article Google Scholar
Wang, H., Ahuja, N.: Facial expression decomposition. In: IEEE International Conference on Computer Vision, vol. 2, pp. 958–965 (2003)
Google Scholar
Weinberger, K., Saul, L.: Unsupervised Learning of Image Manifolds by Semidefinite Programming. In: Proceddings of IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 988–995 (2004)
Google Scholar
Wenzel, M., Schiffmann, W.: Head pose estimation of partially occluded faces. In: Proceeding of the Second Canadian Conference on Computer and Robot Vision, pp. 353–360 (2005)
Google Scholar
Yan, S., Xu, D., Zhang, B., Zhang, H., Yang, Q., Lin, S.: Graph Embedding and Extensions: A General Framework for Dimensionality Reduction. Proc. IEEE Trans. Pattern Analysis and Machine Intelligence 29(1), 40–51 (2007)
Article Google Scholar
Zhao, W., Chellappa, R., Rosenfeld, A., Phillips, P.: Face Recognition: A Literature Survey. ACM Computing Surveys, 399–458 (2003)
Google Scholar
http://isl.ira.uka.de/clear07/?The_Evaluation

Download references

Author information

Authors and Affiliations

ECE Department, University of Illinois at Urbana Champaign, USA
Shuicheng Yan, Zhenqiu Zhang, Yun Fu, Yuxiao Hu, Jilin Tu & Thomas Huang

Authors

Shuicheng Yan
View author publications
You can also search for this author in PubMed Google Scholar
Zhenqiu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yun Fu
View author publications
You can also search for this author in PubMed Google Scholar
Yuxiao Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jilin Tu
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Huang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Rainer Stiefelhagen Rachel Bowers Jonathan Fiscus

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yan, S., Zhang, Z., Fu, Y., Hu, Y., Tu, J., Huang, T. (2008). Learning a Person-Independent Representation for Precise 3D Pose Estimation. In: Stiefelhagen, R., Bowers, R., Fiscus, J. (eds) Multimodal Technologies for Perception of Humans. RT CLEAR 2007 2007. Lecture Notes in Computer Science, vol 4625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68585-2_28

Download citation

DOI: https://doi.org/10.1007/978-3-540-68585-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68584-5
Online ISBN: 978-3-540-68585-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics