Binocular Full-Body Pose Recognition and Orientation Inference Using Multilinear Analysis

Peng, Bo; Qian, Gang

doi:10.1007/978-1-84882-299-3_10

Bo Peng⁵ &
Gang Qian⁵

Part of the book series: Advances in Pattern Recognition ((ACVPR))

2775 Accesses
5 Citations

Abstract

In this chapter, we propose an approach to full-body pose recognition and body orientation estimation using multilinear analysis. We extract low-dimensional pose and body orientation coefficient vectors by performing tensor decomposition and projection on silhouette images obtained from wide baseline binocular cameras. The coefficient vectors are then used as feature vectors in pose recognition and body orientation estimation. To do pose recognition, pose coefficient vectors obtained from synthesized pose silhouettes are used to train a family of support vector machines as pose classifiers. Using orientation coefficient vectors, a 1-D orientation manifold is learned and further used for the estimation of body orientation. Experiment results obtained using both synthetic and real image data showed that the performance of our approach is comparable to existing pose recognition approaches, and that our approach outperformed the traditional tensor-based recognition approach in the comparative test.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Banerjee, A., Dhillon, I.S., Ghosh, J., Sra, S.: Clustering on the unit hypersphere using von mises-fisher distributions. The Journal of Machine Learning Research 6, 1345–1382 (2005)
MathSciNet Google Scholar
Boulay, B., Bremond, F., Thonnat, M.: Applying 3d human model in a pose recognition system. Pattern Recognition Letters 27(15), 1788–1796 (2006)
Article Google Scholar
Camurri, A., Hashimoto, S., Ricchetti, M., Ricci, A., Suzuki, K., Trocca, R., Volpe, G.: Eyesweb: Toward gesture and affect recognition in interactive dance and music systems. Computer Music Journal 24(1), 57–69 (2000)
Article Google Scholar
Cheung, K.M., Baker, S., Kanade, T.: Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture. In: Proc. CVPR, pp. 77–84 (2003)
Google Scholar
Chu, C., Cohen, I.: Pose and gesture recognition using 3d body shapes decomposition. In: Proc. CVPR, pp. 69–78 (2005)
Google Scholar
Elden, L.: Matrix Methods in Data Mining and Pattern Recognition. SIAM, Philadelphia (2007)
Google Scholar
Elgammal, A., Lee, C.: Inferring 3d body pose from silhouettes using activity manifold learning. In: Proc. CVPR, vol.2, pp. 681–688 (2004)
Google Scholar
Guo, F., Qian, G.: Dance pose recognition using wide-baseline orthogonal stereo cameras. In: Proc. FGR, pp. 481–486 (2006)
Google Scholar
Guo, P., Miao, Z., Yuan, Y.: Posture and activity recognition using projection histogram and pca methods. In: Proc. CISP, pp. 397–401 (2008)
Google Scholar
Howe, N.R.: Silhouette lookup for monocular 3d pose tracking. Image and Vision Computing 25(3), 331–341 (2007)
Article MathSciNet Google Scholar
Hu, J.S., Su, T.M., Lin, P.C.: 3-d human posture recognition system using 2-d shape features. In: Proc. ICRA, pp. 3933–3938 (2007)
Google Scholar
Huang, F., Di, H., Xu, G.: Viewpoint insensitive pose representation for action recognition. In: Proc. AMDO, pp. 143–152 (2006)
Google Scholar
Mikic, I., Trivedi, M.M., Hunter, E., Cosman, P.C.: Human body model acquisition and tracking using voxel data. International Journal of Computer Vision 53(3), 199–223 (2003)
Article Google Scholar
James, J., Ingalls, T., Qian, G., Olsen, L., Whiteley, D., Wong, S., Rikakis., T.: Movement-based interactive dance performance. In: Proc. MULTIMEDIA, pp. 470–480 (2006)
Google Scholar
Jenkins, O.C., González, G., Loper, M.M.: Tracking human motion and actions for interactive robots. In: HRI ’07: Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, pp. 365–372. ACM, New York, NY, USA (2007). doi http://doi.acm.org/10.1145/1228716.1228765
Kakadiaris, I.A., Metaxas, D.: Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In: Proc. CVPR, pp. 81–87 (1996)
Google Scholar
Kiers, H.A.L.: An alternating least squares algorithms for parafac2 and three-way dedicom. Computational Statistics & Data Analysis 16(1), 103–118 (1993)
Article MATH Google Scholar
Kjolberg, J.: Designing full body movement interaction using modern dance as a starting point. In: Proc. DIS, pp. 353–356 (2004)
Google Scholar
Lathauwer, L.D., Moor, B.D., Vandewalle, J.: A multilinear singular value decomposition. SIAM Journal on Matrix Analysis and Applications 21(4), 1253–1278 (2000)
Article MATH MathSciNet Google Scholar
Lee, C.S., Elgammal, A.: Modeling view and posture manifolds for tracking. In: Proc. ICCV, pp. 1–8 (2007)
Google Scholar
Lee, M.W., Nevatia, R.: Integrating component cues for human pose tracking. In: Proc. Joint IEEE Int. Workshop on VS-PETS, pp. 41–48 (2005)
Google Scholar
Lee, S.W.: Automatic gesture recognition for intelligent human-robot interaction. In: Proc. FGR, pp. 645–650 (2006)
Google Scholar
Levenberg, K.: A method for the solution of certain non-linear problems in least squares. The Quarterly of Applied Mathematics 2, 164–168 (1944)
MATH MathSciNet Google Scholar
Li, C.C., Chen, Y.Y.: Human posture recognition by simple rules. newblock In: Proc. SMC, pp. 3237–3240 (2006)
Google Scholar
Li, R., Yang, M.H., Sclaro, S., Tian, T.P.: Monocular tracking of 3d human motion with a coordinated mixture of factor analyzers. In: Proc. ECCV, pp. 137–150 (2006)
Google Scholar
Mitra, S., Acharya, T.: Gesture recognition: A survey. Systems, Man, and Cybernetics, Part C: Applications and Reviews 37(3), 311–324 (2007)
Google Scholar
Moeslund, T.B., Granum, E.: A survey of computer vision-based human motion capture. Computer Vision and Image Understanding 81(3), 231–268 (2001)
Article MATH Google Scholar
Moeslund, T.B., Hilton, A., Kruger, V.: A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding 104(2), 90–126 (2006)
Article Google Scholar
Navaratnam, R., Thayananthan, A., Torr, P., Cipolla, R.: Hierarchical part-based human body pose estimation. In: Proc. British Machine Vision Conference (2005)
Google Scholar
Ong, S.C., Ranganath, S.: Automatic sign language analysis: a survey and the future beyond lexical meaning. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(6), 873–891 (2005). doi 10.1109/TPAMI.2005.112
Article Google Scholar
Qian, G., Guo, F., Ingalls, T., Olson, L., James, J., Rikakis, T.: A gesture-driven multimodal interactive dance system. In: Proc. ICME, pp. 1579–1582 (2004)
Google Scholar
Rosales, R., Sclaro, S.: Learning body pose via specialized maps. In: Proc. Conference on Neural Information Processing Systems, pp. 1263–1270 (2002)
Google Scholar
Sul, C., Lee, K., Wohn, K.: Virtual stage: A location-based karaoke system. IEEE Multimedia 05(2), 42–52 (1998)
Article Google Scholar
Urano, T., Matsui, T., Nakata, T., Mizoguchi, H.: Human pose recognition by memory-based hierarchical feature matching. In: Proc. SMC, pp. 6412–6416 (2004)
Google Scholar
Urtasun, D.F.R., Fua, P.: 3d people tracking with gaussian process dynamical models. In: Proc. CVPR, pp. 238–245 (2006)
Google Scholar
Vasilescu, M.A.O., Terzopoulos, D.: Multilinear analysis of image ensembles: Tensorfaces. In: Proc. ECCV, pp. 447–460 (2002)
Google Scholar
Vasilescu, M.A.O., Terzopoulos, D.: Tensortextures: Multilinear image-based rendering. ACM Transactions on Graphics 23(3), 334–340 (2004)
Article Google Scholar
Vlasic, D., Brand, M., Pfister, H., Popovi, J.: Face transfer with multilinear models. In: Proc. ACM SIGGRAPH, pp. 426–433 (2005)
Google Scholar
Wang, W.L., Tan, T.: Recent development in human motion analysis. Pattern Recognition 36, 585–601 (2003)
Article Google Scholar
Wu, Y., Huang, T.S.: Vision-based gesture recognition: A review. In: GW ’99: Proceedings of the International Gesture Workshop on Gesture-Based Communication in Human-Computer Interaction, pp. 103–115. Springer-Verlag, London, UK (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Arts, Media & Engineering Program, and Department of Electrical Engineering, Arizona State University, Tempe, AZ 85287
Bo Peng & Gang Qian

Authors

Bo Peng
View author publications
You can also search for this author in PubMed Google Scholar
Gang Qian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bo Peng .

Editor information

Editors and Affiliations

Escuela Técnica Superior de Ingenieros de Telecomunicacíon, Universidad de Valladolid, Campus Miguel Delibes, 47011 Valladolid, Spain
Santiago Aja-Fernández & Rodrigo de Luis García &
Dept. Computing, HongKong Polytechnic University, Hong Kong
Dacheng Tao
University of London Birkbeck College School of Computer Science & Information Systems, Malet Street, London, WC1E 7HX, UK
Xuelong Li

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Peng, B., Qian, G. (2009). Binocular Full-Body Pose Recognition and Orientation Inference Using Multilinear Analysis. In: Aja-Fernández, S., de Luis García, R., Tao, D., Li, X. (eds) Tensors in Image Processing and Computer Vision. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84882-299-3_10

Download citation

DOI: https://doi.org/10.1007/978-1-84882-299-3_10
Publisher Name: Springer, London
Print ISBN: 978-1-84882-298-6
Online ISBN: 978-1-84882-299-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics