Abstract
In this chapter, we propose an approach to full-body pose recognition and body orientation estimation using multilinear analysis. We extract low-dimensional pose and body orientation coefficient vectors by performing tensor decomposition and projection on silhouette images obtained from wide baseline binocular cameras. The coefficient vectors are then used as feature vectors in pose recognition and body orientation estimation. To do pose recognition, pose coefficient vectors obtained from synthesized pose silhouettes are used to train a family of support vector machines as pose classifiers. Using orientation coefficient vectors, a 1-D orientation manifold is learned and further used for the estimation of body orientation. Experiment results obtained using both synthetic and real image data showed that the performance of our approach is comparable to existing pose recognition approaches, and that our approach outperformed the traditional tensor-based recognition approach in the comparative test.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Banerjee, A., Dhillon, I.S., Ghosh, J., Sra, S.: Clustering on the unit hypersphere using von mises-fisher distributions. The Journal of Machine Learning Research 6, 1345–1382 (2005)
Boulay, B., Bremond, F., Thonnat, M.: Applying 3d human model in a pose recognition system. Pattern Recognition Letters 27(15), 1788–1796 (2006)
Camurri, A., Hashimoto, S., Ricchetti, M., Ricci, A., Suzuki, K., Trocca, R., Volpe, G.: Eyesweb: Toward gesture and affect recognition in interactive dance and music systems. Computer Music Journal 24(1), 57–69 (2000)
Cheung, K.M., Baker, S., Kanade, T.: Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture. In: Proc. CVPR, pp. 77–84 (2003)
Chu, C., Cohen, I.: Pose and gesture recognition using 3d body shapes decomposition. In: Proc. CVPR, pp. 69–78 (2005)
Elden, L.: Matrix Methods in Data Mining and Pattern Recognition. SIAM, Philadelphia (2007)
Elgammal, A., Lee, C.: Inferring 3d body pose from silhouettes using activity manifold learning. In: Proc. CVPR, vol.2, pp. 681–688 (2004)
Guo, F., Qian, G.: Dance pose recognition using wide-baseline orthogonal stereo cameras. In: Proc. FGR, pp. 481–486 (2006)
Guo, P., Miao, Z., Yuan, Y.: Posture and activity recognition using projection histogram and pca methods. In: Proc. CISP, pp. 397–401 (2008)
Howe, N.R.: Silhouette lookup for monocular 3d pose tracking. Image and Vision Computing 25(3), 331–341 (2007)
Hu, J.S., Su, T.M., Lin, P.C.: 3-d human posture recognition system using 2-d shape features. In: Proc. ICRA, pp. 3933–3938 (2007)
Huang, F., Di, H., Xu, G.: Viewpoint insensitive pose representation for action recognition. In: Proc. AMDO, pp. 143–152 (2006)
Mikic, I., Trivedi, M.M., Hunter, E., Cosman, P.C.: Human body model acquisition and tracking using voxel data. International Journal of Computer Vision 53(3), 199–223 (2003)
James, J., Ingalls, T., Qian, G., Olsen, L., Whiteley, D., Wong, S., Rikakis., T.: Movement-based interactive dance performance. In: Proc. MULTIMEDIA, pp. 470–480 (2006)
Jenkins, O.C., González, G., Loper, M.M.: Tracking human motion and actions for interactive robots. In: HRI ’07: Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, pp. 365–372. ACM, New York, NY, USA (2007). doi http://doi.acm.org/10.1145/1228716.1228765
Kakadiaris, I.A., Metaxas, D.: Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In: Proc. CVPR, pp. 81–87 (1996)
Kiers, H.A.L.: An alternating least squares algorithms for parafac2 and three-way dedicom. Computational Statistics & Data Analysis 16(1), 103–118 (1993)
Kjolberg, J.: Designing full body movement interaction using modern dance as a starting point. In: Proc. DIS, pp. 353–356 (2004)
Lathauwer, L.D., Moor, B.D., Vandewalle, J.: A multilinear singular value decomposition. SIAM Journal on Matrix Analysis and Applications 21(4), 1253–1278 (2000)
Lee, C.S., Elgammal, A.: Modeling view and posture manifolds for tracking. In: Proc. ICCV, pp. 1–8 (2007)
Lee, M.W., Nevatia, R.: Integrating component cues for human pose tracking. In: Proc. Joint IEEE Int. Workshop on VS-PETS, pp. 41–48 (2005)
Lee, S.W.: Automatic gesture recognition for intelligent human-robot interaction. In: Proc. FGR, pp. 645–650 (2006)
Levenberg, K.: A method for the solution of certain non-linear problems in least squares. The Quarterly of Applied Mathematics 2, 164–168 (1944)
Li, C.C., Chen, Y.Y.: Human posture recognition by simple rules. newblock In: Proc. SMC, pp. 3237–3240 (2006)
Li, R., Yang, M.H., Sclaro, S., Tian, T.P.: Monocular tracking of 3d human motion with a coordinated mixture of factor analyzers. In: Proc. ECCV, pp. 137–150 (2006)
Mitra, S., Acharya, T.: Gesture recognition: A survey. Systems, Man, and Cybernetics, Part C: Applications and Reviews 37(3), 311–324 (2007)
Moeslund, T.B., Granum, E.: A survey of computer vision-based human motion capture. Computer Vision and Image Understanding 81(3), 231–268 (2001)
Moeslund, T.B., Hilton, A., Kruger, V.: A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding 104(2), 90–126 (2006)
Navaratnam, R., Thayananthan, A., Torr, P., Cipolla, R.: Hierarchical part-based human body pose estimation. In: Proc. British Machine Vision Conference (2005)
Ong, S.C., Ranganath, S.: Automatic sign language analysis: a survey and the future beyond lexical meaning. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(6), 873–891 (2005). doi 10.1109/TPAMI.2005.112
Qian, G., Guo, F., Ingalls, T., Olson, L., James, J., Rikakis, T.: A gesture-driven multimodal interactive dance system. In: Proc. ICME, pp. 1579–1582 (2004)
Rosales, R., Sclaro, S.: Learning body pose via specialized maps. In: Proc. Conference on Neural Information Processing Systems, pp. 1263–1270 (2002)
Sul, C., Lee, K., Wohn, K.: Virtual stage: A location-based karaoke system. IEEE Multimedia 05(2), 42–52 (1998)
Urano, T., Matsui, T., Nakata, T., Mizoguchi, H.: Human pose recognition by memory-based hierarchical feature matching. In: Proc. SMC, pp. 6412–6416 (2004)
Urtasun, D.F.R., Fua, P.: 3d people tracking with gaussian process dynamical models. In: Proc. CVPR, pp. 238–245 (2006)
Vasilescu, M.A.O., Terzopoulos, D.: Multilinear analysis of image ensembles: Tensorfaces. In: Proc. ECCV, pp. 447–460 (2002)
Vasilescu, M.A.O., Terzopoulos, D.: Tensortextures: Multilinear image-based rendering. ACM Transactions on Graphics 23(3), 334–340 (2004)
Vlasic, D., Brand, M., Pfister, H., Popovi, J.: Face transfer with multilinear models. In: Proc. ACM SIGGRAPH, pp. 426–433 (2005)
Wang, W.L., Tan, T.: Recent development in human motion analysis. Pattern Recognition 36, 585–601 (2003)
Wu, Y., Huang, T.S.: Vision-based gesture recognition: A review. In: GW ’99: Proceedings of the International Gesture Workshop on Gesture-Based Communication in Human-Computer Interaction, pp. 103–115. Springer-Verlag, London, UK (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag London Limited
About this chapter
Cite this chapter
Peng, B., Qian, G. (2009). Binocular Full-Body Pose Recognition and Orientation Inference Using Multilinear Analysis. In: Aja-Fernández, S., de Luis GarcÃa, R., Tao, D., Li, X. (eds) Tensors in Image Processing and Computer Vision. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84882-299-3_10
Download citation
DOI: https://doi.org/10.1007/978-1-84882-299-3_10
Publisher Name: Springer, London
Print ISBN: 978-1-84882-298-6
Online ISBN: 978-1-84882-299-3
eBook Packages: Computer ScienceComputer Science (R0)