3D Human Activity Recognition Using Skeletal Data from RGBD Sensors

Ling, Jiaxu; Tian, Lihua; Li, Chen

doi:10.1007/978-3-319-50832-0_14

Jiaxu Ling²⁵,
Lihua Tian²⁵ &
Chen Li²⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10073))

Included in the following conference series:

International Symposium on Visual Computing

5333 Accesses
4 Citations

Abstract

In this paper, a new effective method was proposed to recognize human actions based on RGBD data sensed by a depth camera, namely Microsoft Kinect. Skeleton data extracted from depth images was utilized to generate 10 direction features which represent specific body parts and 11 position features which represent specific human joints. The fusion features composed of both was used to represent a human posture. An algorithm based on the difference level of adjacent postures was presented to select the key postures from an action. Finally, the action features, composed of the key postures’ features, were classified and recognized by a multiclass Support Vector Machine. Our major contributions are proposing a new framework to recognize the users’ actions and a simple and effective method to select the key postures. The recognition results in the KARD dataset and the Florence 3D Action dataset show that our approach significantly outperforms the compared methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Baak, A., et al.: A data-driven approach for real-time full body pose reconstruction from a depth camera. In: Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K. (eds.) Consumer Depth Cameras for Computer Vision, pp. 1092–1099. Springer, London (2013)
Google Scholar
Hadfield, S., Bowden, R.: Kinecting the dots: particle based scene flow from depth sensors. In: IEEE Proceedings, vol. 58, no. 11, pp. 2290–2295 (2011)
Google Scholar
Bagdanov, A.D., Del Bimbo, A., et al.: Real-time hand status recognition from RGB-D imagery. International Conference on Pattern Recognition IEEE, pp. 2456–2459 (2012)
Google Scholar
Berretti, S., Bimbo, A.D., Pala, P.: Superfaces: a super-resolution model for 3D faces. In: Fusiello, A., Murino, V., Cucchiara, R. (eds.) ECCV 2012. LNCS, vol. 7583, pp. 73–82. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33863-2_8
Chapter Google Scholar
Gasparrini, S., et al.: Performance analysis of self-organising neural networks tracking algorithms for intake monitoring using kinect. In: IET International Conference on Technologies for Active and Assisted Living. IET (2015)
Google Scholar
Shotton, J., et al.: Real-time human pose recognition in parts from single depth images. Postgrad. Med. J. 56(1), 1297–1304 (2011)
Google Scholar
Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden Markov model. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 379–385 (1992)
Google Scholar
Kellokumpu, V., Pietikäinen, M., Heikkilä, J.: Human activity recognition using sequences of postures. IAPR Conference on Machine Vision Applications, pp. 570–573 (2005)
Google Scholar
Scholkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)
Google Scholar
Yang, X., Zhang, C., Tian, Y.L.: Recognizing actions using depth motion maps-based histograms of oriented gradients. In: ACM International Conference on Multimedia, pp. 1057–1060. ACM (2012)
Google Scholar
Oreifej, O., Liu, Z.: HON4D: histogram of oriented 4D normals for activity recognition from depth sequences. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 716–723. IEEE (2013)
Google Scholar
Gan, L., Chen, F.: Human action recognition using APJ3D and random forests. J. Softw. 8(9), 2238–2245 (2013)
Article Google Scholar
Wang, J., Liu, Z., Wu, Y., Yuan, Y.: Mining actionlet ensemble for action recognition with depth cameras. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1290–1297 (2012)
Google Scholar
Althloothi, S., et al.: Human activity recognition using multi-features and multiple kernel learning. Pattern Recogn. 47(5), 1800–1812 (2014)
Article Google Scholar
Ohn-Bar, E., Trivedi, M. M.: Joint angles similarities and HOG2 for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 465–470. IEEE Computer Society (2013)
Google Scholar
Zhang, Z., et al.: A novel method for user-defined human posture recognition using Kinect. In: International Congress on Image and Signal Processing, pp. 736–740. IEEE (2014)
Google Scholar
Gaglio, S., Re, G.L., Morana, M.: Human activity recognition process using 3-D posture data. IEEE Trans. Hum.-Mach. Syst. 45(5), 1–12 (2014)
Google Scholar
Hsu, C.W., Lin, C.J.: Errata to “a comparison of methods for multiclass support vector machines”. IEEE Trans. Neural Netw. 13(4), 415–425 (2002)
MathSciNet Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 389–396 (2011)
Article Google Scholar
Seidenari, L., et al.: Recognizing actions from depth cameras as weakly aligned multi-part bag-of-poses. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 479–485. IEEE Computer Society (2013)
Google Scholar
Devanne, M., et al.: 3-D human action recognition by shape analysis of motion trajectories on riemannian manifold. IEEE Trans. Syst. Man Cybern. 45(7), 1023–1029 (2014)
Google Scholar

Download references

Acknowledgment

This work is supported by the National Natural Science Foundation of China under Grant No. 61403302 and the Fundamental Research Funds for the Central Universities No. XJJ2016029.

Author information

Authors and Affiliations

School of Software Engineering, Xi’an Jiaotong University, Xi’an, China
Jiaxu Ling, Lihua Tian & Chen Li

Authors

Jiaxu Ling
View author publications
You can also search for this author in PubMed Google Scholar
Lihua Tian
View author publications
You can also search for this author in PubMed Google Scholar
Chen Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen Li .

Editor information

Editors and Affiliations

University of Nevada, Reno, Nevada, USA
George Bebis
NASA Ames Research Center, Moffett Field, California, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, California, USA
Bahram Parvin
Desert Research Institute, Reno, Nevada, USA
Darko Koracin
The Australian National University, O’Malley, Aust Capital Terr, Australia
Fatih Porikli
Pilot AI Labs, Redwood City, California, USA
Sandra Skaff
University of Florida, Gainesville, Florida, USA
Alireza Entezari
Google Inc., Mountain View, California, USA
Jianyuan Min
Osaka University, Osaka, Japan
Daisuke Iwai
The MOVES Institute, Monterey, California, USA
Amela Sadagic
University of Arizona, Tucson, Arizona, USA
Carlos Scheidegger
Université Paris-Sud, Orsay, France
Tobias Isenberg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ling, J., Tian, L., Li, C. (2016). 3D Human Activity Recognition Using Skeletal Data from RGBD Sensors. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2016. Lecture Notes in Computer Science(), vol 10073. Springer, Cham. https://doi.org/10.1007/978-3-319-50832-0_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-50832-0_14
Published: 10 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50831-3
Online ISBN: 978-3-319-50832-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics