Abstract
This paper presents a human action recognition algorithm using a depth image. First, 3D coordinates of the body’s joints of each frame are generated from the depth image. Then, the proposed method applies normalization and quantization processes to the body joints of all frames of the action video to obtain a 3D histogram. The histogram is projected onto xy, xz, and yz plans sequentially and combined into a one-dimensional feature vector. For dimension reduction, the principal component analysis (PCA) technique is applied to the feature vector to generate an action descriptor. To further improve the recognition performance, a decision tree method is developed to divide input actions into four main categories. The action description vectors of each category are used to design its respective support vector machine (SVM) classifier. Each SVM classifies the actions of a category into one type of actions. Experimental results verify that our approach effectively rules out the interference of background and improves the recognition rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fujiyoshi H, Lipton AJ (1998) Real-time human motion analysis by image skeletonization. In: Fourth IEEE workshop on applications of computer vision, WACV 1998, pp 15–21
Bobick F, Davis JW (2001) The recognition of human movement using temporal templates. IEEE Trans Pattern Anal Mach Intell 23:257–267
Li X (2007) HMM based action recognition using oriented histograms of optical flow field. Electron Lett 43:560–561
Jianhao D et al (2010) Extraction of human body skeleton based on silhouette images. In: 2010 second international workshop on education technology and computer science (ETCS), pp 71–74
Cherla S et al (2008) Towards fast, view-invariant human action recognition. In: IEEE computer society conference on computer vision and pattern recognition workshops, 2008, CVPRW ‘08. pp 1–8
Yang X et al (2012) Recognizing actions using depth motion maps-based histograms of oriented gradients. In: The 20th ACM international conference on multimedia, pp 1057–1060, Nara, Japan
Zainordin FD et al (2012) Human pose recognition using Kinect and rule-based system. In: World automation congress (WAC), 2012, pp 1–6
Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Hwang WY, Hsieh CH (2010) Human action recognition using 3-D HOG and key-point distribution. In: Proceedings of 23th IPPR conference on computer vision, graphics and image processing (CVGIP2010), Taiwan
Kläser A et al (2008) A spatio-temporal descriptor based on 3d-gradients. In: BMVC, pp 99.1–99.10
Acknowledgements
This research is supported in part by the National Science Council, Taiwan under the grants of NSC 99-2632-E130-001-MY3 and NSC 99-2221-E130-011-MY3.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media Dordrecht
About this paper
Cite this paper
Hsieh, CH., Huang, CP., Hung, J.M. (2013). Human Action Recognition Using Depth Images. In: Park, J., Barolli, L., Xhafa, F., Jeong, HY. (eds) Information Technology Convergence. Lecture Notes in Electrical Engineering, vol 253. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6996-0_8
Download citation
DOI: https://doi.org/10.1007/978-94-007-6996-0_8
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-6995-3
Online ISBN: 978-94-007-6996-0
eBook Packages: EngineeringEngineering (R0)