Abstract
Hand movement recognition is one of hot research topics in the field of computer vision, which has received extensive research interests. However, current classical hand dance movement recognition has high computational complexity and low accuracy. To address these problems, we present a classical hand dance movement recognition and analysis method based on deep learning. Firstly, our method extracts the key frames from the input classical hand dance video by using an inter frame difference method. Secondly, we use a method based on stacked hourglass network to estimate the 2D hand poses of key frames. Thirdly, a network named HandLinearNet with spatial and channel attention mechanisms is constructed for 3D hand pose estimation. Finally, our method uses ConvLSTM for classical hand dance movement recognition, and outputs corresponding classical hand dance movements. The method can recognize 12 basic classical hand dance movements, where users can better analyze and study classical hand dance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lai, J., Yang, Y.: Key frame extraction based on visual attention model. J. Vis. Commun. Image Represent. 23(1), 114–125 (2012)
Oikonomidis, I., Kyriazis, N., Argyros, A.: Full DoF tracking of a hand interacting with an object by modeling occlusions and physical constraints. In: IEEE International Conference on Computer Vision. IEEE (2011)
Lu, S., Metaxas, D., Samaras, D.: Using multiple cues for hand tracking and model refinement. In: 2013 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE (2013)
Zimmermann, C., Brox, T.: Learning to estimate 3D hand pose from single RGB images. In: IEEE International Conference on Computer Vision. IEEE (2017)
Zhang, X., Li, Q., Mo, H., Zhang, W.: End-to-end hand mesh recovery from a monocular RGB image. In: 2019 IEEE/CVF International Conference on Computer Vision. IEEE (2019)
Ge, L., Ren, Z., Li, Y., Xue, Z.:3D hand shape and pose estimation from a single RGB image. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2019)
Cao, Y., Liu, C., Sheng, Y., Huang, Z., Deng, X.: Action recognition model based on 3D graph convolution and attention enhanced. J. Electron. Inf. Technol. 43(7), 2071–2078 (2021)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Burger, I., Lerasle, F., Infantes, G.: Two-handed gesture recognition and fusion with speech to command a robot. Auton. Robot. 32(2), 129–147 (2012)
Kuremoto, T., Kinoshita, Y., Feng, L., Watanabe, S., Kobayashi, K.: A gesture recognition system with retina-V1 model and one-pass dynamic programming. Neurocomputing 116(2), 291–300 (2013)
Raj, R., Dharan, S., Thomas, S.: Optimal feature selection and classification of Indian classical dance hand gesture dataset. Vis. Comput. 39(9), 4049–4064 (2023)
Ma, J., Lv, Q., Yan, H., Ye, T., Shen, Y., Sun, H.: Color-saliency-aware correlation filters with approximate affine transform for visual tracking. Vis. Comput. 39(9), 4065–4086 (2023)
Bayoudh, K., Knani, R., Hamdaoui, F., Mtibaa, A.: A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets. Vis. Comput. 38(8), 2939–2970 (2022)
Zeghoud, S., et al.: Real-time spatial normalization for dynamic gesture classification. Vis. Comput. 38(4), 1345–1357 (2022)
Acknowledgements
This work was supported by the Funding Project of Beijing Social Science Foundation (No. 19YTC043).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Cai, X., Lu, Q., Li, F., Liu, S., Hu, Y. (2024). Hand Movement Recognition and Analysis Based on Deep Learning in Classical Hand Dance Videos. In: Sheng, B., Bi, L., Kim, J., Magnenat-Thalmann, N., Thalmann, D. (eds) Advances in Computer Graphics. CGI 2023. Lecture Notes in Computer Science, vol 14497. Springer, Cham. https://doi.org/10.1007/978-3-031-50075-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-50075-6_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-50074-9
Online ISBN: 978-3-031-50075-6
eBook Packages: Computer ScienceComputer Science (R0)