Skip to main content

Hand Movement Recognition and Analysis Based on Deep Learning in Classical Hand Dance Videos

  • Conference paper
  • First Online:
Advances in Computer Graphics (CGI 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14497))

Included in the following conference series:

  • 172 Accesses

Abstract

Hand movement recognition is one of hot research topics in the field of computer vision, which has received extensive research interests. However, current classical hand dance movement recognition has high computational complexity and low accuracy. To address these problems, we present a classical hand dance movement recognition and analysis method based on deep learning. Firstly, our method extracts the key frames from the input classical hand dance video by using an inter frame difference method. Secondly, we use a method based on stacked hourglass network to estimate the 2D hand poses of key frames. Thirdly, a network named HandLinearNet with spatial and channel attention mechanisms is constructed for 3D hand pose estimation. Finally, our method uses ConvLSTM for classical hand dance movement recognition, and outputs corresponding classical hand dance movements. The method can recognize 12 basic classical hand dance movements, where users can better analyze and study classical hand dance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lai, J., Yang, Y.: Key frame extraction based on visual attention model. J. Vis. Commun. Image Represent. 23(1), 114–125 (2012)

    Article  Google Scholar 

  2. Oikonomidis, I., Kyriazis, N., Argyros, A.: Full DoF tracking of a hand interacting with an object by modeling occlusions and physical constraints. In: IEEE International Conference on Computer Vision. IEEE (2011)

    Google Scholar 

  3. Lu, S., Metaxas, D., Samaras, D.: Using multiple cues for hand tracking and model refinement. In: 2013 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE (2013)

    Google Scholar 

  4. Zimmermann, C., Brox, T.: Learning to estimate 3D hand pose from single RGB images. In: IEEE International Conference on Computer Vision. IEEE (2017)

    Google Scholar 

  5. Zhang, X., Li, Q., Mo, H., Zhang, W.: End-to-end hand mesh recovery from a monocular RGB image. In: 2019 IEEE/CVF International Conference on Computer Vision. IEEE (2019)

    Google Scholar 

  6. Ge, L., Ren, Z., Li, Y., Xue, Z.:3D hand shape and pose estimation from a single RGB image. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2019)

    Google Scholar 

  7. Cao, Y., Liu, C., Sheng, Y., Huang, Z., Deng, X.: Action recognition model based on 3D graph convolution and attention enhanced. J. Electron. Inf. Technol. 43(7), 2071–2078 (2021)

    Google Scholar 

  8. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1

    Chapter  Google Scholar 

  9. Burger, I., Lerasle, F., Infantes, G.: Two-handed gesture recognition and fusion with speech to command a robot. Auton. Robot. 32(2), 129–147 (2012)

    Article  Google Scholar 

  10. Kuremoto, T., Kinoshita, Y., Feng, L., Watanabe, S., Kobayashi, K.: A gesture recognition system with retina-V1 model and one-pass dynamic programming. Neurocomputing 116(2), 291–300 (2013)

    Article  Google Scholar 

  11. Raj, R., Dharan, S., Thomas, S.: Optimal feature selection and classification of Indian classical dance hand gesture dataset. Vis. Comput. 39(9), 4049–4064 (2023)

    Article  Google Scholar 

  12. Ma, J., Lv, Q., Yan, H., Ye, T., Shen, Y., Sun, H.: Color-saliency-aware correlation filters with approximate affine transform for visual tracking. Vis. Comput. 39(9), 4065–4086 (2023)

    Article  Google Scholar 

  13. Bayoudh, K., Knani, R., Hamdaoui, F., Mtibaa, A.: A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets. Vis. Comput. 38(8), 2939–2970 (2022)

    Article  Google Scholar 

  14. Zeghoud, S., et al.: Real-time spatial normalization for dynamic gesture classification. Vis. Comput. 38(4), 1345–1357 (2022)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the Funding Project of Beijing Social Science Foundation (No. 19YTC043).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan Hu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Cai, X., Lu, Q., Li, F., Liu, S., Hu, Y. (2024). Hand Movement Recognition and Analysis Based on Deep Learning in Classical Hand Dance Videos. In: Sheng, B., Bi, L., Kim, J., Magnenat-Thalmann, N., Thalmann, D. (eds) Advances in Computer Graphics. CGI 2023. Lecture Notes in Computer Science, vol 14497. Springer, Cham. https://doi.org/10.1007/978-3-031-50075-6_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-50075-6_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-50074-9

  • Online ISBN: 978-3-031-50075-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics