research-article

A Dynamic Gesture Recognition Method Based on Encoded Video

Authors:

Zhang ZhaozheAuthors Info & Claims

AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

Pages 711 - 716

https://doi.org/10.1145/3573942.3574084

Published: 16 May 2023 Publication History

Abstract

Most of the video-based dynamic gesture recognition methods require decoding video into raw RGB images. The approved accuracy relies on multiple data patterns, such as depth map or optical flow, in specific scenario. So, the more complexity models, the huger calculation power and storage consumption. In this paper, a new characterized model for spatiotemporal data is proposed to represent the spatiotemporal features of dynamic gestures, take advantage of Intra-frames (I-frame), motion vectors, and residuals in encoded videos, so that the additional consumption of computation and storage caused by decoding videos are escaped. Furthermore, a key predicted frames (P-frame) selection (KPFS) module is proposed to filter those P-frames having no useful information, based on an image entropy estimated with the residuals. The more distinguished features are obtained. Comprehensively experiments are performed on two benchmark datasets, VIVA and SKIG. The results show that our method can achieve an average accuracy of 81.13% and 98.70% using lone RGB data, reduce the storage overhead by 88.5%. The result is similar to that of the state-of-the-art methods with the running speed of more than 4.3 times.

References

[1]

Abavisani M, Joze H R V, Patel V M. Improving the performance of unimodal dynamic hand-gesture recognition with multimodal training[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 1165-1174.

[2]

Li J, Liu R, Kong D, Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer[J]. Computational Intelligence and Neuroscience, 2021, 2021.

[3]

Wu C Y, Zaheer M, Hu H, Compressed video action recognition[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 6026-6035.

[4]

Shou Z, Lin X, Kalantidis Y, Dmc-net: Generating discriminative motion cues for fast compressed video action recognition[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019: 1268-1277.

[5]

Hu H, Zhou W, Li X, MV2Flow: Learning Motion Representation for Fast Compressed Video Action Recognition[J]. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2020, 16(3s): 1-19.

Digital Library

[6]

Kusuma T, Jagannathan S. Real time object tracking in H. 264/avc using polar vector median and global motion compensation[C]//2017 4th International Conference on Electronics and Communication Systems (ICECS). IEEE, 2017: 93-95.

[7]

Xie X, Zhao H, Jiang L. Dynamic gesture recognition based on video data features[J]. Journal of Beijing University of Posts and Telecommunications 2020; 43(5): 91.

[8]

Singla N. Motion detection based on frame difference method[J]. International Journal of Information & Computation Technology, 2014, 4(15): 1559-1565.

[9]

Ohn-Bar E, Trivedi M M . Hand Gesture Recognition in Real Time for Automotive Interfaces: A Multimodal Vision-Based Approach and Evaluations[J]. IEEE Transactions on Intelligent Transportation Systems, 2014, 15(6):2368-2377.

[10]

Liu L, Shao L. Learning discriminative representations from RGB-D video data[C]//Twenty-third international joint conference on artificial intelligence. 2013.

[11]

Wang L, Xiong Y, Wang Z, Temporal segment networks: Towards good practices for deep action recognition[C]//European conference on computer vision. Springer, Cham, 2016: 20-36.

[12]

Konovalenko I, Maruschak P, Kozbur H, Influence of uneven lighting on quantitative indicators of surface defects[J]. Machines, 2022, 10(3): 194.

[13]

Tran D, Bourdev L, Fergus R, Learning spatiotemporal features with 3d convolutional networks[C]//Proceedings of the IEEE international conference on computer vision. 2015: 4489-4497.

[14]

Carreira J, Zisserman A. Quo vadis, action recognition? a new model and the kinetics dataset[C]//proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 6299-6308.

[15]

Lu Z, Qin S, Li X, One-shot learning hand gesture recognition based on modified 3d convolutional neural networks[J]. Machine Vision and Applications, 2019, 30(7): 1157-1180.

Digital Library

[16]

Tang X, Yan Z, Peng J, Selective spatiotemporal features learning for dynamic gesture recognition[J]. Expert Systems with Applications, 2021, 169: 114499.

Index Terms

A Dynamic Gesture Recognition Method Based on Encoded Video
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Activity recognition and understanding

Recommendations

SlowFast Convolution LSTM Networks for Dynamic Gesture Recognition
APIT '21: Proceedings of the 2021 3rd Asia Pacific Information Technology Conference

Computer vision-based gesture recognition is gradually becoming a popular research direction in the field of human-computer interaction (HCI). However, there are various challenges in the extraction of gesture features, such as complex backgrounds, ...
R-Lambda model based CTU-level rate control for intra frames in HEVC

In High Efficiency Video Coding (HEVC), the coding efficiency of intra frames is much lower than inter frames. If the bits allocated to intra frames are not sufficient to improve their quality, the quality fluctuation between intra frames and their ...
Computational complexity allocation and control for inter-coding of high efficiency video coding with fast coding unit split decision

A computational complexity allocation and control method for the low-delay P-frame configuration of the HEVC encoder.The complexity allocation includes the group of pictures layer, the frame layer, and the CU layer in the HEVC encoder.Motion vector ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

AIPR '22: Proceedings of the 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

September 2022

1221 pages

ISBN:9781450396899

DOI:10.1145/3573942

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 May 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

AIPR 2022

AIPR 2022: 2022 5th International Conference on Artificial Intelligence and Pattern Recognition

September 23 - 25, 2022

Xiamen, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
51
Total Downloads

Downloads (Last 12 months)17
Downloads (Last 6 weeks)3

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten