KOA-CLSTM-based real-time dynamic hand gesture recognition on mobile terminal

Hou, Xianghui; Cen, Sijia; Zhang, Meiyu; Jian, Chengfeng

doi:10.1007/s11760-022-02395-w

KOA-CLSTM-based real-time dynamic hand gesture recognition on mobile terminal

Original Paper
Published: 22 November 2022

Volume 17, pages 1841–1854, (2023)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Xianghui Hou¹,
Sijia Cen¹,
Meiyu Zhang¹ &
…
Chengfeng Jian¹

191 Accesses
1 Altmetric
Explore all metrics

Abstract

Focus on the machine terminals with the only monocular camera and low computational power without GPU on the Mobile Terminal, it is difficult to realize the real-time dynamic hand gesture when multiple fingertips move in a small range because of the spatiotemporal noise. In this paper, the KOA-CLSTM-based real-time dynamic hand gesture recognition method is proposed for Mobile Terminals. The method consists of three parts: Kernel Optimize Accumulation (KOA), Union Frame Difference (UDF), and CLSTM. First, KOA is put forward to realize gesture extraction under special situations like parallel fingers, juxtaposed fingertips, natural syndactyly, and curved fingertips. Second, UDF facilitates frame feature fusion in both temporal and spatial dimensions, effectively resolving spatiotemporal noise such as inter-frame noise, camera dithering, and background interference on the Mobile Terminal. For the lightweight network CLSTM, which has a 5-block 3D convolutional neural network, it is better than C3D in a similar situation with a classification accuracy of 0.9689 in 0.0267 and an identification accuracy of 0.9682 in less than 0.13 s. It means that our proposed architecture on the Mobile Terminal performs well on fingertip classification and hand gestural recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improved mKLT and low layered HG-CNN based dynamic gesture recognition hardware system

Article 11 March 2024

A hand gesture recognition algorithm based on DC-CNN

Article 24 January 2019

HandSense: smart multimodal hand gesture recognition based on deep neural networks

Article 23 August 2018

Data availability

The experiment datasets in this essay generated, used or analyzed during the current study are available from the corresponding author on reasonable request.

References

Pal, S.K., et al.: Deep learning in multi-object detection and tracking: state of the art. Appl. Intell. (Dordrecht, Netherlands) 51(9), 6400–6429 (2021)
Google Scholar
Jia, L., Zhou, X., Xue, C.: Non-trajectory-based gesture recognition in human-computer interaction based on hand skeleton data. Multimed. Tools Appl. 81(15), 20509–20539 (2022)
Article Google Scholar
Sun, C., et al.: Mask-guided SSD for small-object detection. Appl. Intell. (Dordrecht, Netherlands) 51(6), 3311–3322 (2021)
Google Scholar
Xu, M., et al.: A novel dynamic graph evolution network for salient object detection. Appl. Intell. (Dordrecht, Netherlands) 52(3), 2854–2871 (2022)
Google Scholar
Li, Y., Zhang, P.: Static hand gesture recognition based on hierarchical decision and classification of finger features. Sci. Prog. (1916) (2022). https://doi.org/10.1177/00368504221086362
Article Google Scholar
Sadeddine, K., et al.: Recognition of user-dependent and independent static hand gestures: application to sign language. J. Vis. Commun. Image Represent. 79, 103193 (2021)
Article Google Scholar
Lazarou, M., Li, B., Stathaki, T.: A novel shape matching descriptor for real-time static hand gesture recognition. Comput. Vis. Image Underst. 210, 103241 (2021)
Article Google Scholar
Dong, Y., Liu, J., Yan, W.: Dynamic hand gesture recognition based on signals from specialized data glove and deep learning algorithms. IEEE Trans. Instrum. Meas. 70, 1–14 (2021)
Google Scholar
Tang, H., et al.: Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion. Neurocomputing (Amsterdam) 331, 424–433 (2019)
Article Google Scholar
Luo, Y., Cui, G., Li, D.: An improved gesture segmentation method for gesture recognition based on CNN and YCbCr. J. Electr. Comput. Eng. (2021). https://doi.org/10.1155/2021/1783246
Article Google Scholar
Wu, X.Y.: A hand gesture recognition algorithm based on DC-CNN. Multimed. Tools Appl. 79(13–14), 9193–9205 (2020)
Article Google Scholar
Zhu, G., et al.: Continuous gesture segmentation and recognition using 3DCNN and convolutional LSTM. IEEE Trans. Multimed. 21(4), 1011–1021 (2019)
Article MathSciNet Google Scholar
Ouyang, X., et al.: A 3D-CNN and LSTM based multi-task learning architecture for action recognition. IEEE Access 7, 40757–40770 (2019)
Article Google Scholar
Luo, J., Zhang, X.: Convolutional neural network based on attention mechanism and Bi-LSTM for bearing remaining life prediction. Appl. Intell. (Dordrecht, Netherlands) 52(1), 1076–1091 (2022)
Google Scholar
Zhu, G., et al.: Redundancy and attention in convolutional LSTM for gesture recognition. IEEE Trans. Neural Netw. Learn. Syst. 31(4), 1323–1335 (2020)
Article Google Scholar
De Smedt, Q., Wannous, H., Vandeborre, J.: Heterogeneous hand gesture recognition using 3D dynamic skeletal data. Comput. Vis. Image Underst. 181, 60–72 (2019)
Article Google Scholar
Jin, B., et al.: Robust dynamic hand gesture recognition based on millimeter wave Rader using Atten-TsNN. IEEE Sens. J. 22(11), 10861–10869 (2022)
Article Google Scholar
Jian, C., Liu, X., Zhang, M.: RD-hand: a real-time regression-based detector for dynamic hand gesture. Appl. Intell. (Dordrecht, Netherlands) 52(1), 417–428 (2022)
Google Scholar
Jia, L., et al.: MobileNetV3 with CBAM for bamboo stick counting. IEEE Access 10, 53963–53971 (2022)
Article Google Scholar
Amrutha, E., Arivazhagan, S., Sylvia, W.: MixNet: a robust mixture of convolutional neural networks as feature extractors to detect stego images created by content-adaptive steganography. Neural Process. Lett. 54(2), 853–870 (2022)
Article Google Scholar
Han, K., et al.: GhostNets on heterogeneous devices via cheap operations. Int. J. Comput. Vis. 130(4), 1050–1069 (2022)
Article MathSciNet Google Scholar
Han, Q., Liu, J., Jung, C.: Lightweight generative network for image inpainting using feature contrast enhancement. IEEE Access 10, 86458–86469 (2022)
Article Google Scholar
Shin, Y., et al.: PEPSI++: fast and lightweight network for image inpainting. IEEE Trans. Neural Netw. Learn. Syst. 32(1), 252–265 (2021)
Article Google Scholar
Zhang, P., et al.: Efficient lightweight attention network for face recognition. IEEE Access 10, 31740–31750 (2022)
Article Google Scholar
Kong, L., Wang, J., Zhao, P.: YOLO-G: a lightweight network model for improving the performance of military targets detection. IEEE Access 10, 55546–55564 (2022)
Article Google Scholar
Feng, Z., Lee, F., Chen, Q.: SRUNet: stacked reversed U-shape network for lightweight single image super-resolution. IEEE Access 10, 60151–60162 (2022)
Article Google Scholar
Renjun, X., et al.: Fault detection method based on improved faster R-CNN: take ResNet-50 as an example. Geofluids (2022). https://doi.org/10.1155/2022/7812410
Article Google Scholar
Ding, I., Zheng, N.: CNN deep learning with wavelet image fusion of CCD RGB-IR and depth-grayscale sensor data for hand gesture intention recognition. Sensors 22(3), 803 (2022)
Article Google Scholar
Khan, M.S., et al.: Deep learning for ocular disease recognition: an inner-class balance. Comput. Intell. Neurosci. (2022). https://doi.org/10.1155/2022/5007111
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by the National Natural Science Foundation of China under Grant No. 61672461 and No. 62073293.

Author information

Authors and Affiliations

Computer Science and Technology College, Zhejiang University of Technology, Hangzhou, China
Xianghui Hou, Sijia Cen, Meiyu Zhang & Chengfeng Jian

Authors

Xianghui Hou
View author publications
You can also search for this author in PubMed Google Scholar
Sijia Cen
View author publications
You can also search for this author in PubMed Google Scholar
Meiyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chengfeng Jian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chengfeng Jian.

Ethics declarations

Ethical approval

It is an original paper. The submission is approved by all the authors. If accepted, the work described in this paper will not be published elsewhere. And the study is not split up into several parts to increase the quantity of submissions and submitted to various journals or to one journal over time. No data have been fabricated or manipulated (including images) to support our conclusions. No data, text, or theories by others are presented as if they were our own.

Human or animal participants

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Hou, X., Cen, S., Zhang, M. et al. KOA-CLSTM-based real-time dynamic hand gesture recognition on mobile terminal. SIViP 17, 1841–1854 (2023). https://doi.org/10.1007/s11760-022-02395-w

Download citation

Received: 24 September 2022
Revised: 21 October 2022
Accepted: 07 November 2022
Published: 22 November 2022
Issue Date: July 2023
DOI: https://doi.org/10.1007/s11760-022-02395-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

KOA-CLSTM-based real-time dynamic hand gesture recognition on mobile terminal

Abstract

Access this article

Similar content being viewed by others

Improved mKLT and low layered HG-CNN based dynamic gesture recognition hardware system

A hand gesture recognition algorithm based on DC-CNN

HandSense: smart multimodal hand gesture recognition based on deep neural networks

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Human or animal participants

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

KOA-CLSTM-based real-time dynamic hand gesture recognition on mobile terminal

Abstract

Access this article

Similar content being viewed by others

Improved mKLT and low layered HG-CNN based dynamic gesture recognition hardware system

A hand gesture recognition algorithm based on DC-CNN

HandSense: smart multimodal hand gesture recognition based on deep neural networks

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethical approval

Human or animal participants

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation