Skip to main content
Log in

Deep neural learning techniques with long short-term memory for gesture recognition

  • S.I. : Applying Artificial Intelligence to the Internet of Things
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Gesture recognition is a kind of biometric which has assumed great significance in the field of computer vision for communicating information through human activities. To recognize the various gestures and achieve efficient classification, an efficient computational machine learning technique is required. The Shift Invariant Convolutional Deep Structured Neural Learning with Long Short-Term Memory (SICDSNL–LSTM) and Bivariate Fully Recurrent Deep Neural Network with Long Short-Term Memory (BFRDNN–LSTM) have been introduced for classifying human activities with efficient accuracy and minimal time complexity. The SICDSNL–LSTM technique collects gesture data (a kind of biometric) from the dataset and gives it to the input layers of Shift Invariant Convolutional Deep Structured Neural Learning. The SICDSNL–LSTM technique uses two hidden layers for performing regression and classification. In the regression process, dice similarity is used for measuring the relationship between data and output classes. In the second process, the input data are classified into dissimilar classes for each time step using LSTM unit with soft-step activation function. The soft-step activation function uses ‘forget gate’ for removing the less significant data from the cell state. This is also used to make a decision to display the output at a particular time step and to remove other class results. Then, LSTM output is given to the output layers, and convolutional deep neural learning is performed to classify the gesture. Based on the classification results, human activity and gesture are recognized with high accuracy. The BFRDNN–LSTM technique also performs regression in the first hidden layers using bivariate correlation to find relationship between data and classes. The LSTM unit in BFRDNN–LSTM technique uses Gaussian activation function in the second hidden layers for categorizing incoming data into various classes at each time step. In this proposed BFRDNN–LSTM method, fully recurrent deep neural network utilizes gradient descent function to minimize the error rate at the output layers and to increase the accuracy of the gesture recognition. Both SICDSNL–LSTM and BFRDNN–LSTM techniques automatically learn the features and the data to minimize time complexity in gesture recognition. Experimental evaluation is carried out using factors such as gesture recognition accuracy, false-positive rate and time complexity with a number of data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Chong Y, Huang J, Pan S (2016) Hand gesture recognition using appearance features based on 3d point cloud. J Softw Eng Appl 9(04):103

    Article  Google Scholar 

  2. Czuszyński K, Rumiński J, Kwaśniewska A (2018) Gesture recognition with the linear optical sensor and recurrent neural networks. IEEE Sens J 18(13):5429–5438

    Article  Google Scholar 

  3. Davila J, Cretu AM, Zaremba M (2017) Wearable sensor data classification for human activity recognition based on an iterative learning framework. Sensors 17(6):1287

    Article  Google Scholar 

  4. Ding H, He Q, Zhou Y, Dan G, Cui S (2017) An individual finger gesture recognition system based on motion-intent analysis using mechanomyogram signal. Front Neurol 8:573

    Article  Google Scholar 

  5. Dominio F, Donadeo M, Zanuttigh P (2014) Combining multiple depth-based descriptors for hand gesture recognition. Pattern Recognit Lett 50:101–111

    Article  Google Scholar 

  6. Fong S, Liang J, Fister I, Mohammed S (2015) Gesture recognition from data streams of human motion sensor using accelerated PSO swarm search feature selection algorithm. J Sens. https://doi.org/10.1155/2015/205707

    Article  Google Scholar 

  7. Frolova D, Stern H, Berman S (2013) Most probable longest common subsequence for recognition of gesture character input. IEEE Trans Cybern 43(3):871–880

    Article  Google Scholar 

  8. Gao L, Bourke A, Nelson J (2014) Evaluation of accelerometer based multi-sensor versus single-sensor activity recognition systems. Med Eng Phys 36(6):779–785

    Article  Google Scholar 

  9. Goyal M, Shahi B, Prema K, Reddy NS (2017) Performance analysis of human gesture recognition techniques. In: 2017 2nd IEEE international conference on recent trends in electronics, information and communication technology (RTEICT), IEEE, pp 111–115

  10. Gupta A, Sehrawat VK, Khosla M (2012) Fpga based real time human hand gesture recognition system. Proc Technol 6:98–107

    Article  Google Scholar 

  11. Hasan MM, Mishra PK (2012) Robust gesture recognition using Gaussian distribution for features fitting. Int J Mach Learn Comput 2(3):266

    Article  Google Scholar 

  12. He J, Zhang C, He X, Dong R (2019) Visual recognition of traffic police gestures with convolutional pose machine and handcrafted features. Neurocomputing. https://doi.org/10.1016/j.neucom.2019.07.103

    Article  Google Scholar 

  13. Joshi A, Monnier C, Betke M, Sclaroff S (2017) Comparing random forest approaches to segmenting and classifying gestures. Image Vis Comput 58:86–95

    Article  Google Scholar 

  14. Kılıboz NÇ, Güdükbay U (2015) A hand gesture recognition technique for human–computer interaction. J Vis Commun Image Represent 28:97–104

    Article  Google Scholar 

  15. Kim JH, Hong GS, Kim BG, Dogra DP (2018) deepgesture: deep learning-based gesture recognition scheme using motion sensors. Displays 55:38–45

    Article  Google Scholar 

  16. Kwon Y, Kang K, Bae C (2014) Unsupervised learning for human activity recognition using smartphone sensors. Expert Syst Appl 41(14):6067–6074

    Article  Google Scholar 

  17. Li C, Xie C, Zhang B, Chen C, Han J (2018a) Deep fisher discriminant learning for mobile hand gesture recognition. Pattern Recognit 77:276–288

    Article  Google Scholar 

  18. Li Y, Wang X, Liu W, Feng B (2018b) Deep attention network for joint hand gesture localization and recognition using static RGB-D images. Inf Sci 441:66–78

    Article  MathSciNet  Google Scholar 

  19. Li YT, Wachs JP (2014) Hegm: a hierarchical elastic graph matching for hand gesture recognition. Pattern Recognit 47(1):80–88

    Article  Google Scholar 

  20. Liu K, Chen C, Jafari R, Kehtarnavaz N (2014) Fusion of inertial and depth sensor data for robust hand gesture recognition. IEEE Sens J 14(6):1898–1903

    Article  Google Scholar 

  21. Liu YT, Zhang YA, Zeng M (2018) Novel algorithm for hand gesture recognition utilizing a wrist-worn inertial sensor. IEEE Sens J 18(24):10,085–10,095

    Article  Google Scholar 

  22. Madeo RCB, Peres SM, de Moraes Lima CA (2016) Gesture phase segmentation using support vector machines. Expert Syst Appl 56:100–115

    Article  Google Scholar 

  23. Nguyen-Duc-Thanh N, Lee S, Kim D (2012) Two-stage hidden markov model in gesture recognition for human robot interaction. Int J Adv Robot Syst 9(2):39

    Article  Google Scholar 

  24. Nunez JC, Cabido R, Pantrigo JJ, Montemayor AS, Velez JF (2018) Convolutional neural networks and long short-term memory for skeleton-based human activity and hand gesture recognition. Pattern Recognit 76:80–94

    Article  Google Scholar 

  25. Ordóñez F, Roggen D (2016) Deep convolutional and lstm recurrent neural networks for multimodal wearable activity recognition. Sensors 16(1):115

    Article  Google Scholar 

  26. Rahman SU, Afroze Z, Tareq M (2014) Hand gesture recognition techniques for human computer interaction using open cv. Int J Sci Res Publ 4(12):1–6

    Google Scholar 

  27. Razzaq M, Cleland I, Nugent C, Lee S (2018) Multimodal sensor data fusion for activity recognition using filtered classifier. In: Multidisciplinary digital publishing institute proceedings, vol 2, p 1262

  28. Ronao CA, Cho SB (2017) Recognizing human activities from smartphone sensors using hierarchical continuous hidden markov models. Int J Distrib Sens Netw 13(1):1550147716683,687

    Article  Google Scholar 

  29. Sagayam KM, Hemanth DJ (2017) Hand posture and gesture recognition techniques for virtual reality applications: a survey. Virtual Real 21(2):91–107

    Article  Google Scholar 

  30. Santos D, Fernandes B, Bezerra B (2015) HAGR-D: a novel approach for gesture recognition with depth maps. Sensors 15(11):28,646–28,664

    Article  Google Scholar 

  31. Soares AD, Apolinário AL Jr (2017) Real-time 3d gesture recognition using dynamic time warping and simplification methods. J WSCG 25(1):59–66

    Google Scholar 

  32. Sun J, Fu Y, Li S, He J, Xu C, Tan L (2018) Sequential human activity recognition based on deep convolutional network and extreme learning machine using wearable sensors. J Sens. https://doi.org/10.1155/2018/8580959

    Article  Google Scholar 

  33. Tai TM, Jhang YJ, Liao ZW, Teng KC, Hwang WJ (2018) Sensor-based continuous hand gesture recognition by long short-term memory. IEEE Sens Lett 2(3):1–4

    Article  Google Scholar 

  34. Trabelsi D, Mohammed S, Chamroukhi F, Oukhellou L, Amirat Y (2013) An unsupervised approach for automatic activity recognition based on hidden markov model regression. IEEE Trans Autom Sci Eng 10(3):829–835

    Article  Google Scholar 

  35. Wang Z, Chen B, Wu J (2018) Effective inertial hand gesture recognition using particle filtering based trajectory matching. J Electr Comput Eng. https://doi.org/10.1155/2018/6296013

    Article  MathSciNet  Google Scholar 

  36. Xu C, He J, Zhang X, Cai H, Duan S, Tseng PH, Li C (2018) Recurrent transformation of prior knowledge based model for human motion recognition. Comput Intell Neurosci. https://doi.org/10.1155/2018/4160652

    Article  Google Scholar 

  37. Zhu J, San-Segundo R, Pardo JM (2017) Feature extraction for robust physical activity recognition. Hum Centric Comput Inf Sci 7(1):16

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepak Kumar Jain.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jain, D.K., Mahanti, A., Shamsolmoali, P. et al. Deep neural learning techniques with long short-term memory for gesture recognition. Neural Comput & Applic 32, 16073–16089 (2020). https://doi.org/10.1007/s00521-020-04742-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-020-04742-9

Keywords

Navigation