Abstract
Recognition of hand gesture is a crucial task as far as accuracy of recognition and real-time performance is concerned. The recognition accuracy of hand gesture systems used in applications from virtual reality to sign language recognition depends not only on the features extracted but also on the soft computing techniques used to recognize the gesture. This requires recognition system, with robust features which are unaffected by rotation, scaling, translation, variation in illumination and view of the gesture within the image. Accordingly, the research work presented in this paper aims to recognize the dynamic hand gestures using Hough transform-based spatiotemporal feature extraction for preprocessing and artificial neural network for identification. The system is implemented and tested using benchmark Cambridge hand gesture database and Sebastien dynamic hand posture database. As many as 32 features comprising of 19 Fourier descriptors, 3 geometrical features and 10 temporal features are obtained on each gesture sequence and are given as input to NN. The experimental results revealed that this approach is able to recognize dynamic hand gestures with an average recognition rate (RR) higher than 94% on Cambridge database which is better than that reported by earlier researchers while 98% RR on Sebastien database. The noteworthy feature of the proposed approach is that it relieves from the burden of foreground detection, generally required for other video processing systems.
Similar content being viewed by others
References
Martin Sagayam, K., Jude Hemanth, D.: Hand posture and gesture recognition techniques for virtual reality applications: a survey. Virtual Real. 21(2), 91–107 (2017)
Mitra, S., Acharya, T.: Gesture recognition: a survey. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 37(3), 311–324 (2007)
Liu, J., et al.: Skeleton-based action recognition using spatio-temporal LSTM network with trust gates. IEEE Trans. Pattern Anal. Mach. Intell. (Nov 2017). https://doi.org/10.1109/TPAMI.2017.2771306
Shanableh, T., Assaleh, K., Al-Rousan, M.: Spatio-temporal feature-extraction techniques for isolated gesture recognition in Arabic sign language. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(3), 641–650 (2007)
Alon, J., et al.: A unified framework for gesture recognition and spatiotemporal gesture segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 31(9), 1685 (2009)
Patil, A.R., Subbaraman, S.: Static hand gesture detection and classification using contour based fourier descriptor. Int. J. Eng. Res. Electr. Electron. Eng. 3, 6–11 (2016)
Ye, H. et al.: A new method based on Hough transform for quick line and circle detection. In: 8th IEEE International Conference on Biomedical Engineering and Informatics (2015)
D’Orazio, T. et.al.: A new algorithm for ball recognition using circle Hough transform and neural classifier. In: Pattern Recognition Society, Published by Elsevier, 393–408 (2004)
Due, T. et. al.: Combining Hough transform and contour algorithm for detecting vehicles license-plates. In: 4th International Symposium on Intelligent Multimedia. Video and Speech Processing, Hong Kong
Ruta, A., et al.: In-vehicle camera traffic sign detection and recognition. Mach. Vis. Appl. 22, 359–375 (2011). https://doi.org/10.1007/s00138-009-0231
Chhor, V., Kondo, T.: Illumination-invariant line detection with the Gray-scale Hough transform. In: 7th IEEE International Conference on Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics, Automation and Mechatronics (RAM) (2015)
Sheshkus, A. et. al.: Combining convolutional neural networks and Hough transform for classification of images containing lines. In: Proceedings of SPIE 9th International Conference on Machine Vision (ICMV 2016), vol. 103411C-1
Munib, Q. et. al.: American sign language (ASL) recognition based on Hough transform and neural networks. In: Elsevier Expert Systems with Applications, pp. 24–37 (2007)
Wu, Y.M.: The implementation of gesture recognition for media player system. Master Thesis of the Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan (2009)
Hafizur, M.H., et. al.: Hand gesture recognition using multiclass support vector machine. Int. J. Comput. Appl. 74(1), 39–43 (2013)
Gall, J., Yao, A., Razavi, N., VanGool, L., Lempitsky, V.: Hough forests for object detection, tracking, and action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33(11), 2188–2202 (2011)
Sato, K., Aggarwal, J.K.: Temporal spatio-velocity transform and its application to tracking and interaction. Comput. Vis. Image Underst. 96(2), 100–128 (2004)
Shaika, K.B. et. al.: Comparative study of skin color detection and segmentation in HSV and YCbCr color space. In: 3rd International Elsevier Conference on Recent Trends in Computing (2015)
Patil, A.R., Subbaraman, S.: Illumination invariant hand gesture classification against complex background using combinational features. Int. J. Comput. Sci. Inf. Secur. (IJCSIS) 16(3), 63–70 (2018)
Duda, R.O., Hart, P.E.: Use of the Hough transformation to detect lines and curves in pictures. Commun. ACM 15, 11–15 (1972)
Seib, V., Kusenbach, M. et. al.: Object recognition using hough-transform clustering of SURF features. In: Scientific Cooperations International Workshops on Electrical and Computer Engineering Subfields 22–23 Aug (2014)
Patil, A.R., Subbaraman, S.: Comparative analysis of various feature extraction techniques with special reference to HGR. In: IEEE International Conference on 5th Signal and Image Processing, Bangalore (2014)
Kim, T., Wong, S., Cipolla, R.: Tensor canonical correlation analysis for action classification. In: Conference on Computer Vision and Pattern Recognition, Minneapolis, Minnesota, USA, pp. 1–8 (2007)
Marcel, S., Bernier, O., Viallet, J.-E., Collobert, D.: Hand gesture recognition using input/ouput hidden Markov models. In: Proceedings of the 4th International Conference on Automatic Face and Gesture Recognition (AFGR) (2000)
Kim, T.-K., Kittler, J., Cipolla, R.: Discriminative learning and recognition of image set classes using canonical correlations. IEEE Trans. Pattern Anal. Mach. Intell. 6, 1–14 (2007)
Lui, Y.M., Beveridge, J., Kirby, M.: Action classification on product manifolds. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 833–839 (2010)
Lui, Y.: Tangent bundles on special manifolds for action recognition. IEEE Trans. Circ. Syst. Video Technol. 22(6), 930–942 (2012)
Sanin, A. et. al.: Spatio-temporal covariance descriptors for action and gesture recognition. In: WACV ‘13 Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision (WACV), pp. 103–110 (2013)
Liu, L., Shao, L.: Synthesis of spatio-temporal descriptors for dynamic hand gesture recognition using genetic programming. In: 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG) (2013)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Patil, A.R., Subbaraman, S. A spatiotemporal approach for vision-based hand gesture recognition using Hough transform and neural network. SIViP 13, 413–421 (2019). https://doi.org/10.1007/s11760-018-1370-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-018-1370-1