Abstract
Communication plays an important role in today’s world. Before the evolution of the verbal communication, sign language was the only way of communication used by our ancestors. Later on, the verbal communication started evolving and different people from different region started to speak different languages. But there are some groups of people who cannot express themselves with verbal language; instead they use sign language to communicate. To bridge the gap between those people who use sign language for communication with those who use verbal language, a system is designed that recognizes the gestures of the sign language, interprets it and converts it into verbal language. Various researches have been carried out by capturing the hand signs of the speech impaired people through sensors like leap motion sensors and camera. This research works focusses on improving the gesture capturing through camera and process them through deep learning models. This work focussed on creating a hand gesture dataset “HandG” that includes 20,600 images for 10 classes (2060 images per category) using digital camera and image augmentation. A novel Convolution Neural Networks (CNN) based model, termed as “HandGCNN”, is proposed achieving a high prediction accuracy of 99.13%. A real-time system with webcam being the input receptor unit is built which recognises the signal and generates the audio relevant to that. The generated audio will serve as voice assistance for impaired people.






Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Apoorva A, Mishra GK, Sahoo RR, Bhoi SK, Mallick C (2021) Deep learning-based ship detection in remote sensing imagery using TensorFlow. In Advances in machine learning and computational intelligence (pp 165–177). Springer, Singapore. https://doi.org/10.1007/978-981-15-5243-4_14
Badi H (2016) RETRACTED ARTICLE: a survey on recent vision-based gesture recognition. Intell Ind Syst 2(2):179–191. https://doi.org/10.1007/s40903-016-0046-9
Bhoi SK, Panda SK, Patra B, Pradhan B, Priyadarshinee P, Tripathy S, … Khilar PM (2018) FallDS-IoT: a fall detection system for elderly healthcare based on IoT data analytics. In 2018 International Conference on Information Technology (ICIT) (pp 155–160). IEEE. https://doi.org/10.1109/ICIT.2018.00041
Dong Y, Liu J, Yan W (2021) Dynamic hand gesture recognition based on signals from specialized data glove and deep learning algorithms. IEEE Trans Instrum Meas 70:1–14. https://doi.org/10.1109/TIM.2021.3077967
Gullapalli S, Karthik P, Sathish P (2020) A comparative analysis of cloud based Watson system and CNN for gesture recognition systems. In 2020 IEEE international Students' conference on electrical, electronics and computer science (SCEECS) (pp 1–5). IEEE. https://doi.org/10.1109/SCEECS48394.2020.66
He K, Sun J (2015) Convolutional neural networks at constrained time cost. In proceedings of the IEEE conference on computer vision and pattern recognition (pp 5353–5360)
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) MobileNets: efficient convolutional neural networks for Mobile vision applications. ArXiv, abs/1704.04861
Jain A, Jain L, Sharma I, Chauhan A (2018) Image processing based speaking system for mute people using hand gesture. Int J Eng Sci Res Technol (IJESRT) 368–374
Jain R, Karsh RK, Barbhuiya AA (2021) Encoded motion image-based dynamic hand gesture recognition. The visual computer, 1–18. https://doi.org/10.1016/j.cag.2021.04.017
Jia Y, Shelhamer E, Donahue J, Karayev S, Long J, Girshick RB, Guadarrama S, Darrell T (2014) Caffe: convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM international conference on multimedia. https://doi.org/10.1145/2647868.2654889
Kepuska V, Bohouta G (2018) Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home). In 2018 IEEE 8th annual computing and communication workshop and conference (CCWC) (pp 99–103). IEEE. https://doi.org/10.1109/CCWC.2018.8301638
Khan T, Pathan AH (2015) Hand gesture recognition based on digital image processing using MATLAB. Int J Sci Eng Res 6(9):338–346
Khari M, Garg AK, Crespo RG, Verdú E (2019) Gesture recognition of RGB and RGB-D static images using convolutional neural networks. Int J Interact Multim Artif Intell 5(7):22–27. https://doi.org/10.9781/ijimai.2019.09.002
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Commun ACM 60:84–90. https://doi.org/10.1145/3065386
Li Y, Ma D, Yu Y, Wei G, Zhou Y (2021) Compact joints encoding for skeleton-based dynamic hand gesture recognition. Comput Graph 97:191–199
Lingyun G, Lin Z, Zhaokui W (2020) Hierarchical attention-based astronaut gesture recognition: a dataset and CNN model. IEEE Access 8:68787–68798. https://doi.org/10.1109/ACCESS.2020.2986473
Nikam AS, Ambekar AG (2016) Sign language recognition using image based hand gesture recognition techniques. In 2016 online international conference on green engineering and technologies (IC-GET) (pp 1–5). IEEE. https://doi.org/10.1109/GET.2016.7916786
Oudah M, Al-Naji A, Chahl J (2020) Hand gesture recognition based on computer vision: a review of techniques. J Imaging 6(8):73. https://doi.org/10.3390/jimaging6080073
Plouffe G, Cretu AM (2015) Static and dynamic hand gesture recognition in depth data using dynamic time warping. IEEE Trans Instrum Meas 65(2):305–316. https://doi.org/10.1109/TIM.2015.2498560
Poornima N, Murugan M (2019) Improved gesture precision virtual personal assistant (IGP-VPA) system for speech impaired people. I-manager's J Pattern Recogn 6(2):17. https://doi.org/10.26634/jpr.6.2.16754
Qian Y, Dong J, Wang W, Tan T (2015) Deep learning for steganalysis via convolutional neural networks. In media watermarking, security, and forensics 2015 (Vol 9409, p 94090J). International Society for Optics and Photonics. https://doi.org/10.1117/12.2083479
Rai P, Alva A, Mahale GK, Shetty JS, Manjushree AN (2018) Gesture recognition system. Int J Comput Sci Mob Comput 7(5):164–175
Reda MM, Mohammed NG, Seoud RAAAA (2018) SVBiComm: Sign-Voice Bidirectional Communication System for Normal, “Deaf/Dumb” and Blind People based on Machine Learning. In 2018 1st International Conference on Computer Applications & Information Security (ICCAIS) (pp 1–8). IEEE
Rougier NP, Hinsen K, Alexandre F, Arildsen T, Barba LA, Benureau FC, … Zito T (2017) Sustainable computational science: the ReScience initiative. PeerJ Comput Sci 3:e142
Saha HN, Tapadar S, Ray S, Chatterjee SK, Saha S (2018) A machine learning based approach for hand gesture recognition using distinctive feature extraction. In 2018 IEEE 8th annual computing and communication workshop and conference (CCWC) (pp 91–98). IEEE. https://doi.org/10.1109/CCWC.2018.8301631
Saxena A, Jain DK, Singhal A (2014) Hand gesture recognition using an android device. In 2014 fourth international conference on communication systems and network technologies (pp 819–822). IEEE. https://doi.org/10.1109/CSNT.2014.170
Shinde SS, Autee R (2016) Real time hand gesture recognition and voice conversion system for deaf and dumb person based on image processing. JournalNX 2(9):39–43
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556
Zhang W, Wang J, Lan F (2020) Dynamic hand gesture recognition based on short-term sampling neural networks. IEEE/CAA J Autom Sin 8(1):110–120. https://doi.org/10.1109/JAS.2020.1003465
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Stellin, R., Rukmani, P., Anbarasi, L.J. et al. HandGCNN model for gesture recognition based voice assistance. Multimed Tools Appl 81, 42353–42369 (2022). https://doi.org/10.1007/s11042-022-13497-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13497-5