research-article

Spatio-Temporal dependency preserving Cognitive-assisted Continuous Chinese Sign Language Recognition

Authors:
Priyanka Ganesan

Mepco Schlenk Engineering College, India

Mepco Schlenk Engineering College, India

0000-0001-6961-2994
View Profile

,
Senthil Jagatheesaperumal

Mepco Schlenk Engineering College, India

Mepco Schlenk Engineering College, India

0000-0002-9516-0327
View Profile

,
Silvia Gaftandzhieva

University of Plovdiv Paisii Hilendarski, Bulgaria

University of Plovdiv Paisii Hilendarski, Bulgaria

0000-0002-0569-9776
View Profile

,
Rositsa Doneva

University of Plovdiv Paisii Hilendarski, Bulgaria

University of Plovdiv Paisii Hilendarski, Bulgaria

0000-0003-0296-1297
View Profile

CompSysTech '23: Proceedings of the 24th International Conference on Computer Systems and TechnologiesJune 2023Pages 31–37https://doi.org/10.1145/3606305.3606307

Published:12 September 2023Publication History

CompSysTech '23: Proceedings of the 24th International Conference on Computer Systems and Technologies

Pages 31–37

ABSTRACT

Sign Language recognition system plays a vital role for the hearing and visually impaired people to make normal communication with the other common people. But deaf and dumb people signs are not understandable to the common person, leading to a communication barrier. Also, there were only 50 certified sign language interpreters in India for a deaf population of around 7 million. To come this communication barrier, an intelligent translator system for isolated and continuous sign language recognition is proposed. In our proposed work, the continuous sign language recognition is dealt as a multi-class classification problem. A Video-based custom Long-Short Term Memory (VidcuLSTM) model was designed and configured with different hyperparameter tuning and optimizers to avoid memorization and overfitting of the translator system. We evaluated the proposed system using Chinese Isolated and Continuous SLR dataset and it is evident from the results that the proposed system outperforms existing state-of-art systems with improved performance in recognition rate by around 5% with reduced WER of around 2.67.

References

L. Boppana, R. Ahamed, H. Rane and R. K. Kodali, "Assistive Sign Language Converter for Deaf and Dumb," 2019 International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), 2019, pp. 302-307, doi: 10.1109/iThings/GreenCom/CPSCom/SmartData.2019.00071.Google ScholarCross Ref
Anshul Mittal, Pradeep Kumar, Partha Roy, Raman Balasubramanian, BidyutB. Chaudhari, “A Modified LSTM model for Continuous Sign Language Recognition Using Leap Motion”, IEEE Sensors Journal 19(16):7056-7063, Aug 2019.Google ScholarCross Ref
Gaolin Fang, Wen Gao, Debin Zhao, “A Real-Time Large Vocabulary Recognition System for Chinese Sign Language”, IEEE Conference on Advances in Multimedia Information Processing, Oct 2016.Google Scholar
Guilin Yao, Hongxun Yao, Xin Liu, Feng Jiang, “Real Time Large Vocabulary Continuous Sign Language Recognition based on OP/Viterbi Algorithm”, IEEE Conference on Pattern Recognition,2006.Google Scholar
Tamer Shanableh, Khaled Assaleh, M.Al-Rousan, “Spatio-Temporal Feature-Extraction Techniques for Isolated Gesture Recognition in Arabic Sign Language”, IEEE Transactions On Cybernetics 37(3):641-50, Jul 2007.Google ScholarDigital Library
Qinkun Xiao, Minying Qin, Peng Gao, Yidan Zhao, “Multimodal Fusion based on LSTM and a Couple Conditional Hidden Markov Model for Chinese Sign Language Recognition, IEEE Access PP(99):1-1,June 2019.Google ScholarCross Ref
Gaolin Fang, Wen Gao, Debin Zhao, “Large-Vocabulary Continuous Sign Language Recognition based on Transition-Movement Models”, IEEE Transactions on Systems, Man, And Cybernetics, Vol. 37, No. 1, January 2007.Google Scholar
Gaolin Fang, Wen Gao, Debin Zhao, “Large Vocabulary Sign Language Recognition based on Hierarchical Decision Trees”, IEEE Transactions on Systems Man and Cybernetics, June 2004.Google Scholar
Cao Dong, Ming C. Leu, Zhaozheng Yin, “American Sign Language Alphabet Recognition Using Microsoft Kinect”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 44-52, 2015.Google Scholar
Assaleh, K., Al-Rousan, M. Recognition of Arabic Sign Language Alphabet Using Polynomial Classifiers. EURASIP J. Adv. Signal Process. 2005, 507614 (2005). https://doi.org/10.1155/ASP.2005.2136Google ScholarDigital Library
W. Aly, S. Aly and S. Almotairi, "User-Independent American Sign Language Alphabet Recognition Based on Depth Image and PCANet Features," in IEEE Access, vol. 7, pp. 123138-123150, 2019, doi: 10.1109/ACCESS.2019.2938829.Google ScholarCross Ref
Prajwal Paudyal, Junghyo Lee, Ayan Banerjee, Sandeep K. S. Gupta, “A Comparison of Techniques for Sign Language Alphabet Recognition Using Armband Wearables”, ACM Transactions on Interactive Intelligent Systems, Volume 9,Issue 2-3, September 2019.Google ScholarDigital Library
A. M. Rafi, N. Nawal, N. S. N. Bayev, L. Nima, C. Shahnaz and S. A. Fattah, "Image-based Bengali Sign Language Alphabet Recognition for Deaf and Dumb Community," 2019 IEEE Global Humanitarian Technology Conference (GHTC), 2019, pp. 1-7, doi: 10.1109/GHTC46095.2019.9033031.Google ScholarCross Ref
M. Mohandes, M. Deriche and J. Liu, "Image-Based and Sensor-Based Approaches to Arabic Sign Language Recognition," in IEEE Transactions on Human-Machine Systems, vol. 44, no. 4, pp. 551-557, Aug. 2014, doi: 10.1109/THMS.2014.2318280.Google ScholarCross Ref
Masood, S., Srivastava, A., Thuwal, H.C., Ahmad, M. (2018). Real-Time Sign Language Gesture (Word) Recognition from Video Sequences Using CNN and RNN. In: Bhateja, V., Coello Coello, C., Satapathy, S., Pattnaik, P. (eds) Intelligent Engineering Informatics. Advances in Intelligent Systems and Computing, vol 695. Springer, Singapore. https://doi.org/10.1007/978-981-10-7566-7_63.Google ScholarCross Ref
Lean Karlo S. Tolentino, Ronnie O. Serfa Juan, August C. Thio-ac, Maria Abigail B. Pamahoy, Joni Rose R. Forteza, and Xavier Jet O. Garcia, “Static Sign Language Recognition Using Deep Learning”, International Journal of Machine Learning and Computing, Vol. 9, No. 6, December 2019.Google ScholarCross Ref
Runpeng Cui, Hu Liu, Changshui Zhang, “Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7361-7369, 2017.Google ScholarCross Ref
Junfu Pu, Wengang Zhou, Houqiang Li, “Iterative Alignment Network for Continuous Sign Language Recognition”, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4165-4174, 2019.Google ScholarCross Ref
Aloysius, N., Geetha, M. Understanding vision-based continuous sign language recognition. Multimed Tools Appl 79, 22177–22209 (2020). https://doi.org/10.1007/s11042-020-08961-z.Google ScholarCross Ref
O Koller, O Zargaran, H Ney and R Bowden, “Deep Sign: Hybrid CNN-HMM for Continuous Sign Language Recognition”, Proceedings of the British Machine Vision Conference 2016, The British Machine Vision Conference (BMVC) 2016.Google ScholarCross Ref
Fang, G., Gao, W., Chen, X., Wang, C., Ma, J. (2002). Signer-Independent Continuous Sign Language Recognition Based on SRN/HMM. In: Wachsmuth, I., Sowa, T. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 2001. Lecture Notes in Computer Science, vol 2298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47873-6_8.Google ScholarCross Ref
Wenwen Yang, Jinxu Tao, Zhongfu Ye, “Continuous sign language recognition using level building based on fast hidden Markov model”, Pattern Recognition Letters, Volume 78, Pages 28-35, 2016.Google ScholarDigital Library
R. Cui, H. Liu and C. Zhang, "A Deep Neural Framework for Continuous Sign Language Recognition by Iterative Training," in IEEE Transactions on Multimedia, vol. 21, no. 7, pp. 1880-1891, July 2019, doi: 10.1109/TMM.2018.2889563.Google ScholarCross Ref
Cheng, K.L., Yang, Z., Chen, Q., Tai, YW. (2020). Fully Convolutional Networks for Continuous Sign Language Recognition. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, JM. (eds) Computer Vision – ECCV 2020. ECCV 2020. Lecture Notes in Computer Science, vol 12369. Springer, Cham. https://doi.org/10.1007/978-3-030-58586-0_41.Google ScholarDigital Library
A. Mittal, P. Kumar, P. P. Roy, R. Balasubramanian and B. B. Chaudhuri, "A Modified LSTM Model for Continuous Sign Language Recognition Using Leap Motion," in IEEE Sensors Journal, vol. 19, no. 16, pp. 7056-7063, 15 Aug.15, 2019, doi: 10.1109/JSEN.2019.2909837.Google ScholarCross Ref
Camgoz, N.C., Hadfield, S., Koller, O., Bowden, R., “Subunets: end-to-end hand shape and continuous sign language recognition”, Proceedings of IEEE International Conference on Computer Vision, pp. 3075–3084 (2017).Google Scholar
Guo, D., Zhou, W., Li, H., Wang, M., “Hierarchical LSTM for sign language translation”, Proceedings of AAAI Conference on Artificial Intelligence, pp. 6845–6852 (2018).Google Scholar
Huang, J., Zhou, W., Zhang, Q., Li, H., Li, W.: Video-based sign language recognition without temporal segmentation. In: Proceedings of AAAI Conference on Artificial Intelligence, pp. 2257–2264 (2018).Google Scholar
Yang, Z., Shi, Z., Shen, X., Tai, Y.W., “SF-net: structured feature network for continuous sign language recognition”, arXiv preprint arXiv:1908.01341 (2019).Google Scholar
Pu, J., Zhou, W., Li, H., “Iterative alignment network for continuous sign language recognition”, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 4165–4174 (2019).Google Scholar
T. Prathiba, R. Shantha Selva Kumari, “Content based video retrieval system based on multimodal featuregrouping by KFCM clustering algorithm to promote human–computerinteraction”, Journal of Ambient Intelligence and Humanized Computing, Volume 12, PP. 6215–6229, 2021.Google ScholarCross Ref
G. Priyanka, M. Prasha Meena, “Survey and Evaluation on Video Summarization Techniques”, Journal of Critical Reviews, Vol 7, Issue 8, 2020.Google Scholar
S.Arivazhagan, R.NewlinShebiah, V.Sridevi, “Development of video analytic algorithm for anomalydetection in individual and crowd behavior”, International Journal of Applied Engineering Research, ISSN 0973-4562 Vol. 10 No.1, 2015.Google Scholar
Newlin Shebiah Russel, Arivazhagan Selvaraj,“Fusion of spatial and dynamic CNN streams for action recognition”, Multimedia Systems, Springer, 2021.https://doi.org/10.1145/1177352.1177355Google ScholarDigital Library
Kasabov, N., NeuCube EvoSpike Architecture for Spatio-Temporal Modelling and Pattern Recognition of Brain Signals, in: Mana, Schwenker and Trentin (Eds) ANNPR, Springer LNAI 7477, 2012, 225-243, https://doi.org/10.1007/978-3-642-33212-8_21.Google ScholarDigital Library
Tu, E., Kasabov, N., & Yang, J. (2017). Mapping temporal variables into the NeuCube for improved pattern recognition, predictive modelling, and understanding of stream data. IEEE transactions on neural networks and learning systems, 28(6), 1305-1317.Google Scholar
Kasabov, N., Time-Space, Spiking Neural Networks and Brain-Inspired Artificial Intelligence, Springer Nature (2019) 750p., https://www.springer.com/gp/book/9783662577134Google Scholar

Index Terms

Spatio-Temporal dependency preserving Cognitive-assisted Continuous Chinese Sign Language Recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
    2. Natural language processing
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning

Index terms have been assigned to the content through auto-classification.

Recommendations

A teaching system of Japanese sign language using sign language recognition and generation
MULTIMEDIA '02: Proceedings of the tenth ACM international conference on Multimedia

In recent years, the number of sign language learners is increasing in Japan. And there are many teaching materials of sign language such as textbooks, videotapes and software for PCs. However, these teaching materials have several problems that ...
Read More
Chinese Sign Language Recognition with Batch Sampling ResNet-Bi-LSTM
Abstract
Sign language has served as a communication medium between the Deaf community and society. Nonetheless, the practice of sign language is not common in Chinese society, along with a lack of professional sign language interpreters. Most existing ...
Read More
Mobile sign language translation system for deaf community
W4A '12: Proceedings of the International Cross-Disciplinary Conference on Web Accessibility

Nowadays, web technologies are a very efficient way to ensure communication between a large and heterogeneous audience. Furthermore, web information is mainly based on textual and multimedia content and consequently, some people with special needs, such ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CompSysTech '23: Proceedings of the 24th International Conference on Computer Systems and Technologies
June 2023
201 pages
ISBN:9798400700477
DOI:10.1145/3606305
Editors:
Tzvetomir Vassilev,
Roumen Trifonov
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 September 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
CSLR
LSTM
Recognition
WER
sign language
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate241of492submissions,49%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 23
  Total Downloads
- Downloads (Last 12 months)23
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Spatio-Temporal dependency preserving Cognitive-assisted Continuous Chinese Sign Language Recognition

CompSysTech '23: Proceedings of the 24th International Conference on Computer Systems and Technologies

ABSTRACT

References

Cited By

Index Terms

Recommendations

A teaching system of Japanese sign language using sign language recognition and generation

Chinese Sign Language Recognition with Batch Sampling ResNet-Bi-LSTM

Mobile sign language translation system for deaf community

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Spatio-Temporal dependency preserving Cognitive-assisted Continuous Chinese Sign Language Recognition

CompSysTech '23: Proceedings of the 24th International Conference on Computer Systems and Technologies

ABSTRACT

References

Cited By

Index Terms

Recommendations

A teaching system of Japanese sign language using sign language recognition and generation

Chinese Sign Language Recognition with Batch Sampling ResNet-Bi-LSTM

Mobile sign language translation system for deaf community

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media