skip to main content
10.1145/3605423.3605430acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicctaConference Proceedingsconference-collections
research-article

Optimization of Deep CNN-based Bangla Sign Language Recognition using XGBoost classifier

Published:20 August 2023Publication History

ABSTRACT

Deafness is one of the major health problems in Bangladesh, where almost 10% of the population falls under this category. The plight of deaf individuals to fit in with the mainstream population is significant. Unable to express themselves through sound, they have to rely on sign language to communicate.To create a more inclusive environment for this minority, we developed a model that recognizes Bengali characters and digits expressed using sign language in accordance with Bangla Sign Language (BdSL) guidelines. The model comprises two parts: a convolutional neural network that uses deep learning techniques, and an Extreme Gradient Boosting (XGBoost) classifier. Together, these components form the full architecture. The model’s execution was validated using the ‘Ishara-Lipi’ dataset, which is the first open-access digit and character dataset for BdSL. With the help of pre-processing techniques such as contrast-limited adaptive histogram equalization, using a more complex pre-trained CNN model like Inception-ResNet-v2, and optimizing the XGBoost model by using GridSearchCV, we achieved an accuracy of 86.67%, precision of 89%, recall and f1 score of 87%. Lastly, for digit recognition, we obtained an accuracy of 97.33%, precision of 98%, and recall and f1-score of 97%.

References

  1. Shahjalal Ahmed, Md Islam, Jahid Hassan, Minhaz Uddin Ahmed, Bilkis Jamal Ferdosi, Sanjay Saha, Md Shopon, 2019. Hand sign to Bangla speech: a deep learning in vision based system for recognizing hand sign digits and generating Bangla speech. arXiv preprint arXiv:1901.05613 (2019).Google ScholarGoogle Scholar
  2. Sunanda Das, Md Samir Imtiaz, Nieb Hasan Neom, Nazmul Siddique, and Hui Wang. 2023. A hybrid approach for Bangla sign language recognition using deep transfer learning model with random forest classifier. Expert Systems with Applications 213 (2023), 118914.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jerome H Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of statistics (2001), 1189–1232.Google ScholarGoogle Scholar
  4. Promila Haque, Badhon Das, and Nazmun Nahar Kaspy. 2019. Two-handed bangla sign language recognition using principal component analysis (PCA) and KNN algorithm. In 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE). IEEE, 1–4.Google ScholarGoogle ScholarCross RefCross Ref
  5. Muttaki Hasan, Tanvir Hossain Sajib, and Mrinmoy Dey. 2016. A machine learning based approach for the detection and recognition of Bangla sign language. In 2016 International Conference on Medical Engineering, Health Informatics and Technology (MediTec). IEEE, 1–5.Google ScholarGoogle ScholarCross RefCross Ref
  6. Md Mehedi Hasan, Azmain Yakin Srizon, and Md Al Mehedi Hasan. 2020. Classification of Bengali sign language characters by applying a novel deep convolutional neural network. In 2020 IEEE Region 10 Symposium (TENSYMP). IEEE, 1303–1306.Google ScholarGoogle ScholarCross RefCross Ref
  7. Md Shafiqul Islalm, Md Moklesur Rahman, Md Hafizur Rahman, Md Arifuzzaman, Roberto Sassi, and Md Aktaruzzaman. 2019. Recognition bangla sign language using convolutional neural network. In 2019 international conference on innovation and intelligence for informatics, computing, and technologies (3ICT). IEEE, 1–6.Google ScholarGoogle ScholarCross RefCross Ref
  8. Md Sanzidul Islam, Sadia Sultana Sharmin Mousumi, Nazmul A Jessan, AKM Shahariar Azad Rabby, and Sayed Akhter Hossain. 2018. Ishara-lipi: The first complete multipurposeopen access dataset of isolated characters for bangla sign language. In 2018 International Conference on Bangla Speech and Language Processing (ICBSLP). IEEE, 1–4.Google ScholarGoogle ScholarCross RefCross Ref
  9. WW Kong and Surendra Ranganath. 2014. Towards subject independent continuous sign language recognition: A segment and merge approach. Pattern Recognition 47, 3 (2014), 1294–1308.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Wei-Meng Lee. 2021. Tuning the Hyperparameters of Your Machine Learning Model Using Grid Search CV. https://towardsdatascience.com/tuning-the-hyperparameters-of-your-machine-learning-model-using-gridsearchcv-7fc2bb76ff27.Google ScholarGoogle Scholar
  11. Jiaqing Liu, Kotaro Furusawa, Tomoko Tateyama, Yutaro Iwamoto, and Yen-wei Chen. 2019. An Improved Kinect-Based Real-Time Gesture Recognition Using Deep Convolutional Neural Networks for Touchless Visualization of Hepatic Anatomical Mode. Journal of Image and Graphics 7, 2 (2019), 45–49.Google ScholarGoogle ScholarCross RefCross Ref
  12. Mygel Andrei M Martija, Jakov Ivan S Dumbrique, and Prospero C Naval Jr. 2020. Underwater gesture recognition using classical computer vision and deep learning techniques. (2020).Google ScholarGoogle Scholar
  13. Md Humaion Kabir Mehedi, Ehteshamul Haque, Sameen Yasir Radin, Md Abrar Ur Rahman, Md Tanzim Reza, and Md. Golam Robiul Alam. 2022. Kidney Tumor Segmentation and Classification using Deep Neural Network on CT Images. In 2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA). 1–7. https://doi.org/10.1109/DICTA56598.2022.10034638Google ScholarGoogle ScholarCross RefCross Ref
  14. Trong-Nguyen Nguyen, Huu-Hung Huynh, and Jean Meunier. 2013. Static hand gesture recognition using artificial neural network. Journal of Image and Graphics 1, 1 (2013), 34–38.Google ScholarGoogle ScholarCross RefCross Ref
  15. Waseem Rawat and Zenghui Wang. 2017. Deep convolutional neural networks for image classification: A comprehensive review. Neural computation 29, 9 (2017), 2352–2449.Google ScholarGoogle Scholar
  16. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander A Alemi. 2017. Inception-v4, inception-resnet and the impact of residual connections on learning. In Thirty-first AAAI conference on artificial intelligence.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Jia Uddin, Fahmid Nasif Arko, Nujhat Tabassum, Taposhi Rabeya Trisha, and Fariha Ahmed. 2017. Bangla sign language interpretation using bag of features and Support Vector Machine. In 2017 3rd International Conference on Electrical Information and Communication Technology (EICT). IEEE, 1–4.Google ScholarGoogle ScholarCross RefCross Ref
  18. Jingzhong Wang, Xiaoqing Xu, and Meng Li. 2015. The study of gesture recognition based on SVM with LBP and PCA. Journal of Image and Graphics 3, 1 (2015), 16–19.Google ScholarGoogle ScholarCross RefCross Ref
  19. Haoyang Xu. 2020. The Problem with’Accurate’History: Complexity within Sallust’s Bellum Catilinae. Int’l J. Soc. Sci. Stud. 8 (2020), 81.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Optimization of Deep CNN-based Bangla Sign Language Recognition using XGBoost classifier

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCTA '23: Proceedings of the 2023 9th International Conference on Computer Technology Applications
        May 2023
        270 pages
        ISBN:9781450399579
        DOI:10.1145/3605423

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 August 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)15
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format