Abstract
Sign language recognition system generally consists of three main processes, which are segmentation, modelling, and classification. Image segmentation plays a crucial role as the initial step in sign language recognition. Despite the many sign language recognition system algorithms proposed in the literature and their well-understood usage, their performance analyses are relatively limited. As such, the main motivation of this paper is to critically analyse the feasibility of successful sign language segmentation under variation of dynamic scene parameters such as noise, hand size, and intensity difference between hand and background. The focus is on image thresholding using Otsu technique, since it is the most commonly used in initial process of sign language segmentation. The analysis of this work was developed based on Monte Carlo statistical method, which showed that the success of sign language segmentation depends on hand size, hand background intensity difference, and noise measurement. The result showed that the sign alphabets with handheld shape like A, E, I, M, N, S, and T is easier to segment, while sign alphabets with finger-extend shape like C, D, F, G, H, K, L, P, R, U, V, W, and Y is harder to segment. Experiment using real images demonstrate the capability of the conditions to correctly predict the outcome of sign language segmentation using Otsu technique. In conclusion, the success of sign language segmentation could be predicted beforehand with obtainable scene parameters.














Similar content being viewed by others
References
Ahmed MA, Zaidan BB, Zaidan AA, Salih MM, Lakulu MMB (2018) A review on systems-based sensory gloves for sign language recognition state of the art between 2007 and 2017. Sensors 18(7):2208
Anjna E (2016) Review of image segmentation technique. J Pediatr 175(4):246–247
Badi H (2016) Recent methods in vision-based hand gesture recognition. International Journal of Data Science and Analytics 1(2):77–87
Basah SN, Hoseinnezhad R, Bab-Hadiashar A (2008) Limits of motion-background segmentation using fundamental matrix estimation, in Proceedings - Digital Image Computing: Techniques and Applications, DICTA
Basah SN, Bab-Hadiashar A, Hoseinnezhad R (2009) Conditions for motion-background segmentation using fundamental matrix, IET Comput Vis
Basah SN, Bab-Hadiashar A, Hoseinnezhad R (2009) Conditions for segmentation of 2D translations of 3D objects. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)
Basah SN, Hoseinnezhad R, Bab-Hadiashar A (2014) Analysis of planar-motion segmentation using affine fundamental matrix, IET Comput Vis
Cambridge Dictionary, Definition of ‘sign language’ (2018). [Online]. Available: https://dictionary.cambridge.org/dictionary/english/sign-language. (Accessed: 04-Oct-2018).
Chen Y, Tao J, Liu L, Xiong J, Xia R, Xie J, ... & Yang K. (2020). Research of improving semantic image segmentation based on a feature fusion model. J Ambient Intell Humaniz Comput.
Cheok MJ, Omar Z, Jaward MH (2019) A review of hand gesture and sign language recognition techniques. Int J Mach Learn Cybern 10(1):131–153
Dong C, Leu MC, Yin Z (2015) American sign language alphabet recognition using microsoft kinect, pp. 44–52
Goh TY, Basah SN, Yazid H, Aziz Safar MJ, Ahmad Saad FS (2018) Performance analysis of image thresholding: Otsu technique. Meas J Int Meas Confed 114(June 2017):298–307
Hoseinnezhad R, Bab-Hadiashar A, Suter D (2010) Finite sample bias of robust estimators in segmentation of closely spaced structures: a comparative study. J Math Imaging Vis
Ibrahim NB, Selim MM, Zayed HH (2018) An automatic arabic sign language recognition system (ArSLRS). Journal of King Saud University-Computer and Information Sciences 30(4):470–477
Imagawa K, Lu S, Igi S (1998) Color-based hands tracking system for sign language recognition. Proc. - 3rd IEEE Int. Conf. Autom. Face Gesture Recognition, FG, pp. 462–467, 1998.
Johnston T, Schembri A (2007) Australian sign language (Auslan): an introduction to sign language linguistics
Joshi A, Sierra H, Arzuaga E (2017) American sign language translation using edge detection and cross correlation, 2017 IEEE Colomb Conf Commun Comput COLCOM 2017 - Proc., 2017.
Kakoty NM, Sharma MD (2018) Recognition of sign language alphabets and numbers based on hand kinematics using a data glove. Procedia Comput Sci 133:55–62
Kang SK, Nam MY, Rhee PK (2008) Color based hand and finger detection technology for user interaction. Proc. - 2008 Int. Conf. Converg. Hybrid Inf. Technol. ICHIT 2008, pp. 229–236
Konwar AS, Borah BS, Tuithung CT (2014) An American sign language detection system using HSV color model and edge detection, Int Conf Commun Signal Process ICCSP - Proc., pp. 743–747
Kumar P, Gauba H, Roy PP, Dogra DP (2017) Coupled HMM-based multi-sensor data fusion for sign language recognition. Pattern Recogn Lett 86:1–8
Ogden SK, Fei DL, Schilling NS, Ahmed YF, Hwa J, Robbins DJ (2008) A kind of method for selection of optimum threhold for segmentation of digital color plane image. Nature 456(7224):967–970
Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybernetics 9(1):62–66
Oxford Dictionary, Definition of sign in English (2018). [Online]. Available: https://en.oxforddictionaries.com/definition/sign. (Accessed: 04-Oct-2018).
Pigou L, Dieleman S, Kindermans PJ, Schrauwen B (2015) Sign language recognition using convolutional neural networks. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics)
Pugeault N, Bowden R (2011) Spelling it out: real-time ASL fingerspelling recognition. Proc IEEE Int Conf Comput Vis (November):1114–1119
Quinn M, Olszewska JI (2019) British sign language recognition in the wild based on Multi-Class SVM, 2019 Federated Conference on Computer Science and Information Systems (FedCSIS), Leipzig, Germany, pp. 81–86, https://doi.org/10.15439/2019F274.
Rafael R-CJ, Cinthia M-L, Genaro R-M, Antonio O-CJ, Jaime M-A (2016) Raise awareness in society about deafness: a proposal with learning objects and scenarios. Procedia – Soc Behav Sci 228(June):575–581
Ramirez-Cortes JM, Gomez-Gil P, Sanchez-Perez G, Prieto-Castro C (2009) Shape-based hand recognition approach using the morphological pattern spectrum. J Electron Imaging 18(1):013012
Ravikiran J, Mahesh K, Mahishi S, Dheeraj R, Sudheender S, Pujari NV (2009) Finger detection for sign language recognition. Proc. I(March):0–4
Rivera-Acosta M, Ortega-Cisneros S, Rivera J, Sandoval-Ibarra F (2017) American sign language alphabet recognition using a neuromorphic sensor and an artificial neural network. Sensors (Switzerland) 17(10)
Saini S, Arora K (2014) A study analysis on the different image segmentation techniques. Int J Inf Comput Technol 4(14):1445–1452
Shah P, Pandya K, Shah H, Gandhi J (2019) Survey on vision based hand gesture recognition. International Journal of Computer Sciences and Engineering 7:281–288
Sharma R, Nemani Y, Kumar S, Kane L, Khanna P (2013) Recognition of single handed sign language gestures using contour tracing descriptor. Proc World Congr Eng II, WCE 2013, July 3–5, 2013, London, U.K.:1–5
Shukor AZ, Miskon MF, Jamaluddin MH, Ibrahim FBA, Asyraf MF, Bahar MBB (2015) A new data glove approach for malaysian sign language detection. Procedia Comput Sci 76(Iris):60–67
Som HM, Zain JM, Ghazali AJ (2011) An application of threshold techniques for readability improvement of Jawi. 2(2):60–69
Tao W, Leu MC, Yin Z (2018) American sign language alphabet recognition using convolutional neural networks with multiview augmentation and inference fusion. Eng Appl Artif Intell 76(July):202–213
Thalange A, Dixit SK (2016) COHST and wavelet features based static ASL numbers recognition. Procedia Comput Sci 92:455–460
Tripathi K, Nandi NBGC (2015) Continuous Indian sign language gesture recognition and sentence formation. Procedia Comput Sci 54:523–531
Tubaiz N, Shanableh T, Assaleh K (2015) Glove-based continuous Arabic sign language recognition in user-dependent mode. IEEE Transactions on Human-Machine Systems 45(4):526–533
Vala HJ, Baxi PA (2013) A review on otsu image segmentation algorithm. 2(2):387–389
Yang H-D (2014) Sign language recognition with the Kinect sensor based on conditional random fields. Sensors 15(1):135–147
Yang HD, Sclaroff S, Lee S-W (2009) Sign language spotting with a threshold model based on conditional random fields. IEEE Trans Pattern Anal Mach Intell 31(7):1264–1277
Yang H, Sclaroff S, Member S (July 2009) Sign language spotting with a threshold model based on conditional random fields. IEEE Trans Pattern Anal Mach Intell 31(7):1264–1277
Yang W, Tao J, Ye Z (2016) Continuous sign language recognition using level building based on fast hidden Markov model. Pattern Recogn Lett 78:28–35
Yong Z, Jiazheng Y, Hongzhe L, QIing L (2017) GrabCut image segmentation algorithm based on structure tensor. 43(8)
Zadghorban M, Nahvi M (2018) An algorithm on sign words extraction and recognition of continuous Persian sign language based on motion and shape features of hands. Pattern Anal Applic 21(2):323–335
Zhang Q, Chen F, Liu X (2008, July) Hand gesture detection and segmentation based on difference background image with complex background. In 2008 International Conference On Embedded Software and Systems (pp. 338-343). IEEE.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tan, Z.Y., Basah, S.N., Yazid, H. et al. Performance analysis of Otsu thresholding for sign language segmentation. Multimed Tools Appl 80, 21499–21520 (2021). https://doi.org/10.1007/s11042-021-10688-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10688-4