Abstract
This paper is based on the Chinese sign language video library, and discusses the algorithm design of video classification based on handshape recognition of key frames in video. Video classification in sign language video library is an important part of sign language arrangement and is also the premise of video feature retrieval. At present, sign language video’s handshape classification work is done manually. The accuracy and correctness of the results are quite erroneous and erroneous. In this paper, from the angle of computer image analysis, the definition and extraction of key frames are carried out, and then the region of interest is identified. Finally, an improved SURF algorithm is used to match the area of interest and the existing hand image, and the classification of the video is completed. The entire process is based on the actual development environment, and it can be used for reference based on the classification of video image features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Johnson, A.Y., Bobick, A.F.: A multi-view method for gait recognition using static body parameters. In: Bigun, J., Smeraldi, F. (eds.) AVBPA 2001. LNCS, vol. 2091, pp. 301–311. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45344-X_44
Jiang, C., Jiang, X., Sun, T.: Video filtration for content security based on multimodal features. Inf. Secur. Communi. Privacy 3, 76–77 (2012). (in Chinese)
Feng, W., Gao, J., Bill, P.B., et al.: Wireless capsule endoscopy video classification using an unsupervised learning approach. J. Image Graph. 16(11), 2041–2046 (2011). (in Chinese)
Fischer, S., Lienhart, R., Effelsberg, W.: Automatic recognition of film genres. In: Proceedings of the 3rd ACM International Conference on Multimedia, pp. 295–304. ACM Press, New York (1995)
Huang, C.N., Fu, T.J., Chen, H.C.: Text-based video content classification for online video-sharing sites. J. Am. Soc. Inf. Sci. Technol. 61(5), 891–906 (2010)
Jiang, X.H., Sun, T.F., Wang, S.L.: An automatic video content classification scheme based on combined visual features model with modified DAGSVM. Multimedia Tools Appl. 52(1), 105–120 (2011)
Subashini, K., Palanivel, S., Ramalingam, V.: Audio-video based classification using SVM. IUP J. Sci. Technol. 7(1), 44–53 (2011)
Lowe, D.G.: Distinctive image features from scaleinvariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Zhen, E., Lin, J.: Unordered image key frame extraction based on image quality constraint. Comput. Eng. 43(11), 210–215 (2017)
Wang, Y., Sun, S., Ding, X.: A self-adaptive weighted affinity propagation clustering for key frames extraction on human action recognition. J. Vis. Commun. Image Representation 33(3), 193–202 (2015)
Yi, R., Tomasi, C., Guibas, L.J.: Mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40(2), 99–121 (2000)
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Image Understand. 18(6), 679–698 (1986)
Wang, X., Liu, X., Guan, Y.: Image edge detection algorithm based on improved Canny operator. Comput. Eng. 34(14), 196–198 (2012)
Wu, L., Yadong, C.Y.: Gesture recognition based on geometric features. Comput. Eng. Des. 35(2), 636–640 (2014). (in Chinese)
Binjue, Zhang, Liaoyin, Zhao, Yixuan, Wang: Fingeritip detection and gesture recognition based on kinect depth data. IEEE Trans. Comput. Sci. Technol. 3(1), 9–14 (2014)
Mitra, S., Acharya, T.: Gesture recognition: a servey. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 37(3), 311–324 (2007)
Acknowledgement
This work was supported by Surface Project of Natural Science Research in Colleges and Universities of Jiangsu China (No.16KJB520029), The Major Programs of Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 19KJA310002.) and The Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 17KJD520006).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Zhu, Z., Jiang, X., Zhang, J. (2020). Sign Language Video Classification Based on Image Recognition of Specified Key Frames. In: Zhang, YD., Wang, SH., Liu, S. (eds) Multimedia Technology and Enhanced Learning. ICMTEL 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 327. Springer, Cham. https://doi.org/10.1007/978-3-030-51103-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-51103-6_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51102-9
Online ISBN: 978-3-030-51103-6
eBook Packages: Computer ScienceComputer Science (R0)