Sign Language Video Classification Based on Image Recognition of Specified Key Frames

Zhu, Zhaosong; Jiang, Xianwei; Zhang, Juxiao

doi:10.1007/978-3-030-51103-6_33

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 327))

Included in the following conference series:

International Conference on Multimedia Technology and Enhanced Learning

519 Accesses

Abstract

This paper is based on the Chinese sign language video library, and discusses the algorithm design of video classification based on handshape recognition of key frames in video. Video classification in sign language video library is an important part of sign language arrangement and is also the premise of video feature retrieval. At present, sign language video’s handshape classification work is done manually. The accuracy and correctness of the results are quite erroneous and erroneous. In this paper, from the angle of computer image analysis, the definition and extraction of key frames are carried out, and then the region of interest is identified. Finally, an improved SURF algorithm is used to match the area of interest and the existing hand image, and the classification of the video is completed. The entire process is based on the actual development environment, and it can be used for reference based on the classification of video image features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Johnson, A.Y., Bobick, A.F.: A multi-view method for gait recognition using static body parameters. In: Bigun, J., Smeraldi, F. (eds.) AVBPA 2001. LNCS, vol. 2091, pp. 301–311. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45344-X_44
Chapter Google Scholar
Jiang, C., Jiang, X., Sun, T.: Video filtration for content security based on multimodal features. Inf. Secur. Communi. Privacy 3, 76–77 (2012). (in Chinese)
Google Scholar
Feng, W., Gao, J., Bill, P.B., et al.: Wireless capsule endoscopy video classification using an unsupervised learning approach. J. Image Graph. 16(11), 2041–2046 (2011). (in Chinese)
Google Scholar
Fischer, S., Lienhart, R., Effelsberg, W.: Automatic recognition of film genres. In: Proceedings of the 3rd ACM International Conference on Multimedia, pp. 295–304. ACM Press, New York (1995)
Google Scholar
Huang, C.N., Fu, T.J., Chen, H.C.: Text-based video content classification for online video-sharing sites. J. Am. Soc. Inf. Sci. Technol. 61(5), 891–906 (2010)
Article Google Scholar
Jiang, X.H., Sun, T.F., Wang, S.L.: An automatic video content classification scheme based on combined visual features model with modified DAGSVM. Multimedia Tools Appl. 52(1), 105–120 (2011)
Article Google Scholar
Subashini, K., Palanivel, S., Ramalingam, V.: Audio-video based classification using SVM. IUP J. Sci. Technol. 7(1), 44–53 (2011)
Google Scholar
Lowe, D.G.: Distinctive image features from scaleinvariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Chapter Google Scholar
Zhen, E., Lin, J.: Unordered image key frame extraction based on image quality constraint. Comput. Eng. 43(11), 210–215 (2017)
Google Scholar
Wang, Y., Sun, S., Ding, X.: A self-adaptive weighted affinity propagation clustering for key frames extraction on human action recognition. J. Vis. Commun. Image Representation 33(3), 193–202 (2015)
Article Google Scholar
Yi, R., Tomasi, C., Guibas, L.J.: Mover’s distance as a metric for image retrieval. Int. J. Comput. Vis. 40(2), 99–121 (2000)
Article MATH Google Scholar
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Image Understand. 18(6), 679–698 (1986)
Article Google Scholar
Wang, X., Liu, X., Guan, Y.: Image edge detection algorithm based on improved Canny operator. Comput. Eng. 34(14), 196–198 (2012)
Google Scholar
Wu, L., Yadong, C.Y.: Gesture recognition based on geometric features. Comput. Eng. Des. 35(2), 636–640 (2014). (in Chinese)
Google Scholar
Binjue, Zhang, Liaoyin, Zhao, Yixuan, Wang: Fingeritip detection and gesture recognition based on kinect depth data. IEEE Trans. Comput. Sci. Technol. 3(1), 9–14 (2014)
Google Scholar
Mitra, S., Acharya, T.: Gesture recognition: a servey. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 37(3), 311–324 (2007)
Article Google Scholar

Download references

Acknowledgement

This work was supported by Surface Project of Natural Science Research in Colleges and Universities of Jiangsu China (No.16KJB520029), The Major Programs of Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 19KJA310002.) and The Natural Science Foundation of the Jiangsu Higher Education Institutions of China (No. 17KJD520006).

Author information

Authors and Affiliations

Nanjing Normal University of Special Education, Nanjing, 210038, China
Zhaosong Zhu, Xianwei Jiang & Juxiao Zhang
Department of Informatics, University of Leicester, Leicester, LE1 7RH, UK
Xianwei Jiang

Authors

Zhaosong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Xianwei Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Juxiao Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juxiao Zhang .

Editor information

Editors and Affiliations

School of Informatics, University of Leicester, Leicestershire, UK
Yu-Dong Zhang
University of Leicester, Leicestershire, UK
Shui-Hua Wang
Human Normal University, Changsha, China
Shuai Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhu, Z., Jiang, X., Zhang, J. (2020). Sign Language Video Classification Based on Image Recognition of Specified Key Frames. In: Zhang, YD., Wang, SH., Liu, S. (eds) Multimedia Technology and Enhanced Learning. ICMTEL 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 327. Springer, Cham. https://doi.org/10.1007/978-3-030-51103-6_33

Download citation

DOI: https://doi.org/10.1007/978-3-030-51103-6_33
Published: 19 July 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51102-9
Online ISBN: 978-3-030-51103-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics