Max-Pooling Convolutional Neural Network for Chinese Digital Gesture Recognition

Qian, Zhao; Yawei, Li; Mengyu, Zhu; Yuliang, Yang; Ling, Xiao; Chunyu, Xu; Lin, Li

doi:10.1007/978-3-319-38771-0_8

Zhao Qian^17,20,
Li Yawei¹⁸,
Zhu Mengyu¹⁹,
Yang Yuliang¹⁸,
Xiao Ling²⁰,
Xu Chunyu²⁰ &
…
Li Lin²¹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 455))

1317 Accesses
1 Citations

Abstract

A pattern recognition approach is proposed for the Chinese digital gesture. We shot a group of digital gesture videos by a monocular camera. Then, the video was converted into frame format and turned into the gray image. We selected the gray image as our own dataset. The dataset was divided into six gesture classes and other meaningless gestures. We use the neural network (NN) combining convolution and Max-Pooling (MPCNN) for classification of digital gestures. The MPCNN presents some differences on the data preprocessing, the activation function and the network structure. The accuracy and the robustness have been verified by the simulation experiments with the dataset. The result shows that the MPCNN classifies six gesture classes with 99.98 % accuracy using the Max-Pooling, the Relu activation function, and the binarization processing.

Zhu Mengyu—This work is supported by the national High Technology Research and Development Program of China (2015AA042300).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bretzner L, Laptev I, Lindeberg T (2002) Hand Gesture Recognition using Multi-Scale Colour Features, Hierarchical Models and Particle Filtering. In: Proceedings of the IEEE confer-ence on automatic face and gesture recognition (FGR2002), pp 423–428
Google Scholar
Neverova N, Wolf C, Paci G, Sommavilla, G, Taylor, GW, Nebout, F (2013) A Multi-scale approach to gesture detection and recognition. In: Proceedings of the IEEE international conference on computer vision workshops (ICCVW), pp 484–491
Google Scholar
Manresa C, Varona J, Mas R, Perales F (2005) Hand tracking and gesture recognition for human-computer interaction. Electron Lett Comput Vis Image Anal 5(3):96–104
Google Scholar
LeCun Y, Jie Huang F, Bottou L (2004) Learning methods for generic object recognition with invariance to pose and lighting Computer Vision and Pattern Recognition”. In: Proceedings of the IEEE computer society conference, vol 2, No 2, pp 97–104
Google Scholar
Wang RY, Popovic J (2009) Real-time hand-tracking with a color glove. ACM Trans Graph 28(3):63
Google Scholar
Liu N, Lovell BC, Kootsookos PJ (2004) Model structure selection & training algorithms for an HMM gesture recognition system, Ninth international workshop on frontiers in handwriting recognition, IEEE conference publications, IWFHR-9. pp 100–105
Google Scholar
Bluche T, Ney H, Kermorvant C (2013) Tandem HMM with convolutional neural network for handwritten word recognition, In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 2390–2394
Google Scholar
Nagi J, Ducatelle F, Di Caro GA, Ciresan D, Meier U, Giusti A, Nagi F, Schmidhuber J, Gambardella LM (2011) Max-pooling convolutional neural networks for vision-based hand gesture recognition. In: Proceedings of the IEEE international conference on signal and image processing applications (ICSIPA), pp 342–347
Google Scholar
Duffner S, Berlemont S, Lefebvre G, Garcia C (2014) 3D gesture classification with convolutional neural networks. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 5432–5436
Google Scholar
Yamashita T, Watasue T (2014) Hand posture recognition based on bottom-up structured deep convolutional neural network with curriculum learning. In: Proceedings of the IEEE international conference on image processing (ICIP), pp 853–857
Google Scholar
Kim, H-J, Lee JS, Park J-H (2010) Dynamic hand gesture recognition using a CNN model with 3D receptive fields. In: Proceedings of the international conference on neural networks and signal processing, 2008, pp 14–19
Google Scholar
Simonyan K, Zisserman A (2014) Very Deep Convolutional Networks for Large-Scale Image Recognition, arXiv preprint arXiv, pp 1409–1556
Google Scholar
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. In: Proceedings of the IEEE, vol 86, No 11, pp 2278–2324
Google Scholar
Lauer F, Suen CY, Bloch G (2007) A trainable feature extractor for handwritten digit recognition, Pattern Recognit (S0031-3203), 40(6):1816–1824
Google Scholar
Francke H, Ruiz-del-Solar J, Verschae R (2007) Real time hand gesture detection and recognition using boosted classifiers and active learning, In: Proceedings of the 2nd Pacific Rim conference on advances in image and video technology (PSIVT07), pp 533–547
Google Scholar
Nagi J, Giusti A, Nagi F, Gambardella LM, Di Caro GA (2014) Online feature extraction for the incremental learning of gestures in human-swarm interaction. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), pp 3331–3338
Google Scholar
LeCun Y, Kavukcuoglu K, Farabet C (2010) Convolutional networks and appli-cations in vision. In: Proceedings of the IEEE international symposium on circuits and systems, pp 253–226
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Technology, Jilin University, Changchun, China
Zhao Qian
School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
Li Yawei & Yang Yuliang
School of Life Science and Technology, Beijing Institute of Technology, Beijing, China
Zhu Mengyu
Luoyang Hydraulic Engineering Technical Institute, Luoyang, China
Zhao Qian, Xiao Ling & Xu Chunyu
Beijing Canbao Architectural Design and Research Institute, Beijing, China
Li Lin

Authors

Zhao Qian
View author publications
You can also search for this author in PubMed Google Scholar
Li Yawei
View author publications
You can also search for this author in PubMed Google Scholar
Zhu Mengyu
View author publications
You can also search for this author in PubMed Google Scholar
Yang Yuliang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Ling
View author publications
You can also search for this author in PubMed Google Scholar
Xu Chunyu
View author publications
You can also search for this author in PubMed Google Scholar
Li Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhu Mengyu .

Editor information

Editors and Affiliations

Department of Automation and Applied Informatics, Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
University of South Australia, Bournemouth University, Poole, UK, and University of South Australia, Adelaide, Australia
Lakhmi C. Jain
School of Information Engineering, Chang'an University, Xi'an, China
Xiangmo Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qian, Z. et al. (2017). Max-Pooling Convolutional Neural Network for Chinese Digital Gesture Recognition. In: Balas, V., Jain, L., Zhao, X. (eds) Information Technology and Intelligent Transportation Systems. Advances in Intelligent Systems and Computing, vol 455. Springer, Cham. https://doi.org/10.1007/978-3-319-38771-0_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-38771-0_8
Published: 06 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38769-7
Online ISBN: 978-3-319-38771-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics