Results and Analysis of the ChaLearn Gesture Challenge 2012

Guyon, Isabelle; Athitsos, V.; Jangyodsuk, P.; Escalante, H. J.; Hamner, B.

doi:10.1007/978-3-642-40303-3_19

Isabelle Guyon²⁰,
V. Athitsos²¹,
P. Jangyodsuk²¹,
H. J. Escalante²² &
…
B. Hamner²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7854))

Included in the following conference series:

International Workshop on Depth Image Analysis and Applications

1386 Accesses
33 Citations

Abstract

The Kinect^TMcamera has revolutionized the field of computer vision by making available low cost 3D cameras recording both RGB and depth data, using a structured light infrared sensor. We recorded and made available a large database of 50,000 hand and arm gestures. With these data, we organized a challenge emphasizing the problem of learning from very few examples. The data are split into subtasks, each using a small vocabulary of 8 to 12 gestures, related to a particular application domain: hand signals used by divers, finger codes to represent numerals, signals used by referees, Marshalling signals to guide vehicles or aircrafts, etc. We limited the problem to single users for each task and to the recognition of short sequences of gestures punctuated by returning the hands to a resting position. This situation is encountered in computer interface applications, including robotics, education, and gaming. The challenge setting fosters progress in transfer learning by providing for training a large number of subtasks related to, but different from the tasks on which the competitors are tested.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 49.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multi-layered Gesture Recognition with Kinect

Lightweight Deep Learning Models for Robust Hand Gesture Recognition

Challenges in Multi-modal Gesture Recognition

References

Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Trans. Pattern Anal. Mach. Intell. 23(3), 257–267 (2001)
Article Google Scholar
Bradski, G.: The OpenCV Library. Dr. Dobb’s Journal of Software Tools (2000)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, pp. 886–893 (2005)
Google Scholar
Escalante, H.J., Guyon, I.: Principal motion: PCA-based reconstruction of motion histograms. Technical report, ChaLearn (2012)
Google Scholar
Escalera, S., Fornés, A., Pujol, O., Lladós, J., Radeva, P.: Circular blurred shape model for multiclass symbol recognition. IEEE Transactions on Systems, Man, and Cybernetics, Part B 41(2), 497–506 (2011)
Article Google Scholar
Fanelli, G., Gall, J., Van Gool, L.J.: Real time head pose estimation with random regression forests. In: CVPR, pp. 617–624 (2011)
Google Scholar
Gallo, L., Placitelli, A.P., Ciampi, M.: Controller-free exploration of medical image data: Experiencing the kinect. In: CBMS, pp. 1–6 (2011)
Google Scholar
Gori, I., Fanello, S.R., Metta, G., Odone, F.: All gestures you can: a memory game. Technical report, Istituto Italiano di Tecnologia, Italy, Submitted to JMLR (2012)
Google Scholar
Guyon, I., Athitsos, V., Jangyodsuk, P., Escalante, H.J.: The ChaLearn Gesture Dataset (CGD 2011). Submitted to Machine Vision and Applications (2013)
Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.H.: The elements of statistical learning: data mining, inference, and prediction: with 200 full-color illustrations. Springer, New York (2001)
Google Scholar
Keskin, C., Kira, F., Kara, Y.E., Akarun, L.: Randomized decision forests for static and dynamic hand shape classification. In: CVPR Workshops, pp. 31–36. IEEE (2012)
Google Scholar
Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press (2009)
Google Scholar
Lafferty, J.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data, pp. 282–289. Morgan Kaufmann (2001)
Google Scholar
Laptev, I.: On space-time interest points. International Journal of Computer Vision 64(2-3), 107–123 (2005)
Article Google Scholar
Lucena, M., de la Blanca, N.P., Fuertes, J.M., Marín-Jiménez, M.J.: Human action recognition using optical flow accumulated local histograms. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds.) IbPRIA 2009. LNCS, vol. 5524, pp. 32–39. Springer, Heidelberg (2009)
Chapter Google Scholar
Malgireddy, M., Nwogu, I., Govindaraju, V.: Language-motivated approaches to action recognition. Submitted to JMLR (2013)
Google Scholar
Oikonomidis, I., Kyriazis, N., Argyros, A.A.: Tracking the articulated motion of two strongly interacting hands. In: CVPR, pp. 1862–1869 (2012)
Google Scholar
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Transactions on Knoweledge and Data Engineering 22(10), 1345–1359 (2010)
Article Google Scholar
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 257–286 (1989)
Google Scholar
Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory IT-13(2), 260–269 (1967)
Article Google Scholar
Wan, J., Ruan, Q., Li, W.: One-shot learning gesture recognition from rgb-d data using bag-of-features. JMLR (in press, 2013)
Google Scholar

Download references

Author information

Authors and Affiliations

ChaLearn, Berkeley, California, USA
Isabelle Guyon
University of Texas at Arlington, Texas, USA
V. Athitsos & P. Jangyodsuk
INAOE, Puebla, Mexico
H. J. Escalante
Kaggle, San Francisco, California, USA
B. Hamner

Authors

Isabelle Guyon
View author publications
You can also search for this author in PubMed Google Scholar
V. Athitsos
View author publications
You can also search for this author in PubMed Google Scholar
P. Jangyodsuk
View author publications
You can also search for this author in PubMed Google Scholar
H. J. Escalante
View author publications
You can also search for this author in PubMed Google Scholar
B. Hamner
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Computer Science, University of Münster, Einsteinstraße 62, 48149, Münster, Germany
Xiaoyi Jiang
IMAGO Research Group, Centro Politécnico- Jardim das Américas, Universidade Federal do Paraná, Caixa Postal 19092, CEP 81531-980, Curitiba, PR, Brazil
Olga Regina Pereira Bellon
Department of Computer Science and Engineering, University of South Florida, 4202 East Fowler Ave, ENB 118, 33620, Tampa, FL, USA
Dmitry Goldgof
Institute of Industrial Science, The University of Tokyo, Komaba 4-6-1, Meguro-ku, Tokyo, Japan
Takeshi Oishi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guyon, I., Athitsos, V., Jangyodsuk, P., Escalante, H.J., Hamner, B. (2013). Results and Analysis of the ChaLearn Gesture Challenge 2012. In: Jiang, X., Bellon, O.R.P., Goldgof, D., Oishi, T. (eds) Advances in Depth Image Analysis and Applications. WDIA 2012. Lecture Notes in Computer Science, vol 7854. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40303-3_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-40303-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40302-6
Online ISBN: 978-3-642-40303-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics