Abstract
In sign-language or gesture recognition, articulated hand motion tracking is usually a prerequisite to behaviour understanding. However the difficulties such as non-rigidity of the hand, complex background scenes, and occlusion etc make tracking a challenging task. In this paper we present a hybrid HMM/Particle filter tracker for simultaneously tracking and recognition of non-rigid hand motion. By utilising separate image cues, we decompose complex motion into two independent (non-rigid/rigid) components. A generative model is used to explore the intrinsic patterns of the hand articulation. Non-linear dynamics of the articulation such as fast appearance deformation can therefore be tracked without resorting to a complex kinematic model. The rigid motion component is approximated as the motion of a planar region, where a standard particle filter method suffice. The novel contribution of the paper is that we unify the independent treatments of non-rigid motion and rigid motion into a robust Bayesian framework. The efficacy of this method is demonstrated by performing successful tracking in the presence of significant occlusion clutter.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Aggarwal, J., Cai, Q.: Human Motion Analysis: A Review. Computer Vision and Image Understanding (1999)
Koller, D., Weber, J., Malik, J.: Robust Multiple Car Tracking with Occlusion Reasoning. In: Eklundh, J.-O. (ed.) ECCV 1994. LNCS, vol. 800. Springer, Heidelberg (1994)
Starner, T., Pentland, A.: Visual Recognition of American Sign Language Using Hidden Markov Model. In: Proc. International Workshop on Automatic Face and Gesture Recognition (1995)
Isard, M., Blake, A.: Active Contour. Springer, Heidelberg (1998)
Morris, D., Rehg, J.: Singularity Analysis for Articulated Object Tracking. In: Proc. Computer Vision and Pattern Recognition (1998)
Comaniciu, D., Ramesh, V., Meer, P.: Real-Time Tracking of Non-Rigid Objects using Mean Shift. In: Proc. Computer Vision and Pattern Recognition (2000)
Collins, R.: Mean-shift Blob Tracking through Scale Space. In: Proc. Computer Vision and Pattern Recognition (2003)
Rehg, J., Kanade, T.: Model-Based Tracking of Self-Occluding Articulated Objects. In: Proc. International Conference on Computer Vision (1995)
Roweis, S., Saul, L.: Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science (2000)
Palovic, V., Rehg, J., Cham, T., Murphy, K.: A Dynamic Bayesian Network Approach to Figure Tracking Using Learned Dynamic Models. In: Proc. International Conference on Computer Vision (1999)
Stenger, B., Thayananthan, A., Torr, P., Cipolla, R.: Filtering Using a Tree-Based Estimator. In: Proc. International Conference on Computer Vision (2003)
Black, M., Jepson, A.: Eigen tracking: Robust matching and tracking of an articulatedn objects using a view based representation. In: Buxton, B.F., Cipolla, R. (eds.) ECCV 1996. LNCS, vol. 1065. Springer, Heidelberg (1996)
Linde, A., Gray, R.: An algorithm for vector quantization design. IEEE.Trans. on Communications (1980)
Pérez, P., Hue, C., Vermaak, J., Gangnet, M.: Color-based probabilistic tracking. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 661–675. Springer, Heidelberg (2002)
Rabiner, R.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. IEEE (1989)
Toyama, K., Blake, A.: Probabilistic Tracking with Exemplars in a Metric Space. In: Proc. International Conference on Computer Vision (2001)
Sidenbladh, H., Black, M.: Learning Image Statistics for Bayesian Tracking. In: Proc. International Conference on Computer Vision (2001)
Isard, M., Blake, A.: ICondensation: Unifying low-level and high-level tracking in a stochastic framework. In: Burkhardt, H.-J., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1406, p. 893. Springer, Heidelberg (1998)
Brand, M.: Shadow Puppetry. In: Proc. International Conference on Computer Vision (1999)
Viterbi, A.: Error bounds for convolutional codes and an asymptically optimum decoding algorithm. IEEE. Transanction on Information Theory (1967)
Kass, M., Witkin, A., Terzopoulos, D.: Snakes: Active contour models. International Journal of Computer Vision (1987)
Bregler, C.: 1997: Learning and Recognizing Human Dynamics in Video Sequences. In: Proc. Computer Vision and Pattern Recognition (1997)
Birchfield, S.: Elliptical Head Tracking using Intensity Gradients and Colour Histogram. In: Proc. Computer Vision and Pattern Recognition (1998)
Gavrila, D., Philomin, V.: Real-time Object Detection for ‘smart’ Vehicles. In: International Conference on Computer Vision (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fei, H., Reid, I. (2004). Joint Bayes Filter: A Hybrid Tracker for Non-rigid Hand Motion Recognition. In: Pajdla, T., Matas, J. (eds) Computer Vision - ECCV 2004. ECCV 2004. Lecture Notes in Computer Science, vol 3023. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24672-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-24672-5_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21982-8
Online ISBN: 978-3-540-24672-5
eBook Packages: Springer Book Archive