Abstract
In this paper, we aim for the recognition of a set of dance gestures from contemporary ballet. Our input data are motion trajectories followed by the joints of a dancing body provided by a motion-capture system. It is obvious that direct use of the original signals is unreliable and expensive. Therefore, we propose a suitable tool for non-uniform sub-sampling of spatio-temporal signals. The key to our approach is the use of a deformable model to provide a compact and efficient representation of motion trajectories. Our dance gesture recognition method involves a set of hidden Markov models (HMMs), each of them being related to a motion trajectory followed by the joints. The recognition of such movements is then achieved by matching the resulting gesture models with the input data via HMMs. We have validated our recognition system on 12 fundamental movements from contemporary ballet performed by four dancers.
Similar content being viewed by others
Notes
Ballet Atlantique Régine Chopinot.
A flow field (a 2D vector at each pixel) containing s pixels is represented as a long 1D vector consisting of 2s elements.
References
Aggarwal JK, Cai Q (1999) Human motion analysis: a review. Comput Vis Image Und 73(3):428–440
Baum LE, Eagon J (1967) An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology. Bull Amer Math Soc 73:360–363
Berndt D, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: Proceedings of the AAAI workshop on knowledge discovery in databases, Seattle, Washington, July 1994, pp 359–370
Berthold M, Hand DJ (1999) Intelligent data analysis: an introduction, 1st edn. Springer, Berlin Heidelberg New York
Black MJ, Jepson AD (1998) Recognizing temporal trajectories using the condensation algorithm. In: Proceedings of the 3rd international conference on automatic face and gesture recognition (FG’98), Nara, Japan, April 1998, pp 16–21
Bouet M, Khenchaf A, Briand H (1999) Shape representation for image retrieval. In: Proceedings of the 7th ACM international multimedia conference (ACM Multimedia’99), Orlando, Florida, October/November 1999, vol 2, pp 1–4
Boukir S, Beets E, Chenevière F (2002) Représentation et compression de signaux spatio-temporels. Technical report RT-2002-09-001, Laboratoire L3i, Université de La Rochelle, France
Brand ME, Hertzmann A (2000) Style machines. In: Proceedings of the 27th annual conference on computer graphics (SIGGRAPH 2000), New Orleans, Louisiana, July 2000, pp 183–192
Bregler C (1997) Learning and recognizing human dynamics in video sequences. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR’97), San Juan, Puerto Rico, June 1997, pp 928–934
Bruderlin A, Williams L (1995) Motion signal processing. In: Proceedings of the 22nd annual conference on computer graphics (SIGGRAPH’95), Los Angeles, California, August 1995, pp 97–104
Brunzell H, Eriksson J (2000) Feature reduction for classification of multidimensional data. Pattern Recogn 33:1741–1748
Bunke H, Caelli T (2001) Hidden Markov models: applications in computer vision. World Scientific, River Edge, New Jersey
Campbell L, Bobick A (1995) Recognition of human body motion using phase space constraints. In: Proceedings of the 5th international conference on computer vision (ICCV’95), Cambridge, Massachusetts, June 1995, pp 624–630
Campbell LW, Becker DA, Azarbayejani A, Bobick AF, Pentland A (1996) Invariant features for 3-D gesture recognition. In: Proceedings of the 2nd international conference on automatic face and gesture recognition (FG’96), Killingston, Vermont, October 1996, pp 157–162
Chappelier JC (1996) RST: une architecture connexionniste pour la prise en compte de relations spatiales et temporelles. PhD thesis, École Nationale Supérieure des Télécommunications, Paris, France
Cheneviére F (2002) Prétraitements et apprentissage de données spatio-temporelles dans le cadre du mouvement dansé. Technical report RT-2002-01-001, Laboratoire L3i, Université de La Rochelle, France
Cheneviére F, Boukir S (2004) Deformable model based data compression for gesture recognition. In: Proceedings of the 17th international conference on pattern recognition (ICPR 2004), Cambridge, UK August 2004
Cheneviére F, Boukir S, Vachon B (2002) A HMM-based dance gesture recognition system. In: Proceedings of the 9th international workshop on systems, signals and image processing (IWSSIP 2002), Manchester, UK, June 2002, pp 322–326
Chesnaud C (2000) Techniques statistiques de segmentation par contour actif et mise en oeuvre rapide. PhD thesis, Université de Droit d’Economie et des Sciences d’Aix-Marseille, France
Cohen E, Riesenfeld RF, Elber G (2001) Geometric modeling with splines: an introduction. AK Peters, Natick, Massachusetts
Daehlen M, Lyche T, Schumaker L (1995) Mathematical methods for curves and surfaces, 1st edn. Vanderbilt University Press, London
Eck M, Hadenfeld J (1995) Knot removal for B-spline curves. Computer Aided Geom Des (12):259–282
Elms AJ (1996) The representation and recognition of text using hidden Markov models. PhD Thesis, University of Surrey, UK
Frey PJ, George PL (1999) Maillages: applications aux éléments finis. Hermes, Paris, France
Gonzalez RC, Woods RE (2002) Digital image processing, 2nd edn. Prentice-Hall, Upper Saddle River, New Jersey
Herbison-Evans D (1991) Dance and the computer: a potential for graphic synergy. Technical Report 422, Basser Department of Computer Science, University of Sydney, Australia
Hoey J, Little JL (2000) Representation and recognition of complex human motion. In: Proceedings of the IEEE international conference on computer vision and pattern recognition (CVPR 2000), Hilton Head, South Carolina, June 2000, pp 1752–1759
Jacquemin C (1994) A temporal connectionist approach to natural language. ACM SIGART Bull 5(3):12–22
Johnson DE (1998) Applied multivariate methods for data analysts. Duxbury Press, Pacific Grove, California
Kass M, Witkin A, Terzopoulos D (1987) Snakes: active contour models. In: Proceedings of the 1st international conference on computer vision, London, UK, June 1987, pp 259–268
Keogh EJ, Pazzani MJ (1999) Scaling up dynamic time warping to massive datasets. In: Proceedings of the 3rd European conference on principles and practice of data mining and knowledge discovery in databases (PKDD’99), Prague, Czech Republic, September 1999, pp 1–11
Lang KJ, Waibel AH, Hinton GE (1990) A time-delay neural-network architecture for isolated word recognition. Neural Netw 3:23–44
Latecki LJ, Lakämper R (1999) Convexity rule for shape decomposition based on discrete contour evolution. Comput Vis Image Und 73(3):441–454
Le Buhan Jordan C, Ebrahimi T (1997) Progressive polygon encoding of shape contours. In: Proceedings of the 6th international conference on image processing and its applications (IPA’97), Dublin, Ireland, July 1997, vol 1, pp 17–21
Lee HK, Kim JH (1999) An HMM-based threshold model approach for gesture recognition. IEEE Trans PAMI 21(10):961–972
McInerney TJ (1997) Topologically adaptable deformable models for medical image analysis. PhD thesis, Department of Computer Science, University of Toronto, Canada
Müller K, Ohm JR (1999) Wavelet-based contour descriptor. Technical report, MPEG-7 proposal no P567
Montagnat J, Delingette H, Ayache N (2001) A review of deformable surfaces: topology, geometry and deformation. Image Vis Comput 19(14):1023–1040
Nagaya S, Seki S, Oka R (1996) A theoretical consideration of pattern space trajectory for gesture spotting recognition. In: Proceedings of the 2nd international conference on automatic face and gesture recognition (FG’96), Killingston, Vermont, October 1996, pp 72–77
Ormoneit D, Sidenbladh S, Black MJ, Hastie T (2001) Learning and tracking cyclic human motion. In: Leen TK, Dietterich TG, Volker T (eds) Advances in neural information processing systems. MIT Press, Cambridge, Massachusetts, pp 894–900
Pinkowski B (1996) Fourier descriptors for characterizing object contour. In: Proceedings of the international conference on signal processing applications and technology (ICSPAT’96), Boston, Massachusetts, October 1996, pp 1007–1011
Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77(2):257–286
Rittscher J, Blake A (1999) Classification of human body motion. In: Proceedings of the international conference on computer vision (ICCV’99), Kerkyra, Greece, September 1999, pp 634–639
Rui Y, Anandan P (2000) Segmenting visual actions based on spatio-temporal motion patterns. In: Proceedings of the IEEE international conference on computer vision and pattern recognition (CVPR 2000), Hilton Head, South Carolina, June 2000, vol 1, pp 111–119
Saux E, Daniel M (1999) Data reduction of polygonal curves using B-splines. Comput Aided Des (31):507–515
Sudarsky S, House D (1998) Motion capture data manipulation and reuse via B-splines. In: Proceedings of the first international workshop on modelling and motion capture techniques for virtual environments (CapTech’98), Geneva, Switzerland, November 1998, pp 55–69
Suter D, Chen F (2000) Left ventricular motion reconstruction based on elastic vector splines. IEEE Trans Med Imaging 19(4):295–305
Szilas N (1997) Les réseaux récurrents supervisés : une revue critique. Technical report 972-I, Institut National Polytechnique de Grenoble, France
Wilson AD, Bobick AF (1999) Parametric hidden Markov models for gesture recognition. IEEE Trans PAMI 21(9):884–900
Winkler T (1997) Creating interactive dance with the very nervous system. In: Proceedings of the 1997 Connecticut college symposium on art and technology, New London, Connecticut, March 1997, pp 212–217
Yang MH, Ahuja N (1999) Recognizing hand gesture using motion trajectories. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR’99), Fort Collins, Colorado, June 1999, pp 466–472
Zaletelj J, Pecci R, Spaan F, Hanjalic A, Lagendijk RL (1998) Rate distortion optimal contour compression using cubic B-splines. In: Proceedings of the 9th European signal processing conference (Eusipco’98), Island of Rhodes, Greece, September 1998
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Boukir, S., Chenevière, F. Compression and recognition of dance gestures using a deformable model. Pattern Anal Applic 7, 308–316 (2004). https://doi.org/10.1007/s10044-004-0228-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-004-0228-z