Abstract
This work presents a unique new dataset and objectives for action analysis. The data presents 3 key challenges: tracking, classification, and judging action quality. The last of these, to our knowledge, has not yet been attempted in the vision literature as applied to sports where technique is scored.
This work performs an initial analysis of the dataset with classification experiments, confirming that temporal information is more useful than holistic bag-of-features style analysis in distinguishing dives. Our investigation lays a groundwork of effective tools for working with this type of sports data for future investigations into judging the quality of actions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rodriguez, M., Ahmed, J., Shah, M.: Action mach: A spatio-temporal maximum average correlation height filter for action recognition. In: Proc. CVPR (2008)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 2, 91–110 (2004)
Dalal, N., Triggs, B.: Histogram of oriented gradients for human detection. In: Proc. CVPR (2005)
Felzenszwalb, P., Girshick, D., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. PAMI (2010)
Efros, A.A., Berg, A.C., Mori, G., Malik, J.: Recognizing action at a distance. In: Proc. ICCV (2003)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: Proc. ICPR (2004)
Yeffet, L., Wolf, L.: Local trinary patterns for human action recognition. In: Proc. ICCV (2009)
Wilson, A., Bobick, A.: Parametric hidden markov models for gesture recognition. PAMI 21 (1999)
Raptis, M., Wnuk, K., Soatto, S.: Flexible dictionaries for action recognition. In: Proceedings of the 1st International Workshop on Machine Learning for Vision-based Motion Analysis, in conjunction with ECCV (2008)
Saad, A., Arslan, B., Mubarak, S.: Chaotic invariants for human action recognition. In: IEEE 11th International Conference on Computer Vision, pp. 1–8 (2007)
Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: Proc. CVPR (2008)
Fischler, M., Bolles, R.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Comm. of the ACM. 24, 381–395 (1981)
Ko, T., Soatto, S., Estrin, D.: Background subtraction on distributions. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 276–289. Springer, Heidelberg (2008)
Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008), http://www.vlfeat.org/
Shimodaira, H., Noma, K.I., Nakai, M., Sagayama, S.: Dynamic time alignment kernel in support vector machine. In: Proc. NIPS (2002)
Zhou, F., De la Torre, F., Hodgins, J.K.: Aligned cluster analysis for temporal segmentation of human motion. In: IEEE Conference on Automatic Face and Gestures Recognition (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wnuk, K., Soatto, S. (2011). Analyzing Diving: A Dataset for Judging Action Quality. In: Koch, R., Huang, F. (eds) Computer Vision – ACCV 2010 Workshops. ACCV 2010. Lecture Notes in Computer Science, vol 6468. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22822-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-22822-3_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22821-6
Online ISBN: 978-3-642-22822-3
eBook Packages: Computer ScienceComputer Science (R0)