Abstract
This paper presents a different type of video analysis problem which is cultural activity analysis in general and Indian Classical Dance (ICD) classification in particular. To tackle this problem we propose a novel method for space time interest point (STIP) detection and description using differential geometry. Each video is represented by sparse code of STIP descriptors in each frame and then classification is done by a non-linear SVM with χ 2-kernel. We have created a ICD dataset of six classes (Bharatanatyam, Kathak, Kuchipudi, Mohiniyattam, Manipuri and Odissi) from YouTube and got on an average 68.18% accuracy which is better than the performance of state-of-the-art general human activity classification methods. We also have tested our algorithm on the benchmark datasets, like UCF sports and KTH, and the accuracy is comparable to that of the state-of-the-art.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: A review. ACMCS 43 (2011)
Turaga, P., Chellappa, R., Subrahmanian, V.S., Udrea, O.: Machine recognition of human activities: A survey. IEEE Trans. CSVT 18, 1473–1488 (2008)
Massey, R.: India’s Dances. Abhinav Publication (2004)
Wang, H., Ullah, M.M., Klas̈er, A., Laptev, I., Schmid, C.: Evaluation of local spatio-temporal features for action recognition. In: BMVC (2009)
Laptev, I.: On space-time interest points. IJCV 64, 107–123 (2005)
Harris, C., Stephens, M.: A combined corner and edge detector. In: AVC (1988)
Dollár, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS (2005)
Bregonzio, M., Xiang, T., Gong, S.: Fusing appearance and distribution information of interest points for action recognition. PR 45, 1220–1234 (2012)
Willems, G., Tuytelaars, T., Van Gool, L.: An efficient dense and scale-invariant spatio-temporal interest point detector. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 650–663. Springer, Heidelberg (2008)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. CVIU 110, 346–359 (2008)
Oikonomopoulos, A., Patras, I., Pantic, M.: Spatiotemporal salient points for visual recognition of human actions. IEEE Trans. SMC, Part B 36, 710–719 (2006)
Samanta, S., Purkait, P., Chanda, B.: Indian classical dance classification by learning dance pose bases. In: WACV, pp. 265–270 (2012)
Haralick, R.M., Shapiro, L.G.: Computer and Robot Vision. AWPC (1992)
Samanta, S., Chanda, B.: Fastip: a new method for detection and description of space-time interest points for human activity classification. In: ICVGIP (2012)
Klaser, A., Marszalek, M., Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: BMVC (2008)
Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Online learning for matrix factorization and sparse codings. JMLR 11, 19–68 (2010)
Laptev, I., Marszaek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: CVPR (2008)
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines. ACM Trans. IST 27, 1–27 (2011)
Wang, H., Kläser, A., Schmid, C., Cheng-Lin, L.: Action recognition by dense trajectories. In: CVPR, pp. 3169–3176 (2011)
Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach: A spatio-temporal maximum average correlation height filter for action recognition. In: CVPR (2008)
Kovashka, A., Grauman, K.: Learning a hierarchy of discriminative space-time neighborhood features for human action recognition. In: CVPR (2010)
Guha, T., Ward, R.K.: Learning sparse representations for human action recognition. IEEE Trans. PAMI 34, 1576–1588 (2012)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: ICPR (2004)
Nowozin, S., Bakir, G., Tsuda, K.: Discriminative subsequence mining for action classification. In: ICCV (2007)
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos in the wild. In: CVPR (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Samanta, S., Chanda, B. (2013). A Novel Technique for Space-Time-Interest Point Detection and Description for Dance Video Classification. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2013. Lecture Notes in Computer Science, vol 8033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41914-0_50
Download citation
DOI: https://doi.org/10.1007/978-3-642-41914-0_50
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41913-3
Online ISBN: 978-3-642-41914-0
eBook Packages: Computer ScienceComputer Science (R0)