sNN-LDS: Spatio-temporal Non-negative Sparse Coding for Human Action Recognition

Guthier, Thomas; Šošić, Adrian; Willert, Volker; Eggert, Julian

doi:10.1007/978-3-319-11179-7_24

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8681)

Abstract

Current state-of-the-art approaches for visual human action recognition focus on complex local spatio-temporal descriptors, while the spatio-temporal relations between the descriptors are discarded. These bag-of-features (BOF) based methods come with the disadvantage of limited descriptive power, because class-specific mid- and large-scale spatio-temporal information, such as body pose sequences, cannot be represented. To overcome this restriction, we propose sparse non-negative linear dynamical systems (sNN-LDS) as a dynamic, parts-based, spatio-temporal representation of local descriptors. We provide novel learning rules based on sparse non-negative matrix factorization (sNMF) to simultaneously learn both the parts as well as their transitions. On the challenging UCF-Sports dataset our sNN-LDS combined with simple local features is competitive with state-of-the-art BOF-SVM methods.
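
To make the idea of non-negative parts and transitions concrete, the following is a minimal sketch of generic sparse NMF with multiplicative updates. It is not the learning rule proposed in the paper, which the abstract does not spell out. It assumes a Euclidean reconstruction cost, an L1 penalty on the activations, and a simple column-normalization heuristic for the parts. It also fits the transition matrix in a separate second step, whereas the paper learns parts and transitions simultaneously. The names `sparse_nmf`, `fit_transitions`, `lam`, `W`, `H` and `A` are illustrative assumptions.

```python
import numpy as np


def sparse_nmf(V, k, lam=0.1, n_iter=200, seed=0, eps=1e-9):
    """Factor a non-negative matrix V (features x frames) as V ~ W @ H.

    W holds k non-negative parts (columns); H holds their sparse,
    non-negative activations over time. Generic sketch, not the paper's rule.
    """
    rng = np.random.default_rng(seed)
    n, t = V.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, t)) + eps
    for _ in range(n_iter):
        # Heuristic: keep parts at unit length so the L1 penalty on H
        # cannot be sidestepped by scaling W up and H down.
        W /= np.linalg.norm(W, axis=0, keepdims=True) + eps
        # Multiplicative update for activations; lam is the sparsity weight.
        H *= (W.T @ V) / (W.T @ W @ H + lam + eps)
        # Multiplicative update for the parts.
        W *= (V @ H.T) / (W @ (H @ H.T) + eps)
    # Final normalization, rescaling H so that W @ H is unchanged.
    norms = np.linalg.norm(W, axis=0, keepdims=True) + eps
    W /= norms
    H *= norms.T
    return W, H


def fit_transitions(H, n_iter=200, seed=0, eps=1e-9):
    """Fit a non-negative matrix A with H[:, 1:] ~ A @ H[:, :-1].

    Assumption: first-order linear dynamics on the activations, fitted
    after the factorization rather than jointly with it.
    """
    rng = np.random.default_rng(seed)
    k = H.shape[0]
    A = rng.random((k, k)) + eps
    X, Y = H[:, :-1], H[:, 1:]
    for _ in range(n_iter):
        A *= (Y @ X.T) / (A @ (X @ X.T) + eps)
    return A


if __name__ == "__main__":
    # Toy data standing in for a sequence of non-negative local descriptors.
    V = np.random.default_rng(1).random((20, 50))
    W, H = sparse_nmf(V, k=4)
    A = fit_transitions(H)
    print(W.shape, H.shape, A.shape)
```

Because every update multiplies by a ratio of non-negative terms, `W`, `H` and `A` stay non-negative without explicit projection, which is what makes the learned factors interpretable as additive parts.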

Author information

Authors and Affiliations

Control Theory and Robotics, TU Darmstadt, Landgraf-Georg Strasse 4, Darmstadt, Germany
Thomas Guthier & Volker Willert
Signal Processing Group, TU Darmstadt, Merckstrasse 25, Darmstadt, Germany
Adrian Šošić
Honda Research Institute Europe, Carl-Legien Strasse 30, Offenbach, Germany
Julian Eggert

Editor information

Editors and Affiliations

Department of Informatics, University of Hamburg, Vogt-Kölln-Straße 30, 22527, Hamburg, Germany
Stefan Wermter, Cornelius Weber & Sven Magg
Department of Informatics, Nicolaus Copernicus University, ul. Grudziądzka 5, 87-100, Torun, Poland
Włodzisław Duch
Department of Modern Languages, University of Helsinki, P.O. Box 24, 00014, Helsinki, Finland
Timo Honkela
Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, Acad. G. Bonchev str. bl. 25A, 1113, Sofia, Bulgaria
Petia Koprinkova-Hristova
Institute of Neural Information Processing, University of Ulm, 89069, Oberer Eselsberg, Ulm, Germany
Günther Palm
Department of Information Systems, Quartier UNIL-Dorigny, Bâtiment Internef, University of Lausanne, 1015, Lausanne, Switzerland
Alessandro E. P. Villa

Cite this paper

Guthier, T., Šošić, A., Willert, V., Eggert, J. (2014). sNN-LDS: Spatio-temporal Non-negative Sparse Coding for Human Action Recognition. In: Wermter, S., et al. Artificial Neural Networks and Machine Learning – ICANN 2014. ICANN 2014. Lecture Notes in Computer Science, vol 8681. Springer, Cham. https://doi.org/10.1007/978-3-319-11179-7_24

DOI: https://doi.org/10.1007/978-3-319-11179-7_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11178-0
Online ISBN: 978-3-319-11179-7
eBook Packages: Computer Science, Computer Science (R0)