Unsupervised Hyperbolic Action Recognition

Castro-Vargas, John-Alejandro; Garcia-Garcia, Alberto; Martinez-Gonzalez, Pablo; Oprea, Sergiu; Garcia-Rodriguez, Jose

doi:10.1007/978-3-031-21062-4_39

John-Alejandro Castro-Vargas¹⁴,
Alberto Garcia-Garcia¹⁴,
Pablo Martinez-Gonzalez¹⁴,
Sergiu Oprea¹⁴ &
…
Jose Garcia-Rodriguez¹⁴

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 590))

Included in the following conference series:

Iberian Robotics conference

936 Accesses

Abstract

Methods based on Deep Geometric Learning allow the development of solutions with a geometric approximation in different applications. In particular, the curved feature of hyperbolic space has the ability to describe hierarchical structures in a better manner. In this paper, we aim to define an unsupervised learning model for action recognition. The curved feature space is intended to be used to describe a hierarchical relationship between the clips that compose a complete video sequence. These, in turn, are related to each other by means of a triplet loss function and a VAE (Variational Auto-Encoder) neural architecture, which establishes a similarity relationship between clips to identify actions from a set of unlabelled data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Deep Embedding Features for Action Recognition on Raw Depth Maps

Second-order motion descriptors for efficient action recognition

Article 28 October 2020

Spatial-Temporal Neural Networks for Action Recognition

References

Ariza Colpas, P., et al.: Unsupervised human activity recognition using the clustering approach: a review. Sensors 20(9), 2702 (2020)
Article Google Scholar
Bronstein, M.M., Bruna, J., LeCun, Y., Szlam, A., Vandergheynst, P.: Geometric deep learning: going beyond euclidean data. IEEE Signal Process. Mag. 34(4), 18–42 (2017)
Article Google Scholar
Chaaraoui, A.A., Climent-Pérez, P., Flórez-Revuelta, F.: A review on vision techniques applied to human behaviour analysis for ambient-assisted living. Expert Syst. Appl. 39(12), 10873–10888 (2012)
Article Google Scholar
Cook, D.J., Crandall, A.S., Thomas, B.L., Krishnan, N.C.: Casas: a smart home in a box. Computer 46(7), 62–69 (2012)
Article Google Scholar
Doersch, C., Gupta, A., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1422–1430 (2015)
Google Scholar
Fernando, B., Bilen, H., Gavves, E., Gould, S.: Self-supervised video representation learning with odd-one-out networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3636–3645 (2017)
Google Scholar
Friji, R., Drira, H., Chaieb, F., Kchok, H., Kurtek, S.: Geometric deep neural network using rigid and non-rigid transformations for human action recognition. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 12611–12620 (2021)
Google Scholar
Hoffer, E., Hubara, I., Ailon, N.: Deep unsupervised learning through spatial contrasting. arXiv preprint arXiv:1610.00243 (2016)
Hsu, J., Gu, J., Wu, G., Chiu, W., Yeung, S.: Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations. In: Advances in Neural Information Processing Systems 34, 5112–5123 (2021)
Google Scholar
Hu, W.Y., Scott, J.S.: Behavioral obstacles in the annuity market. Financ. Anal. J. 63(6), 71–82 (2007)
Article Google Scholar
Huang, W., Wu, Q.J.: Human action recognition based on self organizing map. In: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2130–2133. IEEE (2010)
Google Scholar
Jing, L., Yang, X., Liu, J., Tian, Y.: Self-supervised spatiotemporal feature learning via video rotation prediction. arXiv preprint arXiv:1811.11387 (2018)
Larsson, G., Maire, M., Shakhnarovich, G.: Colorization as a proxy task for visual understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6874–6883 (2017)
Google Scholar
Li, Y., Paluri, M., Rehg, J.M., Dollár, P.: Unsupervised learning of edges. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1619–1627 (2016)
Google Scholar
Lou, A., Katsman, I., Jiang, Q., Belongie, S., Lim, S.N., De Sa, C.: Differentiating through the fréchet mean. In: International Conference on Machine Learning, pp. 6393–6403. PMLR (2020)
Google Scholar
Mathieu, E., Le Lan, C., Maddison, C.J., Tomioka, R., Teh, Y.W.: Continuous hierarchical representations with poincaré variational auto-encoders. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Google Scholar
Pu, Y., et al.: Variational autoencoder for deep learning of images, labels and captions. In: Advances in Neural Information Processing Systems, vol. 29 (2016)
Google Scholar
Rawassizadeh, R., Dobbins, C., Akbari, M., Pazzani, M.: Indexing multivariate mobile data through spatio-temporal event detection and clustering. Sensors 19(3), 448 (2019)
Article Google Scholar
Sarabu, A., Santra, A.K.: Human action recognition in videos using convolution long short-term memory network with spatio-temporal networks. Emerg. Sci. J. 5(1), 25–33 (2021)
Article Google Scholar
Surís, D., Liu, R., Vondrick, C.: Learning the predictability of the future. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12607–12617 (2021)
Google Scholar
Surís, D., Liu, R., Vondrick, C.: Learning the predictability of the future. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12607–12617 (2021)
Google Scholar
Tran, D., Wang, H., Torresani, L., Ray, J., LeCun, Y., Paluri, M.: A closer look at spatiotemporal convolutions for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6450–6459 (2018)
Google Scholar
Ungar, A.A.: The möbius gyrovector space. In: Beyond the Einstein Addition Law and its Gyroscopic Thomas Precession, pp. 161–210. Springer (2001). https://doi.org/10.1007/0-306-47134-5_6
Wang, X., Gupta, A.: Unsupervised learning of visual representations using videos. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2794–2802 (2015)
Google Scholar
Yan, S., Xiong, Y., Lin, D.: Spatial temporal graph convolutional networks for skeleton-based action recognition. In: Thirty-Second AAAI Conference on Artificial Intelligence, pp. 7444–7452 (2018)
Google Scholar
Yao, G., Lei, T., Zhong, J.: A review of convolutional-neural-network-based action recognition. Pattern Recogn. Lett. 118, 14–22 (2019)
Article Google Scholar

Download references

Acknowledgment

We would like to thank “A way of making Europe” European Regional Development Fund (ERDF) and MCIN/AEI/10.13039/501100011033 for supporting this work under the MoDeaAS project (grant PID2019-104818RB-I00). This work has also been supported by two Spanish national grants for PhD studies, FPU17/00166, and UAFPU2019-13 respectively. Furthermore, we would like to thank Nvidia for their generous hardware donation that made these experiments possible.

Author information

Authors and Affiliations

3D Perception Lab, University of Alicante, Alicante, Spain
John-Alejandro Castro-Vargas, Alberto Garcia-Garcia, Pablo Martinez-Gonzalez, Sergiu Oprea & Jose Garcia-Rodriguez

Authors

John-Alejandro Castro-Vargas
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Garcia-Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Martinez-Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
Sergiu Oprea
View author publications
You can also search for this author in PubMed Google Scholar
Jose Garcia-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to John-Alejandro Castro-Vargas .

Editor information

Editors and Affiliations

Centro Universitario de la Defensa (CUD), Zaragoza, Spain
Danilo Tardioli
Grupo de Robótica, Universidad de León, León, Spain
Vicente Matellán
GRVC Robotics Lab, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Sevilla, Spain
Guillermo Heredia
School of Engineering, Polytechnic Institute of Porto, Porto, Portugal
Manuel F. Silva
Institute of Systems and Robotics, University of Coimbra, Coimbra, Portugal
Lino Marques

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Castro-Vargas, JA., Garcia-Garcia, A., Martinez-Gonzalez, P., Oprea, S., Garcia-Rodriguez, J. (2023). Unsupervised Hyperbolic Action Recognition. In: Tardioli, D., Matellán, V., Heredia, G., Silva, M.F., Marques, L. (eds) ROBOT2022: Fifth Iberian Robotics Conference. ROBOT 2022. Lecture Notes in Networks and Systems, vol 590. Springer, Cham. https://doi.org/10.1007/978-3-031-21062-4_39

Download citation

DOI: https://doi.org/10.1007/978-3-031-21062-4_39
Published: 19 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21061-7
Online ISBN: 978-3-031-21062-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics