Research article · DOI: 10.1145/3581783.3612280

Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition

Published: 27 October 2023

Abstract

Using a graph convolution network (GCN) to construct and aggregate node features has proven helpful for skeleton-based action recognition. The strength of the relations among the joints of an action sequence is what distinguishes it from other actions. This work proposes a novel spatial module, Multi-Scale Self-Relational Graph Convolution (MS-SRGC), for dynamically modeling the joint relations of action instances. Modeling the joints' relations is crucial for determining the spatial distinctiveness between skeleton sequences; hence, MS-SRGC is effective for activity recognition. We also propose a Hybrid Multi-Scale Temporal Convolution Network (HMS-TCN) that captures different ranges of time steps along the temporal dimension of the skeleton sequence. In addition, we propose a Spatio-Temporal Blackout (STB) module that randomly zeroes several consecutive frames for selected strategic joint groups. We stack our spatial (MS-SRGC) and temporal (HMS-TCN) modules sequentially to form a Self-Relational Graph Convolution Network (SR-GCN) block, from which we construct our SR-GCN model, and we append the STB module on top of the SR-GCN model for the randomized operation. Exploiting the effectiveness of ensemble networks, we perform extensive experiments with single and multiple ensembles. Our results beat the state-of-the-art methods on the NTU RGB+D, NTU RGB+D 120, and Northwestern-UCLA datasets.
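The abstract builds on standard graph convolution for aggregating joint features over the skeleton graph. The paper's MS-SRGC module itself is not reproduced here; as background, the following is a minimal NumPy sketch of the plain Kipf-and-Welling-style GCN layer that such modules extend. The function name, tensor shapes, and the ReLU choice are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def gcn_layer(x, adj, weight):
    """One graph-convolution layer: symmetrically normalize the skeleton
    adjacency (with self-loops), aggregate neighboring joint features,
    then apply a linear transform and ReLU.

    x: (V, C_in) joint features; adj: (V, V) adjacency; weight: (C_in, C_out).
    """
    a_hat = adj + np.eye(adj.shape[0])              # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))   # D^{-1/2} of the augmented graph
    a_norm = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(a_norm @ x @ weight, 0.0)     # aggregate, transform, ReLU
```

Stacking several such layers with different (learned or multi-hop) adjacencies is the usual route to the "multi-scale" spatial modeling the abstract describes.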
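The STB module is described only at the level of "randomly zeroing consecutive frames for selected joint groups", so the following is a hedged NumPy sketch of that stated behavior. The function name, the (frames, joints, channels) layout, and the sampling choices are assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def spatio_temporal_blackout(x, joint_groups, span, rng=None):
    """Zero a random run of `span` consecutive frames for one randomly
    chosen joint group (e.g. an arm or a leg).

    x: (T, V, C) skeleton sequence; joint_groups: list of joint-index lists.
    """
    if rng is None:
        rng = np.random.default_rng()
    x = x.copy()                                   # leave the input sequence intact
    t0 = rng.integers(0, x.shape[0] - span + 1)    # random start frame of the blackout
    group = joint_groups[rng.integers(len(joint_groups))]  # pick one joint group
    x[t0:t0 + span, group, :] = 0.0                # blank those joints over the window
    return x
```

Applied during training, such a blackout acts as a structured dropout (in the spirit of Dropout/DropGraph) that forces the model not to rely on any single joint group or time window.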

Supplemental Material

MP4 File
This talk presents the research scope of our paper, describes the open issues in this area, and then succinctly explains our approach, experimental results, and principal contributions.


Cited By

  • (2024) Towards Multi-view Consistent Graph Diffusion. Proceedings of the 32nd ACM International Conference on Multimedia, 186-195. DOI: 10.1145/3664647.3681258. Online publication date: 28-Oct-2024.
  • (2024) Localized Linear Temporal Dynamics for Self-Supervised Skeleton Action Recognition. IEEE Transactions on Multimedia 26, 10189-10199. DOI: 10.1109/TMM.2024.3405712. Online publication date: 1-Jan-2024.
  • (2024) Multi-scale sampling attention graph convolutional networks for skeleton-based action recognition. Neurocomputing 597, 128086. DOI: 10.1016/j.neucom.2024.128086. Online publication date: Sep-2024.

    Published In

    MM '23: Proceedings of the 31st ACM International Conference on Multimedia
    October 2023
    9913 pages
    ISBN:9798400701085
    DOI:10.1145/3581783
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. self-relational graph
    2. skeleton-based action recognition
    3. spatio-temporal blackout
    4. temporal convolution

    Conference

    MM '23
    Sponsor:
    MM '23: The 31st ACM International Conference on Multimedia
    October 29 - November 3, 2023
    Ottawa ON, Canada

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%
