The illustration shows APMFNet using the ACSA653 dataset to extract visual (RGB and optical flow) and skeleton (joint and bone) information, fusing these to achieve state...
Abstract:
Stereotyped movements play a crucial role in diagnosing Autism Spectrum Disorder (ASD). However, recognizing them poses challenges, due to limited data availability and t...Show MoreMetadata
Abstract:
Stereotyped movements play a crucial role in diagnosing Autism Spectrum Disorder (ASD). However, recognizing them poses challenges, due to limited data availability and the movements' specificity and varying duration. To support in-depth analysis of ASD children's movements, we constructed the ACSA653 dataset, comprising 653 videos across six classes of stereotyped movements. This dataset surpasses existing ones in both scale and category. To improve the recognition of stereotyped movements, we propose APMFNet, a model that integrates three modules: Visual Motion Learning (VML), Skeleton Relation Mining (SRM), and Multi-channel Fusion (MF). The VML module focuses on extracting spatial and motion information from RGB and optical-flow sequences. The SRM module effectively mines essential motion patterns associated with stereotyped movements through cross-modal graph. The MF module fuses multi-modal information through cross-modality attention to facilitate decision-making. Tested on ACSA653, APMFNet outperforms current state-of-the-art methods, suggesting its potential to identify stable patterns of stereotyped movements in children with ASD.
The illustration shows APMFNet using the ACSA653 dataset to extract visual (RGB and optical flow) and skeleton (joint and bone) information, fusing these to achieve state...
Published in: IEEE Journal of Biomedical and Health Informatics ( Volume: 29, Issue: 3, March 2025)