Abstract
Autism is a childhood developmental disorder characterized by impairments in social interaction, verbal and nonverbal communication, narrow interests, and repetitive stereotypes, starting in infancy and early childhood. Children’s Response to Instruction (RTI) is an important protocol in the diagnosis of children with autism. The so-called verbal response refers to a child’s response to a doctor’s verbal or gestural instructions in the process of performing a specific task. This paper proposes an end-to-end network for detecting Human-Object Interaction (HOI), which can simply, quickly and efficiently evaluate the behaviors of children, doctors, and parents in the RTI protocol. Through behavior analysis, we can determine whether the children’s performance in this protocol is normal, and can also give a screening score for autism. In order to verify the proposed method, 16 children aged 5–8 are recruited to take participate in this study. The HOI detection accuracy and autism screening accuracy can reach 81.9% and 89%, respectively, which proves the effectiveness of our method in ASD screening.
Supported by GuangDong Basic and Applied Basic Research Foundation (2021A1515110438).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Gao, C., Zou, Y., Huang, J.B.: iCAN: instance-centric attention network for human-object interaction detection, pp. 1–13. arXiv preprint arXiv:1808.10437 (2018)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Kim, D.-J., Sun, X., Choi, J., Lin, S., Kweon, I.S.: Detecting human-object interactions with action co-occurrence priors. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12366, pp. 718–736. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58589-1_43
Li, J., Chen, Z., Li, G., Ouyang, G., Li, X.: Automatic classification of ASD children using appearance-based features from videos. Neurocomputing 470, 40–50 (2022)
Liao, Y., Liu, S., Wang, F., Chen, Y., Qian, C., Feng, J.: PPDM: parallel point detection and matching for real-time human-object interaction detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 482–490 (2020)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Liu, J., Wang, Z., Xu, K., Ji, B., Liu, H.: Early screening of autism in toddlers via response-to-instructions protocol. IEEE Trans. Cybern. 52, 3914–3924 (2020)
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization, pp. 1–8 (2017)
Qin, H., et al.: Vision-based pointing estimation and evaluation in toddlers for autism screening. In: Liu, X.-J., Nie, Z., Yu, J., Xie, F., Song, R. (eds.) ICIRA 2021. LNCS (LNAI), vol. 13015, pp. 177–185. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-89134-3_17
Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., Savarese, S.: Generalized intersection over union: a metric and a loss for bounding box regression. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 658–666 (2019)
Stewart, R., Andriluka, M., Ng, A.Y.: End-to-end people detection in crowded scenes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2325–2333 (2016)
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 1–11 (2017)
Wang, T., et al.: Deep contextual attention for human-object interaction detection. In: IEEE International Conference on Computer Vision, pp. 5694–5702 (2019)
Wang, X., Zhang, J., Zhang, H., Zhao, S., Liu, H.: Vision-based gaze estimation: a review. IEEE Trans. Cogn. Dev. Syst. 14, 316–332 (2021)
Zhang, D., Toptan, C.M., Zhang, G., Zhao, S., Liu, H.: Diversity and complexity of hand movement for autism spectrum disorder intervention. In: International Conference on Advanced Computational Intelligence (2021)
Zhang, H., et al.: Gaze-driven interaction system for cognitive ability assessment. In: International Conference on Intelligent Control and Information Processing, pp. 346–351 (2021)
Zhong, X., Qu, X., Ding, C., Tao, D.: Glance and gaze: inferring action-aware points for one-stage human-object interaction detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 13234–13243 (2021)
Zhou, T., Qi, S., Wang, W., Shen, J., Zhu, S.C.: Cascaded parsing of human-object interaction recognition. IEEE Trans. Pattern Anal. Mach. Intell. 44, 2827–2840 (2022)
Zou, C., et al.: End-to-end human object interaction detection with HOI transformer. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 11825–11834 (2021)
Acknowledgment
This research is supported by GuangDong Basic and Applied Basic Research Foundation (2021A1515110438).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, W. et al. (2022). Detection of Response to Instruction in Autistic Children Based on Human-Object Interaction. In: Liu, H., et al. Intelligent Robotics and Applications. ICIRA 2022. Lecture Notes in Computer Science(), vol 13458. Springer, Cham. https://doi.org/10.1007/978-3-031-13841-6_64
Download citation
DOI: https://doi.org/10.1007/978-3-031-13841-6_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13840-9
Online ISBN: 978-3-031-13841-6
eBook Packages: Computer ScienceComputer Science (R0)