Conclusion
In this study, we propose modeling context both visually and semantically by combining a visual graph and a semantic graph, and we learn vital contextual cues for the HOI problem through a group of graph update modules, comprising graph inner-update modules and graph cross-update modules. We fuse the contextual features from the visual and semantic graphs with the visual features of the human-object pairs in a unified network to detect HOIs. We evaluate the proposed model on two challenging datasets, HICO-DET and V-COCO, and demonstrate excellent performance. Our work can serve as a reference for modeling contextual information in the HOI problem.
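To make the summarized pipeline concrete, below is a minimal PyTorch sketch of the dual-graph scheme: an inner-update step within each graph, a cross-update step between the two graphs, and fusion of the pooled context with per-pair visual features. All module names, dimensions, the mean-pooling choice, and the single-head attention form are illustrative assumptions, not the authors' implementation.

```python
# A hypothetical sketch of dual-graph context learning for HOI detection.
# Names, dimensions, and the attention form are assumptions for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class InnerUpdate(nn.Module):
    """Updates nodes within one graph via attention-weighted message passing."""

    def __init__(self, dim):
        super().__init__()
        self.query = nn.Linear(dim, dim)
        self.key = nn.Linear(dim, dim)
        self.value = nn.Linear(dim, dim)

    def forward(self, nodes):                                 # nodes: (N, dim)
        attn = self.query(nodes) @ self.key(nodes).t()        # (N, N) scores
        attn = F.softmax(attn / nodes.size(-1) ** 0.5, dim=-1)
        return nodes + attn @ self.value(nodes)               # residual update


class CrossUpdate(nn.Module):
    """Propagates context from a source graph into a target graph."""

    def __init__(self, dim):
        super().__init__()
        self.query = nn.Linear(dim, dim)
        self.key = nn.Linear(dim, dim)
        self.value = nn.Linear(dim, dim)

    def forward(self, target, source):                        # (N, dim), (M, dim)
        attn = self.query(target) @ self.key(source).t()      # (N, M) scores
        attn = F.softmax(attn / target.size(-1) ** 0.5, dim=-1)
        return target + attn @ self.value(source)


class DualGraphContext(nn.Module):
    """One round of inner updates on each graph, then cross updates,
    then fusion of the pooled context with per-pair visual features."""

    def __init__(self, dim, num_actions):
        super().__init__()
        self.vis_inner = InnerUpdate(dim)
        self.sem_inner = InnerUpdate(dim)
        self.vis_cross = CrossUpdate(dim)
        self.sem_cross = CrossUpdate(dim)
        self.classifier = nn.Sequential(
            nn.Linear(3 * dim, dim), nn.ReLU(), nn.Linear(dim, num_actions))

    def forward(self, vis_nodes, sem_nodes, pair_feats):
        vis = self.vis_inner(vis_nodes)                       # within-graph
        sem = self.sem_inner(sem_nodes)
        vis = self.vis_cross(vis, sem)                        # cross-graph
        sem = self.sem_cross(sem, vis)
        ctx = torch.cat([vis.mean(0), sem.mean(0)])           # pooled context
        ctx = ctx.expand(pair_feats.size(0), -1)              # one copy per pair
        return self.classifier(torch.cat([pair_feats, ctx], dim=-1))


# Toy usage: 5 visual nodes, 8 semantic nodes, 3 human-object pairs,
# 117 action classes as in HICO-DET.
model = DualGraphContext(dim=64, num_actions=117)
scores = model(torch.randn(5, 64), torch.randn(8, 64), torch.randn(3, 64))
print(scores.shape)  # torch.Size([3, 117])
```

In practice, the visual nodes would come from detected instance features and the semantic nodes from word embeddings of the category labels, typically with several stacked update rounds rather than the single round shown here.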
Acknowledgements
This work was supported by the National Key Research and Development Program of China (Grant No. 2018AAA0100802).
Supporting information
Appendixes A–C. The supporting information is available online at info.scichina.com and link.springer.com. The supporting materials are published as submitted, without typesetting or editing. The responsibility for scientific accuracy and content remains entirely with the authors.
Cite this article
Wu, T., Duan, F., Chang, L. et al. Human-object interaction detection via interactive visual-semantic graph learning. Sci. China Inf. Sci. 65, 160108 (2022). https://doi.org/10.1007/s11432-021-3427-2