Graph attention mechanism with global contextual information for multi-label image recognition

Xiaoxiao Ban; Peihua Li; Qilong Wang; Shoujun Zhou; Shijie Guo; Yuanquan Wang

doi:10.1117/1.JEI.30.6.063031

Journal of Electronic Imaging, Vol. 30, Issue 6, 063031 (December 2021). https://doi.org/10.1117/1.JEI.30.6.063031

Abstract

Recent work has shown that multi-label image recognition remains a challenging task in computer vision because of the complexity and diversity of multi-label images. However, existing methods overlook label co-occurrence correlations and the global contextual information that relates the image space to the objects in it. We present a model that addresses both problems. On the one hand, we devise a graph attention mechanism to compute the hidden representations of the different categories in multi-label images. It assigns different weights to different neighboring objects and thereby models label dependencies well. On the other hand, we iterate the global contextual information through second-order covariance pooling to enhance nonlinear modeling capability, and we use a basic residual network to extract features. The proposed model is thoroughly evaluated on the PASCAL VOC 2007 and MS-COCO datasets. Compared with the classical ML-GCN, the model better combines image features and label embeddings. Experiments also show that it outperforms state-of-the-art methods such as the residual multi-layer perceptron, EfficientNet, and the vision transformer.
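The two building blocks named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes a generic single-head graph attention layer over label nodes and plain (unnormalized, non-iterative) covariance pooling of a convolutional feature map. The function names, argument shapes, and the LeakyReLU slope of 0.2 are all hypothetical choices made for illustration.

```python
import numpy as np


def graph_attention(h, adj, W, a, slope=0.2):
    """Generic single-head graph attention over label nodes (illustrative only).

    h:   (N, F) node features, e.g. one embedding per label
    adj: (N, N) adjacency matrix WITH self-loops, e.g. from label co-occurrence
    W:   (F, F_out) shared linear projection (assumed learned elsewhere)
    a:   (2 * F_out,) attention vector (assumed learned elsewhere)
    """
    z = h @ W                                  # projected features, (N, F_out)
    f_out = z.shape[1]
    # Score e[i, j] = LeakyReLU(a^T [z_i || z_j]), split into two dot products.
    src = z @ a[:f_out]                        # contribution of node i
    dst = z @ a[f_out:]                        # contribution of neighbor j
    e = src[:, None] + dst[None, :]
    e = np.where(e > 0, e, slope * e)          # LeakyReLU
    # Non-neighbors are masked so they receive zero weight after the softmax.
    e = np.where(adj > 0, e, -np.inf)
    e = e - e.max(axis=1, keepdims=True)       # numerical stability
    weights = np.exp(e)
    alpha = weights / weights.sum(axis=1, keepdims=True)
    # Each node becomes a weighted sum of its neighbors' projected features.
    return alpha @ z, alpha


def covariance_pooling(x):
    """Second-order pooling of a (C, H, W) feature map into a (C, C) covariance."""
    c = x.shape[0]
    feats = x.reshape(c, -1)                   # (C, H*W), one column per location
    feats = feats - feats.mean(axis=1, keepdims=True)
    return feats @ feats.T / feats.shape[1]
```

In this sketch each row of `alpha` sums to one and is zero wherever `adj` has no edge, which is what "different weights to different neighbor objects" amounts to for a single attention head. The self-loop assumption matters: a row of `adj` with no nonzero entry would make the softmax undefined.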

Citation

Xiaoxiao Ban, Peihua Li, Qilong Wang, Shoujun Zhou, Shijie Guo, and Yuanquan Wang "Graph attention mechanism with global contextual information for multi-label image recognition," Journal of Electronic Imaging 30(6), 063031 (28 December 2021). https://doi.org/10.1117/1.JEI.30.6.063031

Received: 4 August 2021; Accepted: 8 December 2021; Published: 28 December 2021
