Loading web-font TeX/Math/Italic
Complementary Parts Contrastive Learning for Fine-Grained Weakly Supervised Object Co-Localization | IEEE Journals & Magazine | IEEE Xplore

Complementary Parts Contrastive Learning for Fine-Grained Weakly Supervised Object Co-Localization


Abstract:

The aim of weakly supervised object co-localization is to locate different objects of the same superclass in a dataset. Recent methods achieve impressive co-localization ...Show More

Abstract:

The aim of weakly supervised object co-localization is to locate different objects of the same superclass in a dataset. Recent methods achieve impressive co-localization performance by multiple instance learning and self-supervised learning. However, these methods ignore the common part information shared by fine-grained objects and the influence of the complementary parts on the co-localization of the fine-grained objects. To solve these issues, we propose a complementary parts contrastive learning method for fine-grained weakly supervised object co-localization. The proposed method follows such an assumption that fine-grained object parts with the same/different semantic meaning should have similar/dissimilar feature representations in the feature space. The proposed method tackles two critical issues in this task: i) how to spread the model’s attention and suppress the complex background noise, and ii) how to leverage the cross-category common parts information to mitigate the context co-occurrence problem. To address i) , we attempt to integrate local and context cues via three types of attention including self-supervised attention, channel, and spatial attention to spread the model’s attention toward automatically identifying and localizing most discriminative parts of objects in the fine-grained images. To solve ii) , we propose a cross-category object complementarity part contrastive learning module to identify the extracted part regions with different semantic information by pulling the same part features closer and pushing different part features away, which can mitigate the confounding bias caused by the co-occurrence surroundings within specific classes. Extensive qualitative and quantitative evaluations demonstrate the effectiveness of the proposed method on four fine-grained co-localization datasets: CUB-200–2011, Stanford Cars, FGVC-Aircraft, and Stanford Dogs. Code and models are available at https://github.com/Zhao-fan/CPCL.
Page(s): 6635 - 6648
Date of Publication: 10 April 2023

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.