Hierarchical Few-Shot Object Detection: Problem, Benchmark and Method

Authors:

Shuigeng Zhou

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 2002 - 2011

https://doi.org/10.1145/3503161.3548412

Published: 10 October 2022

Abstract

Few-shot object detection (FSOD) aims to detect objects from only a few examples. However, existing FSOD methods do not consider the hierarchical, fine-grained category structures that are widespread in real life. For example, animals are taxonomically classified into orders, families, genera, and species. In this paper, we propose and solve a new problem called hierarchical few-shot object detection (Hi-FSOD), which aims to detect objects with hierarchical categories in the FSOD paradigm. To this end, we first build HiFSOD-Bird, the first large-scale, high-quality Hi-FSOD benchmark dataset, which contains 176,350 wild-bird images spanning 1,432 categories. All categories are organized into a 4-level taxonomy consisting of 32 orders, 132 families, 572 genera, and 1,432 species. Second, we propose HiCLPL, the first Hi-FSOD method, in which a hierarchical contrastive learning approach constrains the feature space so that the feature distribution of objects is consistent with the hierarchical taxonomy, strengthening the model's generalization power. In addition, a probabilistic loss is designed to enable child nodes to correct the classification errors of their parent nodes in the taxonomy. Extensive experiments on HiFSOD-Bird show that HiCLPL outperforms existing FSOD methods.
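As a rough illustration of the 4-level taxonomy described in the abstract, the sketch below stores each category as an order/family/genus/species path and measures how far apart two categories sit in the tree. This is not the authors' implementation: the names `Taxon`, `shared_depth`, and `tree_distance` are hypothetical, the example birds are chosen for illustration and are not claimed to come from HiFSOD-Bird, and the distance shown is only one plausible signal that a hierarchy-aware contrastive objective might use.

```python
from dataclasses import dataclass

# The four taxonomy levels named in the abstract, from coarse to fine.
LEVELS = ("order", "family", "genus", "species")


@dataclass(frozen=True)
class Taxon:
    """A leaf category with its full path in the taxonomy (hypothetical type)."""
    order: str
    family: str
    genus: str
    species: str

    def path(self):
        # Root-to-leaf path, coarse to fine.
        return (self.order, self.family, self.genus, self.species)


def shared_depth(a, b):
    """Count the leading taxonomy levels on which a and b agree (0 to 4)."""
    depth = 0
    for x, y in zip(a.path(), b.path()):
        if x != y:
            break
        depth += 1
    return depth


def tree_distance(a, b):
    """Levels below the deepest common ancestor: 0 for the same species, 4 for different orders."""
    return len(LEVELS) - shared_depth(a, b)


# Illustrative categories only (assumption: not taken from the dataset).
mallard = Taxon("Anseriformes", "Anatidae", "Anas", "Anas platyrhynchos")
pintail = Taxon("Anseriformes", "Anatidae", "Anas", "Anas acuta")
mute_swan = Taxon("Anseriformes", "Anatidae", "Cygnus", "Cygnus olor")
house_sparrow = Taxon("Passeriformes", "Passeridae", "Passer", "Passer domesticus")

for other in (mallard, pintail, mute_swan, house_sparrow):
    print(other.species, tree_distance(mallard, other))
```

Under this sketch, two species in the same genus are at distance 1, while species from different orders are at distance 4. A hierarchy-aware feature space would be expected to keep low-distance pairs closer together than high-distance pairs.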

Supplementary Material

MP4 File (MM22-fp3100.mp4)


