DOI: 10.1145/3561613.3561621
research-article

Fine-tuning Faster-RCNN tailored to Feature Reweighting for Few-shot Object Detection

Published: 09 November 2022

ABSTRACT

Few-shot object detection has drawn growing attention in computer vision. One widely accepted task setting trains a network to detect both base classes, which have abundant images, and novel classes, which have only a few. Under this setting, two classic pipelines have been developed. The first is fine-tuning, which trains the detection network on images of base classes and then fine-tunes the last layer on images of both base and novel classes. The second is meta learning, in which one pioneering model uses a meta-learner to transform support images into reweighting vectors; these vectors reweight the query-image features produced by the feature extractor. A typical meta learning method splits training into two phases, meta-training and meta-testing: in meta-training the model is trained on base classes, and in meta-testing it is trained on both base and novel classes. In this paper, we combine the two pipelines. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and, during meta-testing, fine-tune the reweighting module together with only the last layer of Faster-RCNN. Experiments on NWPU VHR-10 images show that our method improves mAP by about 10 to 20 percentage points over both the reweighting and the fine-tuning methods.
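The reweighting step described in the abstract can be illustrated with a small sketch. It assumes the reweighting is a channel-wise multiplication, with one scalar per feature channel and one vector per class; the abstract does not spell this out, so treat it as one plausible reading rather than the authors' implementation. The function names, the toy feature values, and the class names are all hypothetical, and the reweighting vectors are hard-coded where a real system would take them from a meta-learner.

```python
from typing import Dict, List

# A feature map stored as nested lists: [channels][height][width].
FeatureMap = List[List[List[float]]]


def reweight_features(query: FeatureMap, weights: List[float]) -> FeatureMap:
    """Scale each channel of the query feature map by one scalar weight."""
    if len(query) != len(weights):
        raise ValueError("need exactly one weight per channel")
    return [
        [[w * v for v in row] for row in channel]
        for channel, w in zip(query, weights)
    ]


def class_specific_features(
    query: FeatureMap, class_weights: Dict[str, List[float]]
) -> Dict[str, FeatureMap]:
    """Produce one reweighted copy of the query features per class."""
    return {
        name: reweight_features(query, w) for name, w in class_weights.items()
    }


# Toy query feature map: 2 channels, each 2x2 (made-up values).
query_features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.5, 0.5], [1.0, 1.0]],
]

# Hypothetical reweighting vectors standing in for meta-learner outputs.
reweighting_vectors = {
    "airplane": [2.0, 0.0],
    "ship": [0.5, 4.0],
}

per_class = class_specific_features(query_features, reweighting_vectors)
for name, features in per_class.items():
    print(name, features)
```

In this reading, each class gets its own copy of the query features with channels emphasized or suppressed, and a shared detection head would then score each copy for the presence of that class.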

Published in

ICCCV '22: Proceedings of the 5th International Conference on Control and Computer Vision
August 2022
241 pages
ISBN: 9781450397315
DOI: 10.1145/3561613

Copyright © 2022 ACM

Publisher

Association for Computing Machinery
New York, NY, United States

Qualifiers

• research-article
• Research
• Refereed limited