skip to main content
10.1145/3647649.3647678acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicigpConference Proceedingsconference-collections
research-article

Improving the Long-tailed Remote Sensing Target Detection via Target-level Comparision

Authors Info & Claims
Published:03 May 2024Publication History

ABSTRACT

With the continuous development of earth observation technology, high-resolution satellite images with more detailed ground object information have become increasingly prevalent. As a result, multi-classification target detection has become one of the hot spots in this field. However, in reality, remote sensing images face the problem of limited samples under a long-tailed distribution, which hinders the model's ability to extract discriminative features for the target. To address this issue, we designed the TCLM module based on contrastive learning, aiming to address the difficulty of feature extraction for tail class targets. Meanwhile, there exists another issue: complex background noises also interfere with target feature extraction. To overcome this problem of background noise interference in complex remote sensing images, we propose the PARM module, which can help the model to focus on the target itself, thereby decreasing its attention on the background. Experimental results on the MSAR and DOTA datasets show that our method significantly improves detection performance.

References

  1. F. Liu, R. Chen, J. Zhang, K. Xing, H. Liu, and J. Qin, “R2yolox: A lightweight refined anchor-free rotated detector for object detection in aerial images,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, 2022.Google ScholarGoogle Scholar
  2. P. Shamsolmoali, M. Zareapoor, J. Chanussot, H. Zhou, and J. Yang, “Rotation equivariant feature image pyramid network for object de- tection in optical remote sensing imagery,” IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–14, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  3. Z. Ren, B. Hou, Q. Wu, Z. Wen, and L. Jiao, “A distribution and structure match generative adversarial network for sar image classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 6, pp. 3864–3880, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  4. J. Geng, X. Deng, X. Ma, and W. Jiang, “Transfer learning for sar image classification via deep joint distribution adaptation networks,” IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 8, pp. 5377–5392, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  5. W. Xie, G. Ma, F. Zhao, H. Liu, and L. Zhang, “Polsar image classifica- tion via a novel semi-supervised recurrent complex-valued convolution neural network,” Neurocomputing, vol. 388, pp. 255–268, 2020.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Tan, C. Wang, B. Li, Q. Li, W. Ouyang, C. Yin, and J. Yan, “Equalization loss for long-tailed object recognition,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 11 662–11 671.Google ScholarGoogle ScholarCross RefCross Ref
  7. B. Li, Y. Yao, J. Tan, G. Zhang, F. Yu, J. Lu, and Y. Luo, “Equalized focal loss for dense long-tailed object detection,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 6990–6999.Google ScholarGoogle ScholarCross RefCross Ref
  8. Z. H. Jie Chen, B. W. Runfan Xia, L. S. Lei Sheng, and B. Yao., “Large- scale multi-class sar image target detection dataset-1.0[ol],” Journal of Radars, 2022.Google ScholarGoogle Scholar
  9. R. Xia, J. Chen, Z. Huang, H. Wan, B. Wu, L. Sun, B. Yao, H. Xiang, and M. Xing, “Crtranssar: A visual transformer based on contextual joint representation learning for sar ship detection,” Remote Sensing, vol. 14, no. 6, p. 1488, 2022.Google ScholarGoogle ScholarCross RefCross Ref
  10. G.-S. Xia, X. Bai, J. Ding, Z. Zhu, S. Belongie, J. Luo, M. Datcu, M. Pelillo, and L. Zhang, “Dota: A large-scale dataset for object detection in aerial images,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, pp. 3974–3983.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Improving the Long-tailed Remote Sensing Target Detection via Target-level Comparision

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICIGP '24: Proceedings of the 2024 7th International Conference on Image and Graphics Processing
      January 2024
      480 pages
      ISBN:9798400716720
      DOI:10.1145/3647649

      Copyright © 2024 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 May 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)2

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format