skip to main content
10.1145/3594315.3594316acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Remote Sensing Image Object Detection Algorithm Combining Attention

Published: 02 August 2023 Publication History

Abstract

To address the issue of the complex background being difficult to differentiate and the object percentage, rotation angle, and position affecting the identification accuracy, a better object detection algorithm for remote sensing images is given. This algorithm uses the improved frequency channel attention to make the network take more notice of the foreground information, in order to achieve the effect of suppressing the complex background information. It also introduces a multiple attention mechanism in the baseline, which is conducive to alleviating the issue of increasing the detection difficulty due to the different proportion of objects in remote sensing images, arbitrary rotation angle, and position. Experiments on the DIOR, NWPUVHR-10, and RSOD datasets, respectively, are conducted to confirm the efficacy of the proposed algorithm. The suggested method's average accuracy on the DIOR dataset is 1.46% higher than that of the single-stage ATSS algorithm, and it has also produced results that are competitive on the NWPUVHR-10 and RSOD datasets.

References

[1]
Huang, S., Wang, Y., & Su, P. (2016). "A New Synthetical Method of Feature Enhancement and Detection for SAR Image Targets". Journal of Image and Graphics, 4(2), 73-77.
[2]
Khan, R., Raisa, T. F., & Debnath, R. (2018). "An efficient contour based fine-grained algorithm for multi category object detection". Journal of Image and Graphics, 6(2), 127-136.
[3]
Utaminingrum, F., & Prasetya, R. P. (2020). Rizdania," Combining Multiple Feature for Robust Traffic Sign Detection". Journal of Image and Graphics, 8(2), 53-58.
[4]
Spiess, F., Reinhart, L., Strobel, N., Kaiser, D., Kounev, S., & Kaupp, T. (2021). "People detection with depth silhouettes and convolutional neural networks on a mobile robot". Journal of Image and Graphics, 9(4), 135-139.
[5]
Hasegawa, R., Iwamoto, Y., & Chen, Y. W. (2020). "Robust Japanese road sign detection and recognition in complex scenes using convolutional neural networks". Journal of Image and Graphics, 8(3), 59-66.
[6]
Liu, Y., Geng, L., Zhang, W., Gong, Y., & Xu, Z. (2021). "Survey of video based small target detection". Journal of Image and Graphics, 9(4), 122-134.
[7]
Lo, E. (2019). "Target Detection Algorithms in Hyperspectral Imaging Based on Discriminant Analysis". Journal of Image and Graphics, 7(4), 140-144.
[8]
Zhang, S., C. Chi, Y. Yao, Z. Lei and S. Z. Li (2020). "Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 9759-9768.
[9]
Lin, T. Y., Goyal, P., Girshick, R., He, K., & P Dollár. (2017). "Focal loss for dense object detection ". IEEE Transactions on Pattern Analysis & Machine Intelligence, PP(99), 2999-3007.
[10]
Tian, Z., Shen, C., Chen, H., & He, T. (2019). "Fcos: Fully convolutional one-stage object detection ". In Proceedings of the IEEE/CVF international conference on computer vision (pp. 9627-9636).
[11]
Ahmed, N., T. Natarajan and K. R. Rao (1974). "Discrete cosine transform." IEEE transactions on Computers 100(1): 90-93.
[12]
Hu, J., L. Shen and G. Sun (2018). "Squeeze-and-excitation networks." Proceedings of the IEEE conference on computer vision and pattern recognition: 7132-7141.
[13]
Woo, S., J. Park, J.-Y. Lee and I. S. Kweon (2018). "Cbam: Convolutional block attention module." Proceedings of the European conference on computer vision (ECCV): 3-19.
[14]
Qin, Z., P. Zhang, F. Wu and X. Li (2021). "Fcanet: Frequency channel attention networks." Proceedings of the IEEE/CVF international conference on computer vision: 783-792.
[15]
Dai, X., Y. Chen, B. Xiao, D. Chen, M. Liu, L. Yuan and L. Zhang (2021). "Dynamic head: Unifying object detection heads with attentions." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 7373-7382.
[16]
Li, K., G. Wan, G. Cheng, L. Meng and J. Han (2020). "Object detection in optical remote sensing images: A survey and a new benchmark." ISPRS Journal of Photogrammetry and Remote Sensing 159: 296-307.
[17]
Cheng, G., P. Zhou and J. Han (2016). "Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images." IEEE Transactions on Geoscience and Remote Sensing 54(12): 7405-7415.
[18]
Xiao, Z., Q. Liu, G. Tang and X. Zhai (2015). "Elliptic Fourier transformation-based histograms of oriented gradients for rotationally invariant object detection in remote-sensing images." International Journal of Remote Sensing 36(2): 618-644.
[19]
Lin, T.-Y., P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie (2017). "Feature pyramid networks for object detection." Proceedings of the IEEE conference on computer vision and pattern recognition: 2117-2125.
[20]
Simonyan, K. and A. Zisserman (2014). "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556.
[21]
Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28.
[22]
He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask r-cnn. In Proceedings of the IEEE international conference on computer vision: 2961-2969.
[23]
Liu, S., L. Qi, H. Qin, J. Shi and J. Jia (2018). "Path aggregation network for instance segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition: 8759-8768.
[24]
Sun, P., R. Zhang, Y. Jiang, T. Kong, C. Xu, W. Zhan, M. Tomizuka, L. Li, Z. Yuan and C. Wang (2021). "Sparse r-cnn: End-to-end object detection with learnable proposals." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 14454-14463.
[25]
Liu, W., D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu and A. C. Berg (2016). "Ssd: Single shot multibox detector." European conference on computer vision: 21-37.
[26]
Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.

Cited By

View all
  • (2024)A dual-difference change detection network for detecting building changes on high-resolution remote sensing imagesGeocarto International10.1080/10106049.2024.232208039:1Online publication date: 13-Mar-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence
March 2023
824 pages
ISBN:9781450399029
DOI:10.1145/3594315
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 August 2023

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCAI 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)44
  • Downloads (Last 6 weeks)4
Reflects downloads up to 22 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)A dual-difference change detection network for detecting building changes on high-resolution remote sensing imagesGeocarto International10.1080/10106049.2024.232208039:1Online publication date: 13-Mar-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media