research-article

Remote Sensing Image Object Detection Algorithm Combining Attention

Authors:

Shangde XinAuthors Info & Claims

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence

Pages 1 - 7

https://doi.org/10.1145/3594315.3594316

Published: 02 August 2023 Publication History

Abstract

To address the issue of the complex background being difficult to differentiate and the object percentage, rotation angle, and position affecting the identification accuracy, a better object detection algorithm for remote sensing images is given. This algorithm uses the improved frequency channel attention to make the network take more notice of the foreground information, in order to achieve the effect of suppressing the complex background information. It also introduces a multiple attention mechanism in the baseline, which is conducive to alleviating the issue of increasing the detection difficulty due to the different proportion of objects in remote sensing images, arbitrary rotation angle, and position. Experiments on the DIOR, NWPUVHR-10, and RSOD datasets, respectively, are conducted to confirm the efficacy of the proposed algorithm. The suggested method's average accuracy on the DIOR dataset is 1.46% higher than that of the single-stage ATSS algorithm, and it has also produced results that are competitive on the NWPUVHR-10 and RSOD datasets.

References

[1]

Huang, S., Wang, Y., & Su, P. (2016). "A New Synthetical Method of Feature Enhancement and Detection for SAR Image Targets". Journal of Image and Graphics, 4(2), 73-77.

[2]

Khan, R., Raisa, T. F., & Debnath, R. (2018). "An efficient contour based fine-grained algorithm for multi category object detection". Journal of Image and Graphics, 6(2), 127-136.

[3]

Utaminingrum, F., & Prasetya, R. P. (2020). Rizdania," Combining Multiple Feature for Robust Traffic Sign Detection". Journal of Image and Graphics, 8(2), 53-58.

[4]

Spiess, F., Reinhart, L., Strobel, N., Kaiser, D., Kounev, S., & Kaupp, T. (2021). "People detection with depth silhouettes and convolutional neural networks on a mobile robot". Journal of Image and Graphics, 9(4), 135-139.

[5]

Hasegawa, R., Iwamoto, Y., & Chen, Y. W. (2020). "Robust Japanese road sign detection and recognition in complex scenes using convolutional neural networks". Journal of Image and Graphics, 8(3), 59-66.

[6]

Liu, Y., Geng, L., Zhang, W., Gong, Y., & Xu, Z. (2021). "Survey of video based small target detection". Journal of Image and Graphics, 9(4), 122-134.

[7]

Lo, E. (2019). "Target Detection Algorithms in Hyperspectral Imaging Based on Discriminant Analysis". Journal of Image and Graphics, 7(4), 140-144.

[8]

Zhang, S., C. Chi, Y. Yao, Z. Lei and S. Z. Li (2020). "Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 9759-9768.

[9]

Lin, T. Y., Goyal, P., Girshick, R., He, K., & P Dollár. (2017). "Focal loss for dense object detection ". IEEE Transactions on Pattern Analysis & Machine Intelligence, PP(99), 2999-3007.

[10]

Tian, Z., Shen, C., Chen, H., & He, T. (2019). "Fcos: Fully convolutional one-stage object detection ". In Proceedings of the IEEE/CVF international conference on computer vision (pp. 9627-9636).

[11]

Ahmed, N., T. Natarajan and K. R. Rao (1974). "Discrete cosine transform." IEEE transactions on Computers 100(1): 90-93.

Digital Library

[12]

Hu, J., L. Shen and G. Sun (2018). "Squeeze-and-excitation networks." Proceedings of the IEEE conference on computer vision and pattern recognition: 7132-7141.

[13]

Woo, S., J. Park, J.-Y. Lee and I. S. Kweon (2018). "Cbam: Convolutional block attention module." Proceedings of the European conference on computer vision (ECCV): 3-19.

[14]

Qin, Z., P. Zhang, F. Wu and X. Li (2021). "Fcanet: Frequency channel attention networks." Proceedings of the IEEE/CVF international conference on computer vision: 783-792.

[15]

Dai, X., Y. Chen, B. Xiao, D. Chen, M. Liu, L. Yuan and L. Zhang (2021). "Dynamic head: Unifying object detection heads with attentions." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 7373-7382.

[16]

Li, K., G. Wan, G. Cheng, L. Meng and J. Han (2020). "Object detection in optical remote sensing images: A survey and a new benchmark." ISPRS Journal of Photogrammetry and Remote Sensing 159: 296-307.

[17]

Cheng, G., P. Zhou and J. Han (2016). "Learning rotation-invariant convolutional neural networks for object detection in VHR optical remote sensing images." IEEE Transactions on Geoscience and Remote Sensing 54(12): 7405-7415.

[18]

Xiao, Z., Q. Liu, G. Tang and X. Zhai (2015). "Elliptic Fourier transformation-based histograms of oriented gradients for rotationally invariant object detection in remote-sensing images." International Journal of Remote Sensing 36(2): 618-644.

Digital Library

[19]

Lin, T.-Y., P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie (2017). "Feature pyramid networks for object detection." Proceedings of the IEEE conference on computer vision and pattern recognition: 2117-2125.

[20]

Simonyan, K. and A. Zisserman (2014). "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556.

[21]

Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, 28.

[22]

He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask r-cnn. In Proceedings of the IEEE international conference on computer vision: 2961-2969.

[23]

Liu, S., L. Qi, H. Qin, J. Shi and J. Jia (2018). "Path aggregation network for instance segmentation." Proceedings of the IEEE conference on computer vision and pattern recognition: 8759-8768.

[24]

Sun, P., R. Zhang, Y. Jiang, T. Kong, C. Xu, W. Zhan, M. Tomizuka, L. Li, Z. Yuan and C. Wang (2021). "Sparse r-cnn: End-to-end object detection with learnable proposals." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition: 14454-14463.

[25]

Liu, W., D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu and A. C. Berg (2016). "Ssd: Single shot multibox detector." European conference on computer vision: 21-37.

[26]

Redmon, J., & Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767.

Cited By

Xu ZZhang CQi JLi XYao BWang L(2024)A dual-difference change detection network for detecting building changes on high-resolution remote sensing imagesGeocarto International10.1080/10106049.2024.232208039:1Online publication date: 13-Mar-2024
https://doi.org/10.1080/10106049.2024.2322080

Index Terms

Remote Sensing Image Object Detection Algorithm Combining Attention
1. Computing methodologies
  1. Machine learning
    1. Machine learning algorithms
      1. Ensemble methods
        Boosting
2. Networks
  1. Network performance evaluation
    1. Network performance analysis

Recommendations

Remote sensing image super-resolution and object detection: Benchmark and state of the art
Highlights
- High spatial resolution dataset for small object detection in Remote Sensing images.
Abstract
For the past two decades, there have been significant efforts to develop methods for object detection in Remote Sensing (RS) images. In most cases, the datasets for small object detection in remote sensing images are inadequate. Many ...
YOLO-RSOD: Improved YOLO Remote Sensing Object Detection
Pattern Recognition
Abstract
Remote sensing object detection has important application value in fields such as environmental monitoring and resource detection and analysis. However, the current universal object detectors are not very effective in detecting remote sensing ...
Adversarial Attacks Against Object Detection in Remote Sensing Images
Artificial Intelligence Security and Privacy
Abstract
With the continuous development of artificial intelligence technology and the increasing richness of remote sensing data, deep convolutional neural networks(DNNs) have been widely used in the field of remote sensing images. Object detection in ... $_{}$

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence

March 2023

824 pages

ISBN:9781450399029

DOI:10.1145/3594315

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 August 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article
Research
Refereed limited

Conference

ICCAI 2023

ICCAI 2023: 2023 9th International Conference on Computing and Artificial Intelligence

March 17 - 20, 2023

Tianjin, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
78
Total Downloads

Downloads (Last 12 months)44
Downloads (Last 6 weeks)4

Reflects downloads up to 23 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Xu ZZhang CQi JLi XYao BWang L(2024)A dual-difference change detection network for detecting building changes on high-resolution remote sensing imagesGeocarto International10.1080/10106049.2024.232208039:1Online publication date: 13-Mar-2024
https://doi.org/10.1080/10106049.2024.2322080

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten