Abstract:
Object detection (OD) for remote sensing images (RSIs) is an important research topic in remote sensing data analysis. Many efforts have been devoted to remote sensing OD...Show MoreMetadata
Abstract:
Object detection (OD) for remote sensing images (RSIs) is an important research topic in remote sensing data analysis. Many efforts have been devoted to remote sensing OD (RSOD) tasks, most of which try to use attention mechanisms to improve the performance of detectors. However, the difference in information between the global features and local features of feature maps is ignored. In this article, we design a novel global and local enhanced attention mechanism (GLE-AM) to capture this different information. Then, we propose a global and local feature-enhanced network (GLE-Net) to fully utilize the features extracted by the GLE-AM. Furthermore, to make the path aggregation feature pyramid network (PAFPN) more suitable for extracting fused features and small OD tasks, we improve the cross-stage partial layer (CSP-Layer) and the spatial pyramid pooling (SPP), respectively. Experiments conducted on two publicly available remote sensing datasets demonstrate the effectiveness of our proposed methods. On the DIOR dataset, GLFE-YOLOX improved on the mAP metric by 3.19% compared to the YOLOX-m baseline, and on the NWPU VHR-10 dataset GLFE-YOLOX reaches 90.93% on mAP, which is 3.18% higher than YOLOX-l, comparing with the comparison algorithms, our proposed GLFE-YOLOX achieves the best results in mAP metric for both datasets.
Published in: IEEE Transactions on Instrumentation and Measurement ( Volume: 73)