Abstract:
Unmanned aerial vehicles (UAVs) are widely used in various industries, and various visual tasks under the perspective of the UAV have been widely studied. In particular, ...Show MoreMetadata
Abstract:
Unmanned aerial vehicles (UAVs) are widely used in various industries, and various visual tasks under the perspective of the UAV have been widely studied. In particular, the red-green-blue-T (RGB-T) detection method based on UAVs has shown significant advantages. However, the existing RGB-T methods are designed based on registration image pairs rather than detecting images directly acquired by UAVs. This detection process is limited by the accuracy of image registration, and image registration wastes a lot of time. To solve the above problems, we construct an unregistered RGB-T image salient object detection (SOD) dataset under the UAV perspective, known as UAV RGB-T 2400. The dataset includes many challenging scenes, and the images are not manually registered. Furthermore, we construct a modality registration and object search (MROS) framework for unregistered RGB-T SOD. First, a modality registration scheme is proposed to solve the unregistration problem of modal features. We successively perform pixel-level registration from a local perspective and semantic-level registration from a global perspective for different modal features, and we carry out the channel and spatial interaction for the different modal features in modality registration. Aiming at the interference problem in the UAV detection environment, we propose an object search scheme. The two high-level features are used to search the object location, and the three low-level features are used to refine the object and produce prediction results. Experimental results on the UAV RGB-T 2400 dataset show that MROS is effective compared with the state-of-the-art methods. The code is available at: https://github.com/VDT-2048/UAV-RGB-T-2400.
Published in: IEEE Transactions on Geoscience and Remote Sensing ( Volume: 61)