research-article

RKformer: Runge-Kutta Transformer with Random-Connection Attention for Infrared Small Target Detection

Authors:

Jie Guo,

Xinbo GaoAuthors Info & Claims

MM '22: Proceedings of the 30th ACM International Conference on Multimedia

Pages 1730 - 1738

https://doi.org/10.1145/3503161.3547817

Published: 10 October 2022 Publication History

Get Access

Abstract

Infrared small target detection (IRSTD) refers to segmenting the small targets from infrared images, which is of great significance in practical applications. However, due to the small scale of targets as well as noise and clutter in the background, current deep neural network-based methods struggle in extracting features with discriminative semantics while preserving fine details. In this paper, we address this problem by proposing a novel RKformer model with an encoder-decoder structure, where four specifically designed Runge-Kutta transformer (RKT) blocks are stacked sequentially in the encoder. Technically, it has three key designs. First, we adopt a parallel encoder block (PEB) of the transformer and convolution to take their advantages in long-range dependency modeling and locality modeling for extracting semantics and preserving details. Second, we propose a novel random-connection attention (RCA) block, which has a reservoir structure to learn sparse attention via random connections during training. RCA encourages the target to attend to sparse relevant positions instead of all the large-area background pixels, resulting in more informative attention scores. It has fewer parameters and computations than the original self-attention in the transformer while performing better. Third, inspired by neural ordinary differential equations (ODE), we stack two PEBs with several residual connections as the basic encoder block to implement the Runge-Kutta method for solving ODE, which can effectively enhance the feature and suppress noise. Experiments on the public NUAA-SIRST dataset and IRSTD-1k dataset demonstrate the superiority of the RKformer over state-of-the-art methods.

Supplementary Material

MP4 File (MM22-fp00321.mp4)

Here is a video description of our work "RKformer: Runge-Kutta Transformer with Random-Connection Attention for Infrared Small Target Detection". Infrared small target detection is useful in many practical applications. However, due to the characteristics of infrared image, current deep learning-based methods cannot preserve fine details while extracting features with discriminative semantics. To address this challenge, we propose a novel RKformer. Specifically, we adopt a parallel encoder block (PEB) of the transformer and convolution to take their advantages in long-range dependency modeling and locality modeling. Two PEBs are stacked with several residual connections under the guidance of RK methods for solving ODE as the basic encoder block, which enhance the feature and suppress noise. We propose a random-connection attention (RCA) block, which learns sparse attention via random connections during training. Extensive experiments have verified the effectiveness of RKformer.

Download
12.01 MB

References

[1]

Xiangzhi Bai and Fugen Zhou. 2010. Analysis of new top-hat transformation and the application for infrared dim small target detection. Pattern Recognition 43, 6 (2010), 2145--2156.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Using Runge-Kutta method for numerical solution of the system of Volterra integral equation

High-order Runge–Kutta structure-preserving methods for the coupled nonlinear Schrödinger–KdV equations

Energy-conserving Runge-Kutta methods for the incompressible Navier-Stokes equations

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations