Abstract
Infrared small-dim object detection is challenging because the objects are small, have weak features, lack prominent structural information, and are vulnerable to background interference. In deep learning-based feature extraction, the feature map shrinks as the number of layers increases, which lowers the resolution of small-object features. This loss of resolution weakens the network's ability to capture fine-grained details and compromises detection efficiency. In addition, infrared objects are easily overwhelmed by strong background interference, which further diminishes their already faint representations. To address these issues, we propose a high-resolution feature representation driven network for infrared small-dim object detection (HRFRD-Net). The network comprises three key components: a High-Resolution Feature Representation Branch (HRFR), an Infrared Small-Dim Object Detection Branch (ISDOD), and a Spatial-Frequency Interaction Feature Enhancement Module (SFIFE). The HRFR branch employs implicit neural representation to super-resolve infrared small objects in a self-supervised learning scheme. To detect small-scale objects effectively, ISDOD shares the encoder of HRFR to construct high-resolution, high-quality representations of infrared small objects in a resolution-free manner. To handle dim objects, SFIFE builds a global-local mixed receptive field through feature interaction in the spatial and frequency domains, which significantly improves the accuracy of infrared dim object detection. Experiments on the MSISTD and MDvsFA datasets demonstrate the effectiveness of our approach, especially in complex scenarios where the objects are heavily obscured by the background or where background interference closely resembles the objects.
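The idea of a global-local mixed receptive field can be illustrated with a minimal NumPy sketch. This is not the SFIFE module itself: the function names, the 3x3 mean filter, the fixed spectral weights, and the blending factor `alpha` are all assumptions chosen for illustration, whereas a learned module would train its filters. The sketch only shows why a spatial branch sees a small neighbourhood while a frequency branch, by reweighting the Fourier spectrum, lets every output pixel depend on the whole image.

```python
import numpy as np


def local_branch(x: np.ndarray) -> np.ndarray:
    """Spatial branch (illustrative): 3x3 mean filter with edge padding.

    Each output pixel depends only on its immediate neighbours.
    """
    padded = np.pad(x, 1, mode="edge")
    h, w = x.shape
    out = np.zeros((h, w), dtype=float)
    for dy in range(3):
        for dx in range(3):
            out += padded[dy:dy + h, dx:dx + w]
    return out / 9.0


def global_branch(x: np.ndarray, spectral_weight: np.ndarray) -> np.ndarray:
    """Frequency branch (illustrative): reweight the 2-D spectrum.

    A pointwise product in the frequency domain touches every spatial
    location, so the receptive field is the whole image.
    """
    spectrum = np.fft.rfft2(x)
    return np.fft.irfft2(spectrum * spectral_weight, s=x.shape)


def mixed_receptive_field(x, spectral_weight, alpha=0.5):
    """Blend the two branches; alpha is a hypothetical fixed mixing factor."""
    local = local_branch(x)
    global_ = global_branch(x, spectral_weight)
    return alpha * local + (1.0 - alpha) * global_


# Synthetic 32x32 frame: weak noise plus one bright pixel standing in
# for a dim target (made-up data, for demonstration only).
rng = np.random.default_rng(0)
image = rng.normal(0.0, 0.05, size=(32, 32))
image[16, 16] += 1.0

# rfft2 of a 32x32 array has shape (32, 17). Zeroing the DC term removes
# the image mean, a crude stand-in for suppressing a uniform background.
weight = np.ones((32, 17))
weight[0, 0] = 0.0

fused = mixed_receptive_field(image, weight)
```

With all spectral weights set to one, `global_branch` returns the input unchanged, which is a quick sanity check that the forward and inverse transforms are paired correctly.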
Y. Dong and Y. Wang contributed equally to this work.
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Grants 82172033, U19B2031, 61971369, 52105126, 82272071, and 62271430, and by the Fundamental Research Funds for the Central Universities under Grant 20720230104.
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dong, Y., Wang, Y., Fan, L., Ding, X., Huang, Y. (2024). High-Resolution Feature Representation Driven Infrared Small-Dim Object Detection. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14436. Springer, Singapore. https://doi.org/10.1007/978-981-99-8555-5_25
DOI: https://doi.org/10.1007/978-981-99-8555-5_25
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8554-8
Online ISBN: 978-981-99-8555-5
eBook Packages: Computer Science (R0)