High-Resolution Feature Representation Driven Infrared Small-Dim Object Detection

Dong, Yuhang; Wang, Yingying; Fan, Linyu; Ding, Xinghao; Huang, Yue

doi:10.1007/978-981-99-8555-5_25

Yuhang Dong¹⁵,
Yingying Wang¹⁶,
Linyu Fan¹⁵,
Xinghao Ding^15,16 &
…
Yue Huang^15,16

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14436))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

357 Accesses

Abstract

Infrared small-dim object detection is a challenging task due to the small size, weak features, lack of prominent structural information, and vulnerability to background interference. During the process of deep learning-based feature extraction, as the number of layers increases, the size of the feature map decreases, resulting in a reduction in resolution for small object features. This reduction negatively affects the network’s ability to capture fine-grained details and compromises the detection efficiency. Besides, the infrared objects can be easily overwhelmed by strong background interference, which further diminishes the original faint representations. To solve these issues, we proposes a high-resolution feature representation driven network for infrared small-dim object detection (HRFRD-Net). This network comprises three key components: High-Resolution Feature Representation Branch (HRFR), Infrared Small-Dim Object Detection Branch (ISDOD), and Spatial-Frequency Interaction Feature Enhancement Module (SFIFE). The HRFR branch employs implicit neural representation to super-resolve the infrared small objects in a self-supervised learning scheme. To effectively detect the small-scale objects, ISDOD leverages the shared encoder from HRFR to construct high-resolution and high-quality representation of infrared small objects in a resolution-free manner. To address the issue of dim objects, SFIFE incorporates a global-local mixed receptive field via the features interaction in spatial-frequency dual domains, which significantly improves the accuracy of infrared dim object detection. Experiments conducted on the MSISTD and MDvsFA datasets demonstrate the effectiveness of our approach, especially in complex scenarios where the objects are heavily obscured by the background and background interference closely resembles the objects.

Y. Dong and Y. Wang—Contribute equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bae, T.W., Zhang, F., Kweon, I.S.: Edge directional 2d lms filter for infrared small target detection. Infrared Phys. Technol. 55(1), 137–145 (2012)
Article Google Scholar
Chen, Y., Liu, S., Wang, X.: Learning continuous image representation with local implicit image function. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8628–8638 (2021)
Google Scholar
Chi, L., Jiang, B., Mu, Y.: Fast fourier convolution. Adv. Neural. Inf. Process. Syst. 33, 4479–4488 (2020)
Google Scholar
Dai, Y., Wu, Y., Zhou, F., Barnard, K.: Asymmetric contextual modulation for infrared small target detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 950–959 (2021)
Google Scholar
Dai, Y., Wu, Y., Zhou, F., Barnard, K.: Attentional local contrast networks for infrared small target detection. IEEE Trans. Geosci. Remote Sens. 59(11), 9813–9824 (2021)
Article Google Scholar
Han, J., Liang, K., Zhou, B., Zhu, X., Zhao, J., Zhao, L.: Infrared small target detection utilizing the multiscale relative local contrast measure. IEEE Geosci. Remote Sens. Lett. 15(4), 612–616 (2018)
Article Google Scholar
Huang, S., Liu, Y., He, Y., Zhang, T., Peng, Z.: Structure-adaptive clutter suppression for infrared small target detection: chain-growth filtering. Remote Sens. 12(1), 47 (2019)
Article Google Scholar
Jiang, C., Sud, A., Makadia, A., Huang, J., Nießner, M., Funkhouser, T., et al.: Local implicit grid representations for 3d scenes. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6001–6010 (2020)
Google Scholar
Li, B., et al.: Dense nested attention network for infrared small target detection. IEEE Trans. Image Process. (2022)
Google Scholar
Moradi, S., Moallem, P., Sabahi, M.F.: Fast and robust small infrared target detection using absolute directional mean difference algorithm. Sig. Process. 177, 107727 (2020)
Article Google Scholar
Park, J.J., Florence, P., Straub, J., Newcombe, R., Lovegrove, S.: Deepsdf: learning continuous signed distance functions for shape representation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 165–174 (2019)
Google Scholar
Sinha, A.K., Moorthi, S.M., Dhar, D.: Nl-ffc: non-local fast fourier convolution for image super resolution. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 467–476 (2022)
Google Scholar
Sitzmann, V., Martel, J., Bergman, A., Lindell, D., Wetzstein, G.: Implicit neural representations with periodic activation functions. Adv. Neural. Inf. Process. Syst. 33, 7462–7473 (2020)
Google Scholar
Wang, A., Li, W., Huang, Z., Wu, X., Jie, F., Tao, R.: Prior-guided data augmentation for infrared small target detection. IEEE J. Sel. Top. Appl. Earth Observations Remote Sens. 15, 10027–10040 (2022)
Article Google Scholar
Wang, H., Zhou, L., Wang, L.: Miss detection vs. false alarm: adversarial learning for small object segmentation in infrared images. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8509–8518 (2019)
Google Scholar
Wang, X., Peng, Z., Zhang, P., He, Y.: Infrared small target detection via nonnegativity-constrained variational mode decomposition. IEEE Geosci. Remote Sens. Lett. 14(10), 1700–1704 (2017)
Article Google Scholar
Wei, Y., You, X., Li, H.: Multiscale patch-based contrast measure for small infrared target detection. Pattern Recogn. 58, 216–226 (2016)
Article Google Scholar
Wu, X., Hong, D., Chanussot, J.: Uiu-net: U-net in u-net for infrared small object detection. IEEE Trans. Image Process. 32, 364–376 (2022)
Article Google Scholar
Xu, J., Li, Z., Du, B., Zhang, M., Liu, J.: Reluplex made more practical: Leaky relu. In: 2020 IEEE Symposium on Computers and communications (ISCC), pp. 1–7. IEEE (2020)
Google Scholar
Zhang, T., Cao, S., Pu, T., Peng, Z.: Agpcnet: attention-guided pyramid context networks for infrared small target detection. arXiv preprint arXiv:2111.03580 (2021)
Zhu, H., Ni, H., Liu, S., Xu, G., Deng, L.: Tnlrs: target-aware non-local low-rank modeling with saliency filtering regularization for infrared small target detection. IEEE Trans. Image Process. 29, 9546–9558 (2020)
Article MathSciNet Google Scholar
Zhu, X., Lyu, S., Wang, X., Zhao, Q.: Tph-yolov5: Improved yolov5 based on transformer prediction head for object detection on drone-captured scenarios. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2778–2788 (2021)
Google Scholar

Download references

Acknowledgements

The work was supported in part by the National Natural Science Foundation of China under Grant 82172033, U19B2031, 61971369, 52105126, 82272071, 62271430, and the Fundamental Research Funds for the Central Universities 20720230104.

Author information

Authors and Affiliations

School of Informatics, Xiamen University, Xiamen, 361001, China
Yuhang Dong, Linyu Fan, Xinghao Ding & Yue Huang
Institute of Artificial Intelligence, Xiamen University, Xiamen, 361001, China
Yingying Wang, Xinghao Ding & Yue Huang

Authors

Yuhang Dong
View author publications
You can also search for this author in PubMed Google Scholar
Yingying Wang
View author publications
You can also search for this author in PubMed Google Scholar
Linyu Fan
View author publications
You can also search for this author in PubMed Google Scholar
Xinghao Ding
View author publications
You can also search for this author in PubMed Google Scholar
Yue Huang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinghao Ding .

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Xiamen University, Xiamen, China
Hanzi Wang
Beijing University of Posts and Telecommunications, Beijing, China
Zhanyu Ma
Sun Yat-sen University, Guangzhou, China
Weishi Zheng
Peking University, Beijing, China
Hongbin Zha
Chinese Academy of Sciences, Beijing, China
Xilin Chen
Chinese Academy of Sciences, Beijing, China
Liang Wang
Xiamen University, Xiamen, China
Rongrong Ji

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 148 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dong, Y., Wang, Y., Fan, L., Ding, X., Huang, Y. (2024). High-Resolution Feature Representation Driven Infrared Small-Dim Object Detection. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14436. Springer, Singapore. https://doi.org/10.1007/978-981-99-8555-5_25

Download citation

DOI: https://doi.org/10.1007/978-981-99-8555-5_25
Published: 28 December 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8554-8
Online ISBN: 978-981-99-8555-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics