Infrared Object Detection Algorithm Based on Spatial Feature Enhancement

Guo, Hao; Hou, Zhiqiang; Sun, Ying; Li, Juanjuan; Ma, Sugang

doi:10.1007/978-3-031-18916-6_28

Hao Guo¹⁵,
Zhiqiang Hou¹⁵,
Ying Sun¹⁵,
Juanjuan Li¹⁵ &
…
Sugang Ma¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13537))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

1429 Accesses

Abstract

Focusing on the problems of CenterNet in infrared images, such as feature loss and insufficient information utilization, an improved algorithm based on spatial feature enhancement is proposed. Firstly, a frequency-space enhancement module is used to enhance the details of the target region. Secondly, a module that can count global information is introduced into the backbone network to model the feature graph globally. Finally, in the case of no increase in computation and complexity, the residual mechanism is adopted to redesign the overall structure of the algorithm, which strengthens the feature interaction simply and efficiently. Experimental tests are carried out on the self-established infrared object detection dataset G-TIR and public infrared object detection dataset FLIR. The proposed algorithm improves the accuracy of the baseline by 8.4% and 15.3% respectively, and is better than many mainstream object detection algorithms in recent years. Meanwhile, the detection speed reaches 72 FPS, which balances the detection accuracy and speed well, then meet the real-time detection requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zhou, Y., Tuzel, O.: Voxelnet: end-to-end learning for point cloud based 3d object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4490–4499 (2018)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2016)
Article Google Scholar
Bochkovskiy, A., Wang, C., Liao, H.: Yolov4: Optimal speed and accuracy of object detection (2020). arXiv preprint arXiv: 2004.10934
Google Scholar
Ge, Z., Liu, S., Wang, F., Li, Z., Sun, J.: Yolox: Exceeding yolo series in 2021 (2021). arXiv preprint arXiv: 2107.08430
Google Scholar
Cai, Y., et al.: Yolobile: Real-time object detection on mobile devices via compression-compilation co-design (2020). arXiv preprint arXiv:2009.05697
Zhou, X., Wang, D., Krähenbühl, P.: Objects as points (2019). arXiv preprint arXiv:1904.07850
Chen, Y., Xie, H., Shin, H.: Multi-layer fusion techniques using a CNN for multispectral pedestrian detection. IET Comput. Vision 12(8), 1179–1187 (2018)
Article Google Scholar
Liu, J., Zhang, S., Wang, S., Metaxas, D.N.: Multispectral deep neural networks for pedestrian detection (2016). arXiv preprint arXiv:1611.02644
Ghose, D., Desai, S.M., Bhattacharya, S., Chakraborty, D., Fiterau, M., Rahman, T.: Pedestrian detection in thermal images using saliency maps. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, p. 0 (2019)
Google Scholar
Wei, S., Wang, C., Chen, Z., Zhang, C., Zhang, X.: Infrared dim target detection based on human visual mechanism. Acta Photonica Sinica 50(1), 0110001 (2021)
Google Scholar
Dai, X., Yuan, X., Wei, X.: TIRNet: object detection in thermal infrared images for autonomous driving. Appl. Intell. 51(3), 1244–1261 (2020). https://doi.org/10.1007/s10489-020-01882-2
Article Google Scholar
Zhu, X., Hu, H., Lin, S., Dai, J.: Deformable convnets v2: more deformable, better results. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9308–9316 (2019)
Google Scholar
Yi, J., Wu, P., Liu, B., Huang, Q., Qu, H., Metaxas, D.: Oriented object detection in aerial images with box boundary-aware vectors. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision, pp. 2150–2159 (2021)
Google Scholar
Qin, Z., Zhang, P., Wu, F., Li, X.: Fcanet: frequency channel attention networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 783–792 (2021)
Google Scholar
Srinivas, A., Lin, T.Y., Parmar, N., Shlens, J., Abbeel, P., Vaswani, A.: Bottleneck transformers for visual recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 16519–16529 (2021)
Google Scholar
FLIR Homepage. https://www.flir.cn/oem/adas/adas-dataset-form. Accessed 12 Apr 2022
Zhang, S., Chi, C., Yao, Y., Lei, Z., Li, S.Z.: Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9759–9768 (2020)
Google Scholar
Tian, Z., Shen, C., Chen, H.: Fcos: fully convolutional one-stage object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9627–9636 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi’an University of Posts and Telecommunications, Xi’an, 710121, China
Hao Guo, Zhiqiang Hou, Ying Sun, Juanjuan Li & Sugang Ma

Authors

Hao Guo
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqiang Hou
View author publications
You can also search for this author in PubMed Google Scholar
Ying Sun
View author publications
You can also search for this author in PubMed Google Scholar
Juanjuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Sugang Ma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hao Guo .

Editor information

Editors and Affiliations

Southern University of Science and Technology, Shenzhen, China
Shiqi Yu
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Zhaoxiang Zhang
Hong Kong Baptist University, Hong Kong, China
Pong C. Yuen
Northwestern Polytechnical University, Xi'an, China
Junwei Han
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Hong Kong Baptist University, Hong Kong, China
Yike Guo
Sun Yat-sen University, Guangzhou, China
Jianhuang Lai
Southern University of Science and Technology, Shenzhen, China
Jianguo Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Guo, H., Hou, Z., Sun, Y., Li, J., Ma, S. (2022). Infrared Object Detection Algorithm Based on Spatial Feature Enhancement. In: Yu, S., et al. Pattern Recognition and Computer Vision. PRCV 2022. Lecture Notes in Computer Science, vol 13537. Springer, Cham. https://doi.org/10.1007/978-3-031-18916-6_28

Download citation

DOI: https://doi.org/10.1007/978-3-031-18916-6_28
Published: 27 October 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18915-9
Online ISBN: 978-3-031-18916-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics