FAST-Det: Feature Aligned SSD Towards Remote Sensing Detector

Niu, Yutong; Li, Ao; Li, Jie; Wang, Yangwei

doi:10.1007/978-3-031-04245-4_22

Yutong Niu¹⁸,
Ao Li¹⁸,
Jie Li¹⁹ &
…
Yangwei Wang¹⁹

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 439))

Included in the following conference series:

International Conference on 5G for Future Wireless Networks

842 Accesses

Abstract

Object detection based on large-scale, high-resolution visible light Remote sensing images are widely used in military such as reconnaissance and civilian such as marine resource management. It is also an important task for the application of computer vision in remote sensing images. With the development of deep learning, more and more object detectors use deep network as the backbone, and accurate detection results and indicators can be obtained on conventional images. However, compared with conventional images, remote sensing images have more object numbers and object sizes, and the object distribution is also denser, which makes detection more difficult. At present, there are two types of object detectors: single-stage and two-stage. The single-stage detector directly obtains the detection result based on the feature map and pays more attention to the detection speed, while the two-stage detector generates the region of interest (RoI) by using feature map. More attention is paid to the accuracy of the test results when the test results are obtained through RoIs. This paper proposes a bilateral filtering refining method based on a single-stage detector, which refines the results obtained by a single-stage detector and approaches the performance of a two-stage detector without losing too much detection speed. Experiments conducted on the public large-scale visible light remote sensing dataset DOTA have proved the effectiveness of this method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zou, Z., Shi, Z., Guo, Y., et al.: Object detection in 20 years: a survey. arXiv preprint arXiv:1905.05055 (2019)
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497 (2015)
Liu, L., Ouyang, W., Wang, X., et al.: Deep learning for generic object detection: a survey. Int. J. Comput. Vis. 128(2), 261–318 (2020)
Article Google Scholar
Yang, X., Liu, Q., Yan, J., et al.: R3Det: refined single-stage detector with feature refinement for rotating object. arXiv preprint arXiv:1908.05612 (2019)
Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Chapter Google Scholar
Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Xia, G.S., Bai, X., Ding, J., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Google Scholar
Liu, Z., Yuan, L., Weng, L., et al.: A high resolution optical satellite image dataset for ship recognition and some new baselines. In: International Conference on Pattern Recognition Applications and Methods, vol. 2, pp. 324–331. SCITEPRESS (2017)
Google Scholar
Dai, J., Li, Y., He, K., et al.: R-FCN: object detection via region-based fully convolutional networks. arXiv preprint arXiv:1605.06409 (2016)
Chen, Z., et al.: PIoU loss: towards accurate oriented object detection in complex environments. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12350, pp. 195–211. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58558-7_12
Chapter Google Scholar
Azimi, S.M., Vig, E., Bahmanyar, R., Körner, M., Reinartz, P.: Towards multi-class object detection in unconstrained remote sensing imagery. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) ACCV 2018. LNCS, vol. 11363, pp. 150–165. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20893-6_10
Chapter Google Scholar
Li, Y., Huang, Q., Pei, X., et al.: RADet: refine feature pyramid network and multi-layer attention network for arbitrary-oriented object detection of remote sensing images. Remote Sens. 12(3), 389 (2020)
Article Google Scholar
Wei, H., Zhang, Y., Chang, Z., et al.: Oriented objects as pairs of middle lines. ISPRS J. Photogram. Remote Sens. 169, 268–279 (2020)
Article Google Scholar
Jiang, Y., Zhu, X., Wang, X., et al.: R2CNN: rotational region CNN for orientation robust scene text detection. arXiv preprint arXiv:1706.09579 (2017)
Ma, J., Shao, W., Ye, H., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimed. 20(11), 3111–3122 (2018)
Article Google Scholar
Ding, J., Xue, N., Long, Y., et al.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019)
Google Scholar

Download references

Acknowledgment

This work was supported in part by the National Natural Science Foundation of China under Grant 62071157, Natural Science Foundation of Heilongjiang Province under Grant YQ2019F011 and Postdoctoral Foundation of Heilongjiang Province under Grant LBH-Q19112.

Author information

Authors and Affiliations

School of Computer Science and Technology, Harbin University of Science and Technology, Harbin, China
Yutong Niu & Ao Li
Shandong Provincial Innovation and Practice Base for Postdoctors, Weihaizhenyu Intelligence Technology Co., Ltd., Weihai, China
Jie Li & Yangwei Wang

Authors

Yutong Niu
View author publications
You can also search for this author in PubMed Google Scholar
Ao Li
View author publications
You can also search for this author in PubMed Google Scholar
Jie Li
View author publications
You can also search for this author in PubMed Google Scholar
Yangwei Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yutong Niu .

Editor information

Editors and Affiliations

Harbin Institute of Technology, Harbin, China
Shuo Shi
Harbin Institute of Technology, Weihai, China
Ruofei Ma
Zhejiang University of Technology, Hangzhou, China
Weidang Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Niu, Y., Li, A., Li, J., Wang, Y. (2022). FAST-Det: Feature Aligned SSD Towards Remote Sensing Detector. In: Shi, S., Ma, R., Lu, W. (eds) 6GN for Future Wireless Networks. 6GN 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 439. Springer, Cham. https://doi.org/10.1007/978-3-031-04245-4_22

Download citation

DOI: https://doi.org/10.1007/978-3-031-04245-4_22
Published: 05 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04244-7
Online ISBN: 978-3-031-04245-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics