Abstract
Object detection based on large-scale, high-resolution visible light Remote sensing images are widely used in military such as reconnaissance and civilian such as marine resource management. It is also an important task for the application of computer vision in remote sensing images. With the development of deep learning, more and more object detectors use deep network as the backbone, and accurate detection results and indicators can be obtained on conventional images. However, compared with conventional images, remote sensing images have more object numbers and object sizes, and the object distribution is also denser, which makes detection more difficult. At present, there are two types of object detectors: single-stage and two-stage. The single-stage detector directly obtains the detection result based on the feature map and pays more attention to the detection speed, while the two-stage detector generates the region of interest (RoI) by using feature map. More attention is paid to the accuracy of the test results when the test results are obtained through RoIs. This paper proposes a bilateral filtering refining method based on a single-stage detector, which refines the results obtained by a single-stage detector and approaches the performance of a two-stage detector without losing too much detection speed. Experiments conducted on the public large-scale visible light remote sensing dataset DOTA have proved the effectiveness of this method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zou, Z., Shi, Z., Guo, Y., et al.: Object detection in 20 years: a survey. arXiv preprint arXiv:1905.05055 (2019)
Girshick, R., Donahue, J., Darrell, T., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497 (2015)
Liu, L., Ouyang, W., Wang, X., et al.: Deep learning for generic object detection: a survey. Int. J. Comput. Vis. 128(2), 261–318 (2020)
Yang, X., Liu, Q., Yan, J., et al.: R3Det: refined single-stage detector with feature refinement for rotating object. arXiv preprint arXiv:1908.05612 (2019)
Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Lin, T.Y., Goyal, P., Girshick, R., et al.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Xia, G.S., Bai, X., Ding, J., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Liu, Z., Yuan, L., Weng, L., et al.: A high resolution optical satellite image dataset for ship recognition and some new baselines. In: International Conference on Pattern Recognition Applications and Methods, vol. 2, pp. 324–331. SCITEPRESS (2017)
Dai, J., Li, Y., He, K., et al.: R-FCN: object detection via region-based fully convolutional networks. arXiv preprint arXiv:1605.06409 (2016)
Chen, Z., et al.: PIoU loss: towards accurate oriented object detection in complex environments. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12350, pp. 195–211. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58558-7_12
Azimi, S.M., Vig, E., Bahmanyar, R., Körner, M., Reinartz, P.: Towards multi-class object detection in unconstrained remote sensing imagery. In: Jawahar, C.V., Li, H., Mori, G., Schindler, K. (eds.) ACCV 2018. LNCS, vol. 11363, pp. 150–165. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-20893-6_10
Li, Y., Huang, Q., Pei, X., et al.: RADet: refine feature pyramid network and multi-layer attention network for arbitrary-oriented object detection of remote sensing images. Remote Sens. 12(3), 389 (2020)
Wei, H., Zhang, Y., Chang, Z., et al.: Oriented objects as pairs of middle lines. ISPRS J. Photogram. Remote Sens. 169, 268–279 (2020)
Jiang, Y., Zhu, X., Wang, X., et al.: R2CNN: rotational region CNN for orientation robust scene text detection. arXiv preprint arXiv:1706.09579 (2017)
Ma, J., Shao, W., Ye, H., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimed. 20(11), 3111–3122 (2018)
Ding, J., Xue, N., Long, Y., et al.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019)
Acknowledgment
This work was supported in part by the National Natural Science Foundation of China under Grant 62071157, Natural Science Foundation of Heilongjiang Province under Grant YQ2019F011 and Postdoctoral Foundation of Heilongjiang Province under Grant LBH-Q19112.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Niu, Y., Li, A., Li, J., Wang, Y. (2022). FAST-Det: Feature Aligned SSD Towards Remote Sensing Detector. In: Shi, S., Ma, R., Lu, W. (eds) 6GN for Future Wireless Networks. 6GN 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 439. Springer, Cham. https://doi.org/10.1007/978-3-031-04245-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-031-04245-4_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-04244-7
Online ISBN: 978-3-031-04245-4
eBook Packages: Computer ScienceComputer Science (R0)