Feature Adaption with Predicted Boxes for Oriented Object Detection in Aerial Images

Zou, Minhao; Hu, Ziye; Guan, Yuxiang; Gan, Zhongxue; Guan, Chun; Leng, Siyang

doi:10.1007/978-3-030-89370-5_27

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13033)

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

Abstract

Object detection is a fundamental research field in computer vision. Arbitrary-oriented objects appear routinely in face detection, natural scene text detection, and aerial image detection, tasks that have attracted widespread attention recently. However, existing rotation detectors still suffer from feature misalignment, because fixed convolution kernels are applied to arbitrary-oriented and deformed objects. In this paper, we propose a novel method, the One-stage Feature Adaption Network (OFA-Net), for oriented object detection in aerial images. A feature adaption module, implemented with deformable convolution and align convolution, refines the feature maps according to the predicted offsets and decoded boxes. Furthermore, to address the long-standing periodic angle regression problem, the box regression branch is decoupled into a size branch and an angle branch, and a new periodic loss in the angle branch exploits the periodic orientation of the object. Extensive experiments on three benchmark datasets, DOTA, HRSC2016, and UCAS-AOD, demonstrate the effectiveness of our approach, which achieves promising results compared with state-of-the-art methods.
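
The abstract does not give the form of the periodic loss, so the following is only an illustrative sketch of why a periodic angle loss helps, not the loss used in OFA-Net. It assumes box orientation repeats every π radians and uses a hypothetical cosine-based penalty; the function names `periodic_angle_loss` and `l1_angle_loss` and the `period` parameter are introduced here for illustration only.

```python
import math


def periodic_angle_loss(theta_pred, theta_gt, period=math.pi):
    """Hypothetical periodic penalty (an assumption, not the paper's formula).

    The loss is zero whenever the two angles differ by an integer multiple
    of `period`, and it peaks at 2.0 when they differ by half a period.
    """
    delta = theta_pred - theta_gt
    return 1.0 - math.cos(2.0 * math.pi * delta / period)


def l1_angle_loss(theta_pred, theta_gt):
    """Plain absolute difference, shown for comparison; it ignores periodicity."""
    return abs(theta_pred - theta_gt)


# Two angles on opposite sides of the +/- pi/2 boundary describe nearly the
# same box orientation when the period is pi.
near_upper = math.pi / 2 - 0.01
near_lower = -math.pi / 2 + 0.01

boundary_periodic = periodic_angle_loss(near_upper, near_lower)
boundary_l1 = l1_angle_loss(near_upper, near_lower)

print("periodic loss across the boundary:", boundary_periodic)
print("L1 loss across the boundary:", boundary_l1)
```

Under these assumptions, the L1 penalty across the boundary is close to π even though the two orientations are almost identical, while the periodic penalty stays near zero. This is the discontinuity that a periodic loss in a decoupled angle branch is meant to avoid.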

Acknowledgements

This work is sponsored by the Shanghai Municipal Science and Technology Major Project (No. 2021SHZDZX0103) and supported by the Shanghai Engineering Research Center of AI & Robotics, Fudan University, China, and the Engineering Research Center of AI & Robotics, Ministry of Education, China. S. Leng is sponsored by the Shanghai Sailing Program (No. 21YF1402300).

Author information

Authors and Affiliations

Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China
Minhao Zou, Ziye Hu, Yuxiang Guan, Zhongxue Gan, Chun Guan & Siyang Leng

Corresponding authors

Correspondence to Zhongxue Gan, Chun Guan or Siyang Leng.

Editor information

Editors and Affiliations

MIMOS Berhad, Kuala Lumpur, Malaysia
Duc Nghia Pham
Sirindhorn International Institute of Science and Technology, Thammasat University, Mueang Pathum Thani, Thailand
Thanaruk Theeramunkong
Data61, CSIRO, Brisbane, QLD, Australia
Guido Governatori
Department of Philosophy, Tsinghua University, Beijing, China
Fenrong Liu

Cite this paper

Zou, M., Hu, Z., Guan, Y., Gan, Z., Guan, C., Leng, S. (2021). Feature Adaption with Predicted Boxes for Oriented Object Detection in Aerial Images. In: Pham, D.N., Theeramunkong, T., Governatori, G., Liu, F. (eds) PRICAI 2021: Trends in Artificial Intelligence. PRICAI 2021. Lecture Notes in Computer Science, vol 13033. Springer, Cham. https://doi.org/10.1007/978-3-030-89370-5_27

DOI: https://doi.org/10.1007/978-3-030-89370-5_27
Published: 01 November 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89369-9
Online ISBN: 978-3-030-89370-5
eBook Packages: Computer Science, Computer Science (R0)
