A Shape-Based Quadrangle Detector for Aerial Images

Rao, Chaofan; Li, Wenbo; Xie, Xingxing; Cheng, Gong

doi:10.1007/978-981-99-8462-6_30

Chaofan Rao¹⁵,
Wenbo Li¹⁵,
Xingxing Xie¹⁵ &
…
Gong Cheng¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14428))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

847 Accesses

Abstract

The performance of oriented object detectors has been adversely impacted by the substantial variations in object orientation. In this paper, we propose a simple but efficient object detection framework for oriented objects in aerial images, termed QuadDet. Instead of adopting oriented bounding box to represent the object, we directly predict the four vertices of the object’s quadrilateral. Specially, we introduce a fast sorting method for four vertexes of quadrangles, called the Vertex Sorting Function. The function confirms that the vertexes can compose a valid quadrangle by sorting tangents of the vertexes. Furthermore, we employ an efficient polygon IoU loss function, named the PolyIoU Loss Function, to progressively align the predicted quadrangle’s shape with the ground truth. Under these strategies, our model achieves competitive performance. Without bells and whistles, our method with ResNet50 achieves 73.63% mAP on the DOTA-v1.0 dataset running at 23.4 FPS, which surpasses all recent one-stage oriented object detectors by a significant margin. Moreover, on the largest dataset DOTA-v2.0, our QuadDet with ResNet50 obtains 51.54% mAP. The code and models are available at https://github.com/DDGRCF/QuadDet.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Braden, B.: The surveyor’s area formula. Coll. Math. J. 17(4), 326–337 (1986)
Article Google Scholar
Chen, W., Miao, S., Wang, G., Cheng, G.: Recalibrating features and regression for oriented object detection. Remote Sens. 15(8), 2134 (2023). https://doi.org/10.3390/rs15082134
Article Google Scholar
Cheng, G., Li, Q., Wang, G., Xie, X., Min, L., Han, J.: SFRNet: fine-grained oriented object recognition via separate feature refinement. IEEE Trans. Geosci. Remote Sens. 61, 1–10 (2023). https://doi.org/10.1109/TGRS.2023.3277626
Article Google Scholar
Cheng, G., et al.: Anchor-free oriented proposal generator for object detection. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3183022
Article Google Scholar
Cheng, G., et al.: Dual-aligned oriented detector. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3149780
Article Google Scholar
Cheng, G., et al.: Towards large-scale small object detection: survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. (2023)
Google Scholar
Ding, J., Xue, N., Long, Y., Xia, G.S., Lu, Q.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019)
Google Scholar
Ding, J., et al.: Object detection in aerial images: a large-scale benchmark and challenges. IEEE Trans. Pattern Anal. Mach. Intell. 44(11), 7778–7796 (2022). https://doi.org/10.1109/TPAMI.2021.3117983
Article Google Scholar
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Google Scholar
Graham, R.L.: An efficient algorithm for determining the convex hull of a finite planar set. Inf. Process. Lett. 1, 132–133 (1972)
Article Google Scholar
Han, J., Ding, J., Li, J., Xia, G.S.: Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2021). https://doi.org/10.1109/TGRS.2021.3062048
Han, J., Ding, J., Xue, N., Xia, G.S.: ReDet: a rotation-equivariant detector for aerial object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2786–2795 (2021)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Hou, L., Lu, K., Xue, J., Li, Y.: Shape-adaptive selection and measurement for oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence (2022)
Google Scholar
Li, C., Cheng, G., Wang, G., Zhou, P., Han, J.: Instance-aware distillation for efficient object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–11 (2023)
Google Scholar
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Google Scholar
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Google Scholar
Ma, J., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20, 3111–3122 (2018)
Article Google Scholar
Ming, Q., Zhou, Z., Miao, L., Zhang, H., Li, L.: Dynamic anchor learning for arbitrary-oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2355–2363 (2021)
Google Scholar
Qian, W., Yang, X., Peng, S., Yan, J., Guo, Y.: Learning modulated loss for rotated object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2458–2466 (2021)
Google Scholar
Qian, X., Wu, B., Cheng, G., Yao, X., Wang, W., Han, J.: Building a bridge of bounding box regression between oriented and horizontal object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–9 (2023)
Google Scholar
Rao, C., Wang, J., Cheng, G., Xie, X., Han, J.: Learning orientation-aware distances for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1 (2023). https://doi.org/10.1109/TGRS.2023.3278933
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Google Scholar
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Google Scholar
Tian, Z., Shen, C., Chen, H., He, T.: FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9627–9636 (2019)
Google Scholar
Xia, G.S., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Google Scholar
Xie, X., Cheng, G., Li, Q., Miao, S., Li, K., Han, J.: Fewer is more: efficient object detection in large aerial images. Sci. China Inf. Sci. (2023)
Google Scholar
Xie, X., Cheng, G., Wang, J., Yao, X., Han, J.: Oriented R-CNN for object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3520–3529 (2021)
Google Scholar
Xu, Y., et al.: Gliding vertex on the horizontal bounding box for multi-oriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 43(4), 1452–1459 (2020)
Article Google Scholar
Yang, X., Yan, J.: Arbitrary-oriented object detection with circular smooth label. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12353, pp. 677–694. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58598-3_40
Chapter Google Scholar
Yang, X., Yan, J., Feng, Z., He, T.: R3Det: refined single-stage detector with feature refinement for rotating object. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3163–3171 (2021)
Google Scholar
Yang, X., Yan, J., Ming, Q., Wang, W., Zhang, X., Tian, Q.: Rethinking rotated object detection with gaussian Wasserstein distance loss. In: Proceedings of IEEE International Conference on Machine Learning, pp. 11830–11841 (2021)
Google Scholar
Yang, X., et al.: SCRDet: towards more robust detection for small, cluttered and rotated objects. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 8232–8241 (2019)
Google Scholar
Yang, X., et al.: Learning high-precision bounding box for rotated object detection via kullback-leibler divergence. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 18381–18394 (2021)
Google Scholar
Yang, X., et al.: The KFIoU loss for rotated object detection. In: Proceedings of International Conference on Learning Representations (2023)
Google Scholar
Yang, Z., Liu, S., Hu, H., Wang, L., Lin, S.: Reppoints: point set representation for object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (2022)
Google Scholar
Yao, Y., et al.: On improving bounding box representations for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2022)
Google Scholar
Zhang, S., Chi, C., Yao, Y., Lei, Z., Li, S.Z.: Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9759–9768 (2020)
Google Scholar
Zhou, Y., et al.: MMRotate: a rotated object detection benchmark using pytorch. In: Proceedings of ACM International Conference on Multimedia (2022)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Automation, Northwestern Polytechnical University, Xi’an, China
Chaofan Rao, Wenbo Li, Xingxing Xie & Gong Cheng

Authors

Chaofan Rao
View author publications
You can also search for this author in PubMed Google Scholar
Wenbo Li
View author publications
You can also search for this author in PubMed Google Scholar
Xingxing Xie
View author publications
You can also search for this author in PubMed Google Scholar
Gong Cheng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gong Cheng .

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Xiamen University, Xiamen, China
Hanzi Wang
Beijing University of Posts and Telecommunications, Beijing, China
Zhanyu Ma
Sun Yat-sen University, Guangzhou, China
Weishi Zheng
Peking University, Beijing, China
Hongbin Zha
Chinese Academy of Sciences, Beijing, China
Xilin Chen
Chinese Academy of Sciences, Beijing, China
Liang Wang
Xiamen University, Xiamen, China
Rongrong Ji

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rao, C., Li, W., Xie, X., Cheng, G. (2024). A Shape-Based Quadrangle Detector for Aerial Images. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14428. Springer, Singapore. https://doi.org/10.1007/978-981-99-8462-6_30

Download citation

DOI: https://doi.org/10.1007/978-981-99-8462-6_30
Published: 26 December 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8461-9
Online ISBN: 978-981-99-8462-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Shape-Based Quadrangle Detector for Aerial Images