Abstract
The performance of oriented object detectors has been adversely impacted by the substantial variations in object orientation. In this paper, we propose a simple but efficient object detection framework for oriented objects in aerial images, termed QuadDet. Instead of adopting oriented bounding box to represent the object, we directly predict the four vertices of the object’s quadrilateral. Specially, we introduce a fast sorting method for four vertexes of quadrangles, called the Vertex Sorting Function. The function confirms that the vertexes can compose a valid quadrangle by sorting tangents of the vertexes. Furthermore, we employ an efficient polygon IoU loss function, named the PolyIoU Loss Function, to progressively align the predicted quadrangle’s shape with the ground truth. Under these strategies, our model achieves competitive performance. Without bells and whistles, our method with ResNet50 achieves 73.63% mAP on the DOTA-v1.0 dataset running at 23.4 FPS, which surpasses all recent one-stage oriented object detectors by a significant margin. Moreover, on the largest dataset DOTA-v2.0, our QuadDet with ResNet50 obtains 51.54% mAP. The code and models are available at https://github.com/DDGRCF/QuadDet.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Braden, B.: The surveyor’s area formula. Coll. Math. J. 17(4), 326–337 (1986)
Chen, W., Miao, S., Wang, G., Cheng, G.: Recalibrating features and regression for oriented object detection. Remote Sens. 15(8), 2134 (2023). https://doi.org/10.3390/rs15082134
Cheng, G., Li, Q., Wang, G., Xie, X., Min, L., Han, J.: SFRNet: fine-grained oriented object recognition via separate feature refinement. IEEE Trans. Geosci. Remote Sens. 61, 1–10 (2023). https://doi.org/10.1109/TGRS.2023.3277626
Cheng, G., et al.: Anchor-free oriented proposal generator for object detection. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3183022
Cheng, G., et al.: Dual-aligned oriented detector. IEEE Trans. Geosci. Remote Sens. 60, 1–11 (2022). https://doi.org/10.1109/TGRS.2022.3149780
Cheng, G., et al.: Towards large-scale small object detection: survey and benchmarks. IEEE Trans. Pattern Anal. Mach. Intell. (2023)
Ding, J., Xue, N., Long, Y., Xia, G.S., Lu, Q.: Learning RoI transformer for oriented object detection in aerial images. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2849–2858 (2019)
Ding, J., et al.: Object detection in aerial images: a large-scale benchmark and challenges. IEEE Trans. Pattern Anal. Mach. Intell. 44(11), 7778–7796 (2022). https://doi.org/10.1109/TPAMI.2021.3117983
Girshick, R.: Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580–587 (2014)
Graham, R.L.: An efficient algorithm for determining the convex hull of a finite planar set. Inf. Process. Lett. 1, 132–133 (1972)
Han, J., Ding, J., Li, J., Xia, G.S.: Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2021). https://doi.org/10.1109/TGRS.2021.3062048
Han, J., Ding, J., Xue, N., Xia, G.S.: ReDet: a rotation-equivariant detector for aerial object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 2786–2795 (2021)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Hou, L., Lu, K., Xue, J., Li, Y.: Shape-adaptive selection and measurement for oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence (2022)
Li, C., Cheng, G., Wang, G., Zhou, P., Han, J.: Instance-aware distillation for efficient object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–11 (2023)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2980–2988 (2017)
Ma, J., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20, 3111–3122 (2018)
Ming, Q., Zhou, Z., Miao, L., Zhang, H., Li, L.: Dynamic anchor learning for arbitrary-oriented object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2355–2363 (2021)
Qian, W., Yang, X., Peng, S., Yan, J., Guo, Y.: Learning modulated loss for rotated object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 2458–2466 (2021)
Qian, X., Wu, B., Cheng, G., Yao, X., Wang, W., Han, J.: Building a bridge of bounding box regression between oriented and horizontal object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 61, 1–9 (2023)
Rao, C., Wang, J., Cheng, G., Xie, X., Han, J.: Learning orientation-aware distances for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1 (2023). https://doi.org/10.1109/TGRS.2023.3278933
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779–788 (2016)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Tian, Z., Shen, C., Chen, H., He, T.: FCOS: fully convolutional one-stage object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9627–9636 (2019)
Xia, G.S., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Xie, X., Cheng, G., Li, Q., Miao, S., Li, K., Han, J.: Fewer is more: efficient object detection in large aerial images. Sci. China Inf. Sci. (2023)
Xie, X., Cheng, G., Wang, J., Yao, X., Han, J.: Oriented R-CNN for object detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3520–3529 (2021)
Xu, Y., et al.: Gliding vertex on the horizontal bounding box for multi-oriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 43(4), 1452–1459 (2020)
Yang, X., Yan, J.: Arbitrary-oriented object detection with circular smooth label. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12353, pp. 677–694. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58598-3_40
Yang, X., Yan, J., Feng, Z., He, T.: R3Det: refined single-stage detector with feature refinement for rotating object. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3163–3171 (2021)
Yang, X., Yan, J., Ming, Q., Wang, W., Zhang, X., Tian, Q.: Rethinking rotated object detection with gaussian Wasserstein distance loss. In: Proceedings of IEEE International Conference on Machine Learning, pp. 11830–11841 (2021)
Yang, X., et al.: SCRDet: towards more robust detection for small, cluttered and rotated objects. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 8232–8241 (2019)
Yang, X., et al.: Learning high-precision bounding box for rotated object detection via kullback-leibler divergence. In: Proceedings of Conference on Advances in Neural Information Processing Systems, pp. 18381–18394 (2021)
Yang, X., et al.: The KFIoU loss for rotated object detection. In: Proceedings of International Conference on Learning Representations (2023)
Yang, Z., Liu, S., Hu, H., Wang, L., Lin, S.: Reppoints: point set representation for object detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (2022)
Yao, Y., et al.: On improving bounding box representations for oriented object detection. IEEE Trans. Geosci. Remote Sens. 1–11 (2022)
Zhang, S., Chi, C., Yao, Y., Lei, Z., Li, S.Z.: Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 9759–9768 (2020)
Zhou, Y., et al.: MMRotate: a rotated object detection benchmark using pytorch. In: Proceedings of ACM International Conference on Multimedia (2022)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Rao, C., Li, W., Xie, X., Cheng, G. (2024). A Shape-Based Quadrangle Detector for Aerial Images. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14428. Springer, Singapore. https://doi.org/10.1007/978-981-99-8462-6_30
Download citation
DOI: https://doi.org/10.1007/978-981-99-8462-6_30
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8461-9
Online ISBN: 978-981-99-8462-6
eBook Packages: Computer ScienceComputer Science (R0)