skip to main content
10.1145/3488933.3488936acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

An Anchor-free Detector Based on Residual Feature Enhancement Pyramid Network for UAV Vehicle Detection

Authors Info & Claims
Published:25 February 2022Publication History

ABSTRACT

Vehicle detection in Unmanned Aerial Vehicle (UAV) images is a challenging task because there are many small objects in UAV images, and the scale of objects varies greatly, which brings great difficulty to vehicle detection using existing algorithms. This paper proposes an anchor-free detector called Residual Feature Enhancement Pyramid Network (RFEPNet) for UAV vehicle detection. RFEPNet contains a Cross-Level Context Fusion Network (CLCFNet) and a Residual Feature Enhancement Module (RFEM) based on pyramid convolution. Specifically, CLCFNet utilizes the densely connected structure and Dual Attention Fusion Module (DAFM) to increase the sensitivity of high-resolution feature maps to small objects. Simultaneously, RFEM exploits pyramid convolution and residual connection structure to enhance the semantic information of the feature pyramid. In addition, the anchor-free head is used for classification and bounding box regression. The experimental results on the UAVDT dataset show that the proposed RFEPNet achieves state-of-the-art performance.

References

  1. Majid Azimi S. ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery[C]. Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018: 0-0.Google ScholarGoogle Scholar
  2. Alotaibi E T, Alqefari S S, and Koubaa A J I A. LSAR: Multi-UAV Collaboration for Search and Rescue Missions[J].2019. IEEE Access, 2019, 7: 55817-55832. https://doi.org/10.1109/ACCESS.2019.2912306Google ScholarGoogle ScholarCross RefCross Ref
  3. Lecun Y, Bengio Y J T H O B T, and Networks N. Convolutional Networks for Images, Speech, and Time-Series[J].1995. The handbook of brain theory and neural networks, 1995, 3361(10): 1995Google ScholarGoogle Scholar
  4. Girshick R, Donahue J, Darrell T, and Malik J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014: 580-587.Google ScholarGoogle Scholar
  5. Redmon J, Divvala S, Girshick R, and Farhadi A. You Only Look Once: Unified, Real-Time Object Detection[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 779-788.Google ScholarGoogle Scholar
  6. Wang X, Zhang S, Yu Z, Feng L, and Zhang W. Scale-Equalizing Pyramid Convolution for Object Detection[C]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020: 13359-13368.Google ScholarGoogle Scholar
  7. He K, Zhang X, Ren S, and Sun J. Deep Residual Learning for Image Recognition[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770-778.Google ScholarGoogle Scholar
  8. Ren S, He K, Girshick R, and Sun J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J].2017. IEEE Trans Pattern Anal Mach Intell, 2017, 39(6): 1137-1149. https://doi.org/10.1109/tpami.2016.2577031Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Lin T-Y, Dollár P, Girshick R, He K, Hariharan B, and Belongie S. Feature Pyramid Networks for Object Detection[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017: 2117-2125.Google ScholarGoogle Scholar
  10. Cai Z, and Vasconcelos N. Cascade R-CNN: Delving Into High Quality Object Detection[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 6154-6162.Google ScholarGoogle Scholar
  11. Redmon J, and Farhadi A J a P A. YOLOv3: An Incremental Improvement[J].2018. arXiv preprint, 2018, arXiv:1804.02767Google ScholarGoogle Scholar
  12. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, and Berg A C. SSD: Single Shot MultiBox Detector[C]. European Conference on Computer Vision, 2016: 21–37.Google ScholarGoogle Scholar
  13. Lin T-Y, Goyal P, Girshick R, He K, and Dollár P. Focal Loss for Dense Object Detection[C]. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017: 2980-2988.Google ScholarGoogle Scholar
  14. Tan M, Pang R, and Le Q V. EfficientDet: Scalable and Efficient Object Detection[C]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020: 10781-10790.Google ScholarGoogle Scholar
  15. Law H, and Deng J. CornerNet: Detecting Objects as Paired Keypoints[C]. Proceedings of the European Conference on Computer Vision (ECCV), 2018: 734-750.Google ScholarGoogle Scholar
  16. Zhou X, Zhuo J, and Krahenbuhl P. Bottom-Up Object Detection by Grouping Extreme and Center Points[C]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019: 850-859.Google ScholarGoogle Scholar
  17. Tian Z, Shen C, Chen H, and He T. FCOS: Fully Convolutional One-Stage Object Detection[C]. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 9627-9636.Google ScholarGoogle Scholar
  18. Yang J, Xie X, Shi G, and Yang W. A Feature-Enhanced Anchor-Free Network for UAV Vehicle Detection[J].2020. Remote Sensing, 2020, 12(17): 2729. https://doi.org/10.3390/rs12172729Google ScholarGoogle ScholarCross RefCross Ref
  19. Wang H, Wang Z, Jia M, Li A, Feng T, Zhang W, and Jiao L. Spatial Attention for Multi-Scale Feature Refinement for Object Detection[C]. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 0-0.Google ScholarGoogle Scholar
  20. Liu M, Wang X, Zhou A, Fu X, Ma Y, and Piao C. UAV-YOLO: Small Object Detection on Unmanned Aerial Vehicle Perspective[J].2020. Sensors, 2020, 20(8): 2238. https://doi.org/10.3390/s20082238Google ScholarGoogle ScholarCross RefCross Ref
  21. Zhang P, Zhong Y, and Li X. SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications[C]. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 0-0.Google ScholarGoogle Scholar
  22. Du D, Qi Y, Yu H, Yang Y, Duan K, Li G, Zhang W, Huang Q, and Tian Q. The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking[C]. Proceedings of the European Conference on Computer Vision (ECCV), 2018: 370-386.Google ScholarGoogle Scholar
  23. Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, and Zitnick C L. Microsoft COCO: Common Objects in Context[C]. European Conference on Computer Vision, 2014: 740-755.Google ScholarGoogle Scholar
  24. Liu S, Qi L, Qin H, Shi J, and Jia J. Path Aggregation Network for Instance Segmentation[C]. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 8759-8768.Google ScholarGoogle Scholar
  25. Zhang X, Wan F, Liu C, Ji X, and Ye Q. Learning to Match Anchors for Visual Object Detection[J].2021. IEEE Trans Pattern Anal Mach Intell, 2021, Pp. https://doi.org/10.1109/tpami.2021.3050494Google ScholarGoogle Scholar
  26. Zhu C, He Y, and Savvides M. Feature Selective Anchor-Free Module for Single-Shot Object Detection[C]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019: 840-849.Google ScholarGoogle Scholar
  27. Yang Z, Liu S, Hu H, Wang L, and Lin S. RepPoints: Point Set Representation for Object Detection[C]. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 9657-9666.Google ScholarGoogle Scholar

Index Terms

  1. An Anchor-free Detector Based on Residual Feature Enhancement Pyramid Network for UAV Vehicle Detection
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            AIPR '21: Proceedings of the 2021 4th International Conference on Artificial Intelligence and Pattern Recognition
            September 2021
            715 pages
            ISBN:9781450384087
            DOI:10.1145/3488933

            Copyright © 2021 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 February 2022

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format