DOI: 10.1145/3628797.3628980
research-article

Optimizing Results in Aerial Images through Post-Processing Techniques on YOLOv7

Published: 07 December 2023

ABSTRACT

Object detection in aerial images has garnered significant attention from the research community in recent years, and the challenges posed by small objects, diverse orientations, and complex backgrounds have spurred extensive research efforts. In this paper, we focus on object detection in a YOLOv7-based framework. To address the issue of overlapping predicted regions, we apply Soft-NMS (soft non-maximum suppression), a post-processing method known for improving detection accuracy in such scenarios. Soft-NMS lowers the scores of bounding boxes according to how much they overlap a higher-scoring detection, rather than discarding them outright, which allows more accurate localization of objects in densely populated regions. We validate the approach through experiments on the UCAS Aerial Object Detection (UCAS AOD) dataset, which comprises over 1500 aerial images captured from diverse perspectives. The method improves object detection performance, particularly in scenes with closely positioned objects, and the proposed framework handles complex aerial scenes with higher precision and recall than conventional methods.
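The score adjustment described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the Gaussian rescoring variant of Soft-NMS, axis-aligned boxes given as (x1, y1, x2, y2), and illustrative defaults for the hypothetical parameters `sigma` and `score_threshold`.

```python
import math


def iou(a, b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian Soft-NMS (sketch; sigma and score_threshold are assumed defaults).

    Returns a list of (original_index, rescored_score) in selection order.
    """
    scores = list(scores)  # copy so the caller's list is not modified
    candidates = list(range(len(boxes)))
    kept = []
    while candidates:
        # Select the highest-scoring remaining box.
        best = max(candidates, key=lambda i: scores[i])
        candidates.remove(best)
        if scores[best] < score_threshold:
            break
        kept.append((best, scores[best]))
        # Decay the remaining scores by their overlap with the selected box
        # instead of deleting the boxes, as hard NMS would.
        for i in candidates:
            overlap = iou(boxes[best], boxes[i])
            scores[i] *= math.exp(-(overlap ** 2) / sigma)
        # Drop boxes whose score has decayed below the threshold.
        candidates = [i for i in candidates if scores[i] >= score_threshold]
    return kept


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
for index, score in soft_nms(boxes, scores):
    print(index, round(score, 3))
```

In this example the second box heavily overlaps the first, so it is kept with a reduced score and ranked last, whereas hard NMS with a typical IoU threshold would remove it. The third box does not overlap either of the others and keeps its original score.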

        • Published in

          SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology
          December 2023
          1058 pages
          ISBN: 9798400708916
          DOI: 10.1145/3628797

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate: 147 of 318 submissions, 46%