skip to main content
10.1145/3628797.3629020acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Lite FPN_SSD: A Reconfiguration SSD with Adapting Feature Pyramid Network Scheme for Small Object Detection

Published:07 December 2023Publication History

ABSTRACT

Detecting small objects poses a significant challenge in computer vision because of the low resolution and fuzzy feature representation. Although one-stage detection techniques alleviate the problem caused by scale difference to some extent, they also retain redundant features, resulting in resource wastage and slower processing speeds. This research first primarily centers around elucidating the SSD algorithm, a new improved framework, namely Lite FPN-SSD (Lite Single Shot Multibox Detector with Adapting Feature Pyramid Network), then is proposed to solve the weakness of the SSD algorithm. The Lite FPN-SSD is build upon the popular FPN and SSD architectures to create a learnable fusion scheme with controlling the feature information that deep layers deliver to shallow layers. Its lightweight nature, with a minimal increase in parameters, ensures high efficiency for real-time applications. Extensive experiments conducted on VOC, VEDAI and SOHAS datasets demonstrate an impressive results of the proposed models in comparison with the original SSD and its other variations. Particularly, by making a minimal addition of only 2 million parameters, the proposed model achieves a mean average precision (mAP) of 78.36% on the VOC dataset, which is close to another architecture that achieved a 78.40% mAP but require more than 2.6.

References

  1. Tsung Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Doll´ar. 2020. Focal loss for dense object detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence. Vol. 42, 2, 318–327. DOI: https://doi.org/ 10.1109/TPAMI.2018.2858826Google ScholarGoogle ScholarCross RefCross Ref
  2. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779–788. DOI: https://doi.org/ 10.1109/CVPR.2016.91Google ScholarGoogle ScholarCross RefCross Ref
  3. Joseph Redmon, and Ali Farh. 2017. Yolo9000: Better, faster, stronger. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). DOI: https://doi.org/10.1109/CVPR.2017.690Google ScholarGoogle ScholarCross RefCross Ref
  4. Joseph Redmon, and Ali Farh. 2018. Yolov3: An incremental improvement. In arXiv:1804.02767v1 [cs.CV]. DOI: https://doi.org/10.48550/arXiv.1804.02767Google ScholarGoogle ScholarCross RefCross Ref
  5. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot MultiBox detector. In Proceedings of the Computer Vision (ECCV). 21–37. DOI: https://doi.org/10.1007/978-3-319-46448-0_2Google ScholarGoogle ScholarCross RefCross Ref
  6. Tsung Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). DOI: https://doi.org/10.1109/CVPR.2017.106Google ScholarGoogle ScholarCross RefCross Ref
  7. Xiang Li and Haibo LuoX. 2021. An Improved SSD for small target detection. In 6th International Conference on Multimedia and Image Processing (ICMIP). ACM, New York, NY, USA. DOI: https://doi.org/10.1145/3449388.3449391Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hong Tae Choi, Ho Jun Lee, Hoon Kang, Sungwook Yu, and Ho Hyun Park. 2021. SSD-EMB: An improved ssd using enhanced feature map block for object detection. Sensor. Vol. 21, 8, 1–8. DOI: https://doi.org/10.3390/s21082842Google ScholarGoogle ScholarCross RefCross Ref
  9. Zuoxin Li, and Fuqiang Zhou. 2017. FSSD: Feature fusion single shot multibox detector. arXiv: Computer Vision and Pattern Recognition. DOI: https://doi.org/10.48550/arXiv.1712.00960%20Google ScholarGoogle ScholarCross RefCross Ref
  10. Cheng Yang Fu, Wei Liu, Ananth Ranga, Ambrish Tyagi, and Alexander C. Berg. 2017. DSSD: Deconvolutional single shot detector. arXiv: Computer Vision and Pattern Recognition. DOI: https://doi.org/10.48550/arXiv.1701.06659Google ScholarGoogle ScholarCross RefCross Ref
  11. Lisha Cui, Rui Ma, Pei Lv, Xiaoheng Jiang, Zhimin Gao, Bing Zhou, and Mingliang Xu. 2020. MDSSD: Multi-scale deconvolutional single shot detector for small objects. Sci. China Inf. Sci. Vol. 63. DOI: https://doi.org/10.1007/s11432-019-2723-1Google ScholarGoogle ScholarCross RefCross Ref
  12. Everingham M., Van Gool L., Williams C.K., Winn J., Zisserman A. 2010. The pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. DOI: https://doi.org/10.1007/s11263-009-0275-4Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Sebastien Razakarivony, and Frederic Jurie. 2015. Vehicle detection in aerial imagery: A small target detection benchmark. Journal of Visual Communication and Image Representation. Vol. 34, 187 – 203. DOI: https://doi.org/10.1016/j.jvcir.2015.11.002Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Hamido Fujita, Francisco Herrera. 2020. Object Detection Binary Classifiers methodology based on deep learning to identify small objects handled similarly: Application in video surveillance. Knowledge-Based Systems. Vol 194. DOI: https://doi.org/10.1016/j.knosys.2020.105590Google ScholarGoogle ScholarCross RefCross Ref
  15. Karen Simonyan, and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. The 3rd International Conference on Learning Representations (ICLR2015), arXiv preprint arXiv:1409.1556, 2014. 1, 5. https://arxiv.org/abs/1409.1556Google ScholarGoogle Scholar

Index Terms

  1. Lite FPN_SSD: A Reconfiguration SSD with Adapting Feature Pyramid Network Scheme for Small Object Detection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology
      December 2023
      1058 pages
      ISBN:9798400708916
      DOI:10.1145/3628797

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 December 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      Overall Acceptance Rate147of318submissions,46%
    • Article Metrics

      • Downloads (Last 12 months)17
      • Downloads (Last 6 weeks)1

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format