skip to main content
10.1145/3447450.3447454acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvipConference Proceedingsconference-collections
research-article

Research on Image Detection Algorithm Based on Improved Retinanet

Authors Info & Claims
Published:09 April 2021Publication History

ABSTRACT

In many applications, existing image detection algorithms detect small targets in complex scenes with low recognition rate and biased classification. To address this issue, we develop a multi-target detection algorithm based on improved Retinanet. It consists of the following four parts: 1) extract rich texture features using the information interaction module; and 2) extract high-level abstract features through the improved FPN+ module; and 3) make full use of contextual information for detection leveraging SSH detection head; and 4) adopt the weighted loss for regression. For the accuracy of large and small categories and avoid bias, we employ the ensemble of loss function to optimize parameters. The experimental results confirm the feasibility of our detection algorithm, with the improved performance on four metrics. The mAP50 value and mmAP value increase to 96.75% and 75.08% respectively, and the ACD value decreases by 0.907.

References

  1. AGRAWAL P, GIRSHICK R, MALIK J. Analyzing the performance of Multilayer Neural Networks for Object Recognition[J]. Lecture Notes in Computer Science,2014.Google ScholarGoogle Scholar
  2. He K, Zhang X, Ren S, et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37)9(:1904-1916.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ren S, He K, Girshick R, et al.Faster R-CNN:towards real-time object detection with region proposal networks[C]// Conference on Neural Information Processing Systems, 2015:91-99.Google ScholarGoogle Scholar
  4. Girshick R.Fast R-CNN[C]//International Conference on Computer Vision,2015:1440-1448.Google ScholarGoogle Scholar
  5. Dai J,Li Y,He K,et al.R-FCN:object detection via region-based fully convolutional networks[C]/ /Conference on Neural Information Processing Systems,2016:379-387.Google ScholarGoogle Scholar
  6. Lin T Y,Goyal P,Girshick R,et al.Focal loss for dense object detection[C]//International Conference on Computer Vision,2017:2999-3007.Google ScholarGoogle Scholar
  7. Duan K,Bai S,Xie L,et al.Center Net:keypoint triplets for object detection[J].ar Xiv:1904.08189,2019.Google ScholarGoogle Scholar
  8. Mark Everingham, John Winn. The PASCAL Visual Object Classes Challenge 2012 (VOC2012) Development Kit:,2012.http://host.robots.ox.ac.uk/pascal/VOC/.Google ScholarGoogle Scholar
  9. LIN Tsungyi, DOLLÁR P, GIRSHICK R, Feature pyramid networks for object detection[C]// Proceedings of the IEEE Conference on Computer Vision And Pattern Recognition. Honolulu, HI, USA: IEEE, 2017:2117-2125.Google ScholarGoogle Scholar
  10. K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90.Google ScholarGoogle Scholar
  11. T. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan and S. Belongie, "Feature Pyramid Networks for Object Detection," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, 2017, pp. 936-944, doi: 10.1109/CVPR.2017.106.Google ScholarGoogle Scholar
  12. Xiu-Shen Wei, Quan Cui, Lei Yang ,et al. RPC: A Large-Scale Retail Product Checkout Dataset[J]. arXiv:1901.07249v1 [cs.CV] 22 Jan 2019.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICVIP '20: Proceedings of the 2020 4th International Conference on Video and Image Processing
    December 2020
    255 pages
    ISBN:9781450389075
    DOI:10.1145/3447450

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 9 April 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)10
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format