skip to main content
10.1145/3484274.3484283acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicccvConference Proceedingsconference-collections
research-article

Improvement of Detection Rate for Small Objects Using Pre-processing Network

Published: 23 November 2021 Publication History

Abstract

Artificial intelligence (AI) has been developing in a variety of methods over the past decade. However most AI experts worried to build a deep or wide network because the accuracy of AI models depends heavily on the depth of the network. In general deep and wide networks are better at learning than those that are less deep and wide and wide. On the other hand deeper networks are more complex and have many disadvantages such as computational cost and system specification dependency. We propose a novel method to improve the average recall rate for small objects in the deep convolutional network in the paper. The proposed method added pre-processing layer before the network rather than stacking the networks deeper or wide. The presented pre-processing layer consists of two major parts: up-sampling and down-sampling of the data. The overall objective of up-sampling and down-sampling is to enhance the resolution of small objects in the input image. The pre-processing network improves the average recall rate of the base network to 3.56%. This experiment result depicts that the proposed method outperforms the small object detection performance.
CCS CONCEPTS • Computing methodologies • Object detection

References

[1]
K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778.
[2]
T. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, "Focal Loss for Dense Object Detection," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318-327, 1 Feb. 2020.
[3]
R. Liu, L. Joel, M. Piero, P. Felipe and F. Eric, "An intriguing failing of convolutional neural networks and the coordconv solution." 2018 arXiv preprint arXiv:1807.03247.
[4]
G. Wang, A. Li, G. He, J. Liu. Z. Zhang, and M. Wang, “Classification of High Spatial Resolution Remote Sensing Images Based on Decision Fusion”  2017 Journal of Advances in Information Technology Vol, 8 (1).
[5]
S. Bhadoria, P. Aggarwal, C. G. Dethe, and R. Vig, “Comparison of segmentation tools for multiple modalities in medical imaging” 2012 Journal of advances in information technology,   3 (4), 197-205.
[6]
B. Xie, X. Zhu, C. Han, Y. Wang, X. Li and Y. Zhang, “Research of the Space-Borne Infrared Ship Target Recognition Technology Based on the Complex Background” 2019 Journal of Advances in Information Technology Vol, 10 (2).
[7]
B. Cheng, S. Cui, and T. Li, “Tensor Locality Preserving Projections Based Urban Building Areas Extraction from High-Resolution SAR Images” 2016  Journal of Advances in Information Technology Vol,   7 (4).
[8]
G. G. Patil, and R. K. Banyal, “A Dynamic Unconstrained Feature Matching Algorithm for Face Recognition” 2020  Journal of Advances in Information Technology Vol,   11 (2).
[9]
C. W. Chuang, and C. P. Fan, “Deep-Learning Based Joint Iris and Sclera Recognition with YOLO Network for Identity Identification” 2021 Journal of Advances in Information Technology Vol,   12 (1).
[10]
J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779-788.
[11]
R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 580-587.
[12]
R. Girshick, "Fast R-CNN," 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, pp. 1440-1448.
[13]
S. Ren, K. He, R Girshick and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks." 2015 arXiv preprint arXiv:1506.01497.
[14]
T. Lin, M. BelongieJames, H. PeronaDeva, P. Dollár and CHarles L. Zitnick, “Microsoft coco: Common objects in context,” In: European conference on computer vision. Springer, Cham, vol. 8693, pp. 740-755, Sep. 2014.

Index Terms

  1. Improvement of Detection Rate for Small Objects Using Pre-processing Network
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ICCCV '21: Proceedings of the 4th International Conference on Control and Computer Vision
        August 2021
        207 pages
        ISBN:9781450390477
        DOI:10.1145/3484274
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 23 November 2021

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. COCO dataset
        2. Coordinate Convolutional
        3. Object Detection
        4. pre-processing network

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Funding Sources

        • Institute of Information & communications Technology Planning&Evaluation

        Conference

        ICCCV'21

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 26
          Total Downloads
        • Downloads (Last 12 months)4
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 20 Jan 2025

        Other Metrics

        Citations

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media