research-article

Improvement of Detection Rate for Small Objects Using Pre-processing Network

Authors:

Kwang Nam ChoiAuthors Info & Claims

ICCCV '21: Proceedings of the 4th International Conference on Control and Computer Vision

Pages 50 - 56

https://doi.org/10.1145/3484274.3484283

Published: 23 November 2021 Publication History

Get Access

Abstract

Artificial intelligence (AI) has been developing in a variety of methods over the past decade. However most AI experts worried to build a deep or wide network because the accuracy of AI models depends heavily on the depth of the network. In general deep and wide networks are better at learning than those that are less deep and wide and wide. On the other hand deeper networks are more complex and have many disadvantages such as computational cost and system specification dependency. We propose a novel method to improve the average recall rate for small objects in the deep convolutional network in the paper. The proposed method added pre-processing layer before the network rather than stacking the networks deeper or wide. The presented pre-processing layer consists of two major parts: up-sampling and down-sampling of the data. The overall objective of up-sampling and down-sampling is to enhance the resolution of small objects in the input image. The pre-processing network improves the average recall rate of the base network to 3.56%. This experiment result depicts that the proposed method outperforms the small object detection performance.

CCS CONCEPTS • Computing methodologies • Object detection

References

[1]

K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778.

Crossref

Google Scholar

[2]

T. Lin, P. Goyal, R. Girshick, K. He and P. Dollár, "Focal Loss for Dense Object Detection," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318-327, 1 Feb. 2020.

Crossref

Google Scholar

[3]

R. Liu, L. Joel, M. Piero, P. Felipe and F. Eric, "An intriguing failing of convolutional neural networks and the coordconv solution." 2018 arXiv preprint arXiv:1807.03247.

Google Scholar

[4]

G. Wang, A. Li, G. He, J. Liu. Z. Zhang, and M. Wang, “Classification of High Spatial Resolution Remote Sensing Images Based on Decision Fusion” 2017 Journal of Advances in Information Technology Vol, 8 (1).

Google Scholar

[5]

S. Bhadoria, P. Aggarwal, C. G. Dethe, and R. Vig, “Comparison of segmentation tools for multiple modalities in medical imaging” 2012 Journal of advances in information technology, 3 (4), 197-205.

Google Scholar

[6]

B. Xie, X. Zhu, C. Han, Y. Wang, X. Li and Y. Zhang, “Research of the Space-Borne Infrared Ship Target Recognition Technology Based on the Complex Background” 2019 Journal of Advances in Information Technology Vol, 10 (2).

Crossref

Google Scholar

[7]

B. Cheng, S. Cui, and T. Li, “Tensor Locality Preserving Projections Based Urban Building Areas Extraction from High-Resolution SAR Images” 2016 Journal of Advances in Information Technology Vol, 7 (4).

Crossref

Google Scholar

[8]

G. G. Patil, and R. K. Banyal, “A Dynamic Unconstrained Feature Matching Algorithm for Face Recognition” 2020 Journal of Advances in Information Technology Vol, 11 (2).

Crossref

Google Scholar

[9]

C. W. Chuang, and C. P. Fan, “Deep-Learning Based Joint Iris and Sclera Recognition with YOLO Network for Identity Identification” 2021 Journal of Advances in Information Technology Vol, 12 (1).

Google Scholar

[10]

J. Redmon, S. Divvala, R. Girshick and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779-788.

Crossref

Google Scholar

[11]

R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 2014, pp. 580-587.

Digital Library

Google Scholar

[12]

R. Girshick, "Fast R-CNN," 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, pp. 1440-1448.

Digital Library

Google Scholar

[13]

S. Ren, K. He, R Girshick and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks." 2015 arXiv preprint arXiv:1506.01497.

Google Scholar

[14]

T. Lin, M. BelongieJames, H. PeronaDeva, P. Dollár and CHarles L. Zitnick, “Microsoft coco: Common objects in context,” In: European conference on computer vision. Springer, Cham, vol. 8693, pp. 740-755, Sep. 2014.

Crossref

Google Scholar

Index Terms

Improvement of Detection Rate for Small Objects Using Pre-processing Network
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
      2. Computer vision tasks
  2. Machine learning

Index terms have been assigned to the content through auto-classification.

Recommendations

Object Detection by Combining Deep Dilated Convolutions Network and Light-Weight Network
Knowledge Science, Engineering and Management
Abstract
In recent years, the performance of object detection algorithm has been improved continuously, and it has become an important direction in the field of computer vision. All the work in this paper will be based on a two-stage object detection ...
Deep Learning-Based Object Detection in Diverse Weather Conditions

The number of different types of composite images has grown very rapidly in current years, making object detection an extremely critical task that requires a deeper understanding of various deep learning strategies that help to detect objects with higher ...
Enhanced Small Object Detection Neural Network
ICASIT 2020: Proceedings of the 2020 International Conference on Aviation Safety and Information Technology

Faster-RCNN is an vital deep learning object detection algorithm. Nevertheless, the small object detection effect of Faster-RCNN, which does not use multi-layer feature map, is not good enough. In this paper, a new network architecture called enhanced ...

Comments

Information & Contributors

Information

Published In

ICCCV '21: Proceedings of the 4th International Conference on Control and Computer Vision

August 2021

207 pages

ISBN:9781450390477

DOI:10.1145/3484274

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 November 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

Institute of Information & communications Technology Planning&Evaluation

Conference

ICCCV'21

ICCCV'21: 2021 4th International Conference on Control and Computer Vision

August 13 - 15, 2021

Macau, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
26
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Abstract

References

Index Terms

Recommendations

Object Detection by Combining Deep Dilated Convolutions Network and Light-Weight Network

Deep Learning-Based Object Detection in Diverse Weather Conditions

Enhanced Small Object Detection Neural Network

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Login options

Full Access

View options

PDF

eReader

HTML Format

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations