skip to main content
10.1145/3358528.3358574acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbdtConference Proceedingsconference-collections
research-article

Survey of Deep Learning Based Object Detection

Authors Info & Claims
Published:28 August 2019Publication History

ABSTRACT

The main tasks of computer vision are image classification/location, target detection, target tracking, semantic segmentation and instance segmentation. The task of target detection is to output the borders and labels of a single target from the image. Object detection is an important issue in the field of computer vision. It has important research significance and application value in video monitoring, autonomous driving and human-computer interaction. In recent years, deep learning has made a breakthrough in the research of image classification and led to the rapid development of object vision detection. This paper briefly introduces the object detection algorithm based on deep learning. First, the basic process of object detection is introduced, then several current algorithms of object detection are introduced, and finally the future development trend is prospected.

References

  1. Viola P, Jones M. Rapid object detection using a boosted cascade of simple features[C]//Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on. IEEE, 2001, 1:I--I.Google ScholarGoogle Scholar
  2. Deng, Zhidong, and Lipu Zhou. "Detection and Recognition of Traffic Planar Objects Using Colorized Laser Scan and Perspective Distortion Rectification." IEEE Transactions on Intelligent Transportation Systems 19.5 (2018): 1485--1495.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ren S, He K, Girshick R, et al. Faster r-cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems. 2015: 91--99.Google ScholarGoogle Scholar
  4. He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.Google ScholarGoogle Scholar
  5. Vu, Khanh, Kien A. Hua, and Wallapak Tavanapong. "Image retrieval based on regions of interest." IEEE Transactions on knowledge and data engineering 15.4 (2003): 1045--1049.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Schuldt, Christian, Ivan Laptev, and Barbara Caputo. "Recognizing human actions: a local SVM approach." Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. Vol. 3. IEEE, 2004.Google ScholarGoogle Scholar
  7. Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580--587.Google ScholarGoogle Scholar
  8. Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779--788.Google ScholarGoogle Scholar
  9. Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing systems. 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ma, Jianqi, et al. "Arbitrary-oriented scene text detection via rotation proposals." IEEE Transactions on Multimedia (2018).Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. LeCun, Yann, et al. "Backpropagation applied to handwritten zip code recognition." Neural computation 1.4 (1989): 541--551.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//European conference on computer vision. Springer, Cham, 2016: 21--37.Google ScholarGoogle Scholar
  13. Li Z, Peng C, Yu G, et al. Light-head r-cnn: In defense of two-stage object detector[J]. arXiv preprint arXiv:1711.07264, 2017.Google ScholarGoogle Scholar
  14. Dai J, Li Y, He K, et al. R-fcn: Object detection via region-based fully convolutional networks[C]//Advances in neural information processing systems. 2016: 379--387.Google ScholarGoogle Scholar

Index Terms

  1. Survey of Deep Learning Based Object Detection

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICBDT '19: Proceedings of the 2nd International Conference on Big Data Technologies
      August 2019
      382 pages
      ISBN:9781450371926
      DOI:10.1145/3358528

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 August 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader