skip to main content
10.1145/3425577.3425581acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicccvConference Proceedingsconference-collections
research-article

Weakly Supervised Learning with Discrimination Mechanism for Object Detection

Published:23 January 2021Publication History

ABSTRACT

In order to reduce the time consuming and expensive process of manually annotating data, and achieve the purpose of lightweight deployment. In this paper, an object detection method for weakly supervised learning with discrimination mechanism is proposed. We introduce the classification branch and the location branch based on the Darknet-53 backbone network of YOLO model, utilize Global Average Pooling (GAP) and Softmax to complete classification on selected areas, and adopt classification activation map for location. In addition, we use a model compression mechanism for model pruning operations, which reduces the size of the model and achieves the lightweight goal. These can effectively solve the problems of object detection to a certain extent. The results show that the improved model achieves good performance in terms of robustness and stability while maintaining the accuracy and efficiency of object detection, further improving the effectiveness of object detection tasks in practical application scenarios.

References

  1. Wang X, Zhang W, Wu X, Real-time vehicle type classification with deep convolutional neural networks[J]. Journal of Real-Time Image Processing, 2019, 16(1): 5-14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Li L, Zhang S, Wu J. Efficient Object Detection Framework and Hardware Architecture for Remote Sensing Images[J]. Remote Sensing, 2019, 11(20): 2376.Google ScholarGoogle ScholarCross RefCross Ref
  3. Olague G, Hernández D E, Llamas P, Brain programming as a new strategy to create visual routines for object tracking[J]. Multimedia Tools and Applications, 2019, 78(5): 5881-5918.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Wu X, Sahoo D, Zhang D, Single-shot bidirectional pyramid networks for high-quality object detection[J]. Neurocomputing, 2020.Google ScholarGoogle Scholar
  5. Stathopoulou E K, Remondino F. Semantic photogrammetry: boosting image-based 3D reconstruction with semantic labeling[J]. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2019, 42(2/W9).Google ScholarGoogle ScholarCross RefCross Ref
  6. Wellhausen L, Dosovitskiy A, Ranftl R, Where should i walk? predicting terrain properties from images via self-supervised learning[J]. IEEE Robotics and Automation Letters, 2019, 4(2): 1509-1516.Google ScholarGoogle ScholarCross RefCross Ref
  7. Hamza R, Hassan A, Huang T, An Efficient Cryptosystem for Video Surveillance in the Internet of Things Environment[J]. Complexity, 2019, 2019.Google ScholarGoogle Scholar
  8. Radosavovic I, Dollár P, Girshick R, Data distillation: Towards omni-supervised learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018: 4119-4128.Google ScholarGoogle Scholar
  9. Vorontsov E, Molchanov P, Byeon W, Boosting segmentation with weak supervision from image-to-image translation[J]. arXiv preprint arXiv:1904.01636, 2019.Google ScholarGoogle Scholar
  10. Kim B, Pardo B. Sound Event Detection Using Point-Labeled Data[C]//2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). IEEE, 2019: 1-5.Google ScholarGoogle Scholar
  11. Bresilla K, Perulli G D, Boini A, Single-shot convolution neural networks for real-time fruit detection within the tree[J]. Frontiers in plant science, 2019, 10.Google ScholarGoogle Scholar
  12. Drugman T, Pylkkonen J, Kneser R. Active and semi-supervised learning in ASR: Benefits on the acoustic and language models[J]. arXiv preprint arXiv:1903.02852, 2019.Google ScholarGoogle Scholar
  13. Sudharshan P J, Petitjean C, Spanhol F, Multiple instance learning for histopathological breast cancer image classification[J]. Expert Systems with Applications, 2019, 117: 103-111.Google ScholarGoogle ScholarCross RefCross Ref
  14. Georgiadis G. Accelerating Convolutional Neural Networks via Activation Map Compression[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 7085-7095.Google ScholarGoogle Scholar
  15. Kim K J, Kim P K, Chung Y S, Multi-scale detector for accurate vehicle detection in traffic surveillance data[J]. IEEE Access, 2019, 7: 78311-78319.Google ScholarGoogle ScholarCross RefCross Ref
  16. Hsiao T Y, Chang Y C, Chou H H, Filter-based deep-compression with global average pooling for convolutional networks[J]. Journal of Systems Architecture, 2019, 95: 9-18.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Long C, Collins R, Swears E, Deep neural networks in fully connected crf for image labeling with social network metadata[C]//2019 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 2019: 1607-1615.Google ScholarGoogle Scholar
  18. Courtiol P, Maussion C, Moarii M, Deep learning-based classification of mesothelioma improves prediction of patient outcome[J]. Nature medicine, 2019, 25(10): 1519-1525.Google ScholarGoogle Scholar
  19. Shen J, Liu N, Sun H, Vehicle Detection in Aerial Images Based on Hyper Feature Map in Deep Convolutional Network[J]. TIIS, 2019, 13(4): 1989-2011.Google ScholarGoogle Scholar
  20. Bharati P, Pramanik A. Deep Learning Techniques—R-CNN to Mask R-CNN: A Survey[M]//Computational Intelligence in Pattern Recognition. Springer, Singapore, 2020: 657-668.Google ScholarGoogle ScholarCross RefCross Ref
  21. Graham S, Vu Q D, Raza S E A, Hover-Net: Simultaneous segmentation and classification of nuclei in multi-tissue histology images[J]. Medical Image Analysis, 2019, 58: 101563.Google ScholarGoogle ScholarCross RefCross Ref
  22. Teh E W, Taylor G. Apparent age estimation with relational networks[C]//2019 16th Conference on Computer and Robot Vision (CRV). IEEE, 2019: 89-96.Google ScholarGoogle Scholar
  23. Wang D, Li C, Wen S, Daedalus: Breaking non-maximum suppression in object detection via adversarial examples[J]. arXiv, 2019: arXiv: 1902.02067.Google ScholarGoogle Scholar
  24. Yang H, Zhu Y, Liu J. Ecc: Platform-independent energy-constrained deep neural network compression via a bilinear regression model[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 11206-11215.Google ScholarGoogle Scholar
  25. Jiang Y, Wang S, Ko B J, Model Pruning Enables Efficient Federated Learning on Edge Devices[J]. arXiv preprint arXiv:1909.12326, 2019.Google ScholarGoogle Scholar
  26. Benjdira B, Khursheed T, Koubaa A, Car detection using unmanned aerial vehicles: Comparison between faster r-cnn and yolov3[C]//2019 1st International Conference on Unmanned Vehicle Systems-Oman (UVS). IEEE, 2019: 1-6.Google ScholarGoogle Scholar
  27. Maddern W, Pascoe G, Gadd M, Real-time Kinematic Ground Truth for the Oxford RobotCar Dataset[J]. arXiv preprint arXiv:2002.10152, 2020.Google ScholarGoogle Scholar
  28. Li Y, Chen Y, Wang N, Scale-aware trident networks for object detection[C]//Proceedings of the IEEE International Conference on Computer Vision. 2019: 6054-6063.Google ScholarGoogle Scholar
  29. Bao Z, Zhou W, Zhang W. Multi-grained Pruning Method of Convolutional Neural Network[C]//International Conference of Pioneering Computer Scientists, Engineers and Educators. Springer, Singapore, 2019: 564-576.Google ScholarGoogle Scholar
  30. Lee C P, Wright S J. Random permutations fix a worst case for cyclic coordinate descent[J]. IMA Journal of Numerical Analysis, 2019, 39(3): 1246-1275.Google ScholarGoogle ScholarCross RefCross Ref
  31. Ding X, Ding G, Guo Y, Centripetal sgd for pruning very deep convolutional networks with complicated structure[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 4943-4953.Google ScholarGoogle Scholar
  32. He Y, Liu P, Wang Z, Filter pruning via geometric median for deep convolutional neural networks acceleration[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 4340-4349.Google ScholarGoogle Scholar
  33. Salman S, Liu X. Sparsity as the Implicit Gating Mechanism for Residual Blocks[C]//2019 International Joint Conference on Neural Networks (IJCNN). IEEE, 2019: 1-6Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICCCV '20: Proceedings of the 3rd International Conference on Control and Computer Vision
    August 2020
    114 pages
    ISBN:9781450388023
    DOI:10.1145/3425577

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 23 January 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format