Design Guidance for Lightweight Object Detection Models

  • Conference paper
Big Data and Security (ICBDS 2021)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1563)

Abstract

Lightweight object detection models are deployed in environments with limited computing power and tight power budgets, and they are widely used in many fields. Most current lightweight techniques focus on only a few steps of model implementation and lack a global perspective. This paper therefore proposes a general framework for implementing lightweight models, covering network construction indicators, lightweight backbone network design, and model optimization. By analyzing the complexity indicators of the network structure, the paper summarizes the factors that affect network performance, such as depth and width. On this basis, the One-Shot Aggregation (OSA) idea is combined with the Cross-Stage Partial Network (CSPNet) transformation to construct a general lightweight detection network, CSPOSA. Specific optimization strategies are then proposed for the network structure and the training process. For the network structure, the width and depth of the network are adjusted to compress the number of model parameters. The training process is divided into three stages (early, middle, and late) to improve detection performance without adding extra computation. With helmet detection on an embedded platform as the experimental scenario, the resulting model has 1/10 the parameters of the mainstream models YOLOv3 and YOLOv4 at similar detection accuracy, which makes it more suitable for deployment on devices with limited computing power.
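The link between network width and parameter count mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's CSPOSA network: the channel list, the width multiplier `alpha`, and all function names below are hypothetical, chosen only to show how standard convolution parameter and multiply-accumulate counts respond to width scaling.

```python
def conv2d_params(c_in, c_out, k, bias=True):
    """Learnable parameters of a standard k x k convolution."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def conv2d_macs(c_in, c_out, k, h_out, w_out):
    """Multiply-accumulate operations for one forward pass of that layer."""
    return c_in * c_out * k * k * h_out * w_out


def scale_width(channels, alpha, divisor=8):
    """Scale a channel count by alpha, rounded to a multiple of divisor."""
    return max(divisor, int(round(channels * alpha / divisor)) * divisor)


def stack_params(channels, k=3):
    """Total parameters of a plain stack of k x k convolutions (no bias).

    channels[0] is the input width; each later entry is a layer's output width.
    """
    return sum(
        conv2d_params(a, b, k, bias=False)
        for a, b in zip(channels, channels[1:])
    )


# Hypothetical channel widths, not taken from the paper.
base = [64, 128, 256, 512]
half = [scale_width(c, 0.5) for c in base]

# Halving every width cuts the parameter count to about a quarter,
# because each layer's count depends on c_in * c_out.
ratio = stack_params(half) / stack_params(base)
print(half, ratio)
```

Under these assumptions, width scaling reduces parameters roughly quadratically, whereas removing a layer (reducing depth) removes only that layer's share, which is one reason the two are usually tuned separately.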

Acknowledgement

This work is supported by the National Key R&D Program of China (No. 2019YFB2101700).

Author information

Corresponding author

Correspondence to Wei Zhang.

Copyright information

© 2022 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Wang, R., Wang, X., Chen, Y., Zhang, W. (2022). Design Guidance for Lightweight Object Detection Models. In: Tian, Y., Ma, T., Khan, M.K., Sheng, V.S., Pan, Z. (eds) Big Data and Security. ICBDS 2021. Communications in Computer and Information Science, vol 1563. Springer, Singapore. https://doi.org/10.1007/978-981-19-0852-1_16

  • DOI: https://doi.org/10.1007/978-981-19-0852-1_16

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-0851-4

  • Online ISBN: 978-981-19-0852-1

  • eBook Packages: Computer Science, Computer Science (R0)
