Design Guidance for Lightweight Object Detection Models

Wang, Rui; Wang, Xueli; Chen, Yunfang; Zhang, Wei

doi:10.1007/978-981-19-0852-1_16

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1563)

Included in the following conference series:

International Conference on Big Data and Security

Abstract

Lightweight object detection models are deployed in environments with limited computing power and tight power budgets, and they are widely used in many fields. Most current lightweight techniques focus on only a few steps of model implementation and lack a global perspective. This paper therefore proposes a general framework for implementing lightweight models, covering network construction indicators, lightweight backbone network design, and model optimization. By analyzing the complexity indicators of the network structure, the factors that affect network performance, such as depth and width, are summarized. On this basis, the One-Shot Aggregation (OSA) idea is combined with a Cross-Stage Partial Network (CSPNet) transformation to construct a general lightweight detection network, CSPOSA. Specific optimization strategies are then proposed to trim the network structure and refine the training process. For the network structure, the width and depth of the network are adjusted to compress the number of model parameters. The training process is divided into three stages (early, middle, and late) to improve detection performance without adding extra computation. With helmet detection on an embedded platform as the experimental scenario, the resulting model has 1/10 of the parameters of the mainstream models YOLOv3 and YOLOv4 at similar detection accuracy, which makes it more suitable for deployment on devices with limited computing power.
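
To make the abstract's point about width and depth concrete, the following minimal sketch counts the parameters of a plain stack of convolution layers and shows how scaling both factors shrinks the total. It is not the CSPOSA network from the paper: the helper names, the layer configuration (8 layers of 256 channels), and the 0.5 multipliers are illustrative assumptions. Only the per-layer formula (kernel × kernel × input channels × output channels, plus one bias per output channel) is the standard one.

```python
def conv_params(c_in, c_out, kernel=3, bias=True):
    """Parameter count of one standard 2D convolution layer."""
    weights = kernel * kernel * c_in * c_out
    return weights + (c_out if bias else 0)


def stack_params(channels, depth, width_mult=1.0, depth_mult=1.0, kernel=3):
    """Parameters of a hypothetical stack of equal-width conv layers.

    width_mult scales the channel count and depth_mult scales the
    number of layers; both are assumptions made for illustration.
    """
    c = max(1, round(channels * width_mult))
    n = max(1, round(depth * depth_mult))
    return n * conv_params(c, c, kernel)


# Illustrative baseline: 8 layers, 256 channels each.
baseline = stack_params(channels=256, depth=8)

# Halve both the width and the depth.
slim = stack_params(channels=256, depth=8, width_mult=0.5, depth_mult=0.5)

ratio = slim / baseline
print(baseline, slim, round(ratio, 3))
```

In this toy setting, halving the width cuts the weights per layer to roughly a quarter, because both input and output channels shrink, while halving the depth cuts the layer count in half. Together they leave about one eighth of the original parameters, which is why width is usually the stronger lever for parameter compression.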

Acknowledgement

This work is supported by the National Key R&D Program of China (No. 2019YFB2101700).

Author information

Authors and Affiliations

School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China
Rui Wang, Xueli Wang, Yunfang Chen & Wei Zhang
Jiangsu Key Laboratory of Big Data Security and Intelligent Processing, Nanjing University of Posts and Telecommunications, Nanjing, 210023, Jiangsu, China
Wei Zhang

Corresponding author

Correspondence to Wei Zhang.

Editor information

Editors and Affiliations

Nanjing Institute of Technology, Nanjing, China
Yuan Tian
Nanjing University of Information Science and Technology, Nanjing, China
Tinghuai Ma
King Saud University, Riyadh, Saudi Arabia
Muhammad Khurram Khan
Texas Tech University, Lubbock, TX, USA
Victor S. Sheng
Tianjin University, Tianjin, China
Zhaoqing Pan

About this paper

Cite this paper

Wang, R., Wang, X., Chen, Y., Zhang, W. (2022). Design Guidance for Lightweight Object Detection Models. In: Tian, Y., Ma, T., Khan, M.K., Sheng, V.S., Pan, Z. (eds) Big Data and Security. ICBDS 2021. Communications in Computer and Information Science, vol 1563. Springer, Singapore. https://doi.org/10.1007/978-981-19-0852-1_16

DOI: https://doi.org/10.1007/978-981-19-0852-1_16
Published: 10 March 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0851-4
Online ISBN: 978-981-19-0852-1
eBook Packages: Computer Science, Computer Science (R0)
