Abstract
The lightweight target detection model is deployed in an environment with limited computing power and power consumption, which is widely used in many fields. Most of the current lightweight technologies only focus on a few steps of the model implementation and lack a global perspective. Therefore, this paper proposes a general lightweight model implementation framework, including network construction indicators, lightweight backbone network design, and model optimization. By analyzing the complexity indicator of network structure, the factors that affect network performance such as depth and width are summarized. On this basis, combine the One-Shot Aggregation (OSA) idea and Cross-Stage Partial Network (CSPNet) transformation to construct a general lightweight detection network CSPOSA. Further specific optimization strategies are proposed to prune the network structure and training process. For the network structure, the width and depth of the network are adjusted, the amount of parameters of the model is compressed. The training process is divided into the first, middle and last three stages to improve the detection performance of the model without adding extra computation. Taking embedded platform helmet detection as the experimental scene, the parameter amount of the realized model is 1/10 of the mainstream models YOLOv3 and YOLOv4, and the detection accuracy is similar, so it is more suitable for deployment on devices with limited computing power.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bochkovskiy, A., Wang, C., Liao, H.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)
Chao, P., Kao, C., Ruan, Y., et al.: HardNet: a low memory traffic network, pp. 3552–3561 (2019)
Girshick, R., Donahue, J., Darrell, T., et al.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2015)
He, K., Gkioxari, G., Dollár, P., et al.: Mask R-CNN, pp. 2961–2969 (2017)
He, K., Zhang, X., Ren, S., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904–1916 (2015)
Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015)
Huang, R., Pedoeem, J., Chen, C.: YOLO-LITE: a real-time object detection algorithm optimized for non-GPU computers, pp. 2503–2510 (2018)
Krishnamoorthi, R.: Quantizing deep convolutional networks for efficient inference: a whitepaper. arXiv preprint arXiv:1806.08342 (2018)
Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
LeCun, Y.: Lenet-5, convolutional neural networks 20(5), 14 (2015). http://yann.lecun.com/exdb/lenet
Li, H., Kadav, A., Durdanovic, I., et al.: Pruning filters for efficient convnets. arXiv preprint arXiv:1608.08710 (2016)
Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot multibox detector, pp. 21–37 (2016)
Plastiras, G., Kyrkou, C., Theocharides, T.: EdgeNet: balancing accuracy and performance for edge-based convolutional neural network object detectors. In: Proceedings of the 13th International Conference on Distributed Smart Cameras, pp. 1–6 (2019)
Redmon, J., Divvala, S., Girshick, R., et al.: You only look once: unified, real-time object detection, pp. 779–788 (2016)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, vol. 28, pp. 91–99 (2015)
Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)
Acknowledgement
This work is supported by National Key R&D Program of China (No. 2019YFB2101700).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, R., Wang, X., Chen, Y., Zhang, W. (2022). Design Guidance for Lightweight Object Detection Models. In: Tian, Y., Ma, T., Khan, M.K., Sheng, V.S., Pan, Z. (eds) Big Data and Security. ICBDS 2021. Communications in Computer and Information Science, vol 1563. Springer, Singapore. https://doi.org/10.1007/978-981-19-0852-1_16
Download citation
DOI: https://doi.org/10.1007/978-981-19-0852-1_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0851-4
Online ISBN: 978-981-19-0852-1
eBook Packages: Computer ScienceComputer Science (R0)