Abstract
Although the use of FPGAs for embedded-oriented CNN accelerators has been spreading owing to a high degree of parallelism and low power consumption, it is still difficult to achieve high performance with FPGAs under strict power constraint. In this paper, we propose a YOLOv3-tiny-based new network model for object detection which uses depthwise separable convolution, and evaluate the effectiveness of the proposal in terms of processing speed, detection accuracy, and power consumption. As a result, we reduced 30% of total latency and 20% of total power. The results showed that depthwise separable convolution is effective not only for improving performance but also for reducing power consumption.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Redmon, J., Farhadi, A.: YOLOv3: an incremental improvement (2018). https://pjreddie.com/media/files/papers/YOLOv3.pdf
Redmon, J.C.: Yolo: real-time object detection. https://pjreddie.com/darknet/yolo/
Yu, Z., Bouganis, C.-S.: A parameterisable FPGA-tailored architecture for YOLOv3-tiny. Appl. Reconfig. Comput. Arch. Tools Appl. 330–334 (2020)
Wai, Y.J., Yussof, Z.B.M., Salim, S.I.B., Chuan, L.K.: Fixed point implementation of tiny-Yolo-v2 using OpenCL on FPGA. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 9(10), 506–512 (2018)
Nguyen, D.T., Nguyen, T.N., Kim, H., Lee, H.-J.: A high-throughput and power-efficient FPGA implementation of YOLO CNN for object detection. IEEE Trans. Very Large Scale Integr. (VLSI) Syst.27(8) (2019)
Zhang, N., Wei, X., Chen, H., Liu, W.: FPGA implementation for CNN-based optical remote sensing object detection. Electronics 10(3), 2021 (2021)
Howard, A.G., et al.: MobileNets: efficient convolutional neural networks for mobile vision application. arXiv: 1704.04861v1 [cs.CV] (2017)
Li, Y., Han, Z., Xu, H., Liu, L., Li, X., Zhang, K.: YOLOv3-lite: a lightweight crack detection network for aircraft structure based on depthwise separable convolutions. Appl. Sci. 9(18) (2019)
Dang, L., Pang, P., Lee, J.: Depth-wise separable convolution neural network with residual connection for hyperspectral image classification. Remote Sens. 12(20) (2020)
Kuznetsova, A., et al.: The open images dataset v4: unified image classification, object detection, and visual relationship detection at scale. IJCV 128, 1956–1981 (2020)
Linder-Noren, E.: PyTorch-YOLOv3. https://github.com/eriklindernoren/PyTorch-YOLOv3
Dallinger, D.: FPGA optimized dynamic post-training quantization of tiny-YoloV3 (2021). https://publik.tuwien.ac.at/files/publik_296008.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Araki, Y., Matsuda, M., Manabe, T., Ishizuka, Y., Shibata, Y. (2022). FPGA Implementation of an Object Recognition System with Low Power Consumption Using a YOLOv3-tiny-based CNN. In: Barolli, L. (eds) Complex, Intelligent and Software Intensive Systems. CISIS 2022. Lecture Notes in Networks and Systems, vol 497. Springer, Cham. https://doi.org/10.1007/978-3-031-08812-4_22
Download citation
DOI: https://doi.org/10.1007/978-3-031-08812-4_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08811-7
Online ISBN: 978-3-031-08812-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)