Abstract:
Early exit has been studied as a way to reduce the complex computation of convolutional neural networks. However, in order to determine whether to exit early in a convent...Show MoreMetadata
Abstract:
Early exit has been studied as a way to reduce the complex computation of convolutional neural networks. However, in order to determine whether to exit early in a conventional CNN accelerator, there is a problem that a unit for computing softmax layer having a large hardware overhead is required. To solve this problem, we propose a low cost early exit decision unit. The proposed architecture uses only fully-connected (FC) layer outputs to make early exit decisions, so the computation of the softmax layer is not necessary. Our implementation results show an energy reduction of 68% with an accuracy drop of less than 0.3%.
Published in: 2020 International SoC Design Conference (ISOCC)
Date of Conference: 21-24 October 2020
Date Added to IEEE Xplore: 01 February 2021
ISBN Information:
Print on Demand(PoD) ISSN: 2163-9612