Processing math: 100%
Mobileware: Distributed Architecture With Channel Stationary Dataflow for MobileNet Acceleration | IEEE Journals & Magazine | IEEE Xplore

Mobileware: Distributed Architecture With Channel Stationary Dataflow for MobileNet Acceleration


Abstract:

The depthwise separable convolution, a key feature of the MobileNet models, has a different input reuse pattern from the conventional standard convolution, and a smaller ...Show More

Abstract:

The depthwise separable convolution, a key feature of the MobileNet models, has a different input reuse pattern from the conventional standard convolution, and a smaller number of input/weight pairs are used for a dot product, thereby leading to extremely low MAC utilization. This article proposes a Mobileware architecture for the high-performance acceleration of the MobileNet workloads. A new channel stationary dataflow architecture distributes the on-chip buffers, and the distributed SRAMs are placed near each processing element (PE). By doing so, PEs and SRAMs can communicate with high bandwidth. Our Mobileware architecture shows 1.4\times 29.5\times higher throughput than conventional weight stationary-based hardware architecture, and the proposed design was verified on the Xilinx ZCU102 FPGA evaluation board.
Page(s): 2661 - 2673
Date of Publication: 21 March 2024

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.