
ViT-ToGo: Vision Transformer Accelerator with Grouped Token Pruning


Abstract:

Vision Transformer (ViT) has gained prominence for its performance in various vision tasks, but it comes with considerable computational and memory demands, posing a challenge when deploying it on resource-constrained edge devices. To address this limitation, various token pruning methods have been proposed to reduce the computation. However, the majority of token pruning techniques do not account for practical use in actual embedded devices, which demand a significant reduction in computational load. In this paper, we introduce ViT-ToGo, a ViT accelerator with grouped token pruning, which enables the parallel execution of the ViT model and the token pruning process. We implement grouped token pruning with a head-wise importance estimator, which simplifies the processing needed for token pruning, including sorting and reordering. Our proposed method achieves up to a 66% reduction in the number of tokens, resulting in up to a 36% reduction in GFLOPs, with only a minimal accuracy drop of around 1%. Furthermore, the hardware implementation incurs a marginal resource overhead of 1.13% on average.
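The abstract does not spell out the importance estimator or the grouping scheme, so the following PyTorch sketch is only illustrative. It assumes attention probabilities serve as the head-wise importance signal and that tokens are kept or dropped in fixed-size groups of consecutive tokens; the function and parameter names (grouped_token_prune, group_size, keep_ratio) are hypothetical and not taken from the paper.

```python
import torch

def grouped_token_prune(tokens, attn, group_size=4, keep_ratio=0.66):
    """Illustrative sketch of grouped token pruning (not the paper's
    exact method).

    tokens: (B, N, D) token embeddings (CLS token excluded for
            simplicity).
    attn:   (B, H, N, N) attention probabilities from one ViT block.
    Groups of `group_size` consecutive tokens are kept or dropped
    together, so no per-token reordering is required.
    """
    B, N, D = tokens.shape
    assert N % group_size == 0, "pad tokens so N divides by group_size"

    # Head-wise importance: attention each token receives within each
    # head (summed over queries), then aggregated across heads. This
    # is one plausible proxy; the paper's estimator may differ.
    head_importance = attn.sum(dim=2)           # (B, H, N)
    importance = head_importance.mean(dim=1)    # (B, N)

    # Aggregate importance over groups of consecutive tokens.
    groups = importance.view(B, N // group_size, group_size).mean(-1)
    n_keep = max(1, int(groups.size(1) * keep_ratio))

    # Keep the top-scoring groups. Note: top-k implies an implicit
    # sort; a fixed threshold could replace it in hardware to avoid
    # sorting entirely, as the paper's approach suggests.
    keep = groups.topk(n_keep, dim=1).indices   # (B, n_keep)
    offsets = torch.arange(group_size, device=tokens.device)
    idx = (keep.unsqueeze(-1) * group_size + offsets).reshape(B, -1)
    return tokens.gather(1, idx.unsqueeze(-1).expand(-1, -1, D))
```

Because entire groups are retained as contiguous runs, the gather above is the only data movement needed, which is what makes a pruning scheme like this amenable to parallel execution alongside the ViT pipeline on an accelerator.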
Date of Conference: 25-27 March 2024
Date Added to IEEE Xplore: 10 June 2024
Conference Location: Valencia, Spain
