JointNF: Enhancing DNN Performance through Adaptive N:M Pruning across both Weight and Activation
Abstract
References
Recommendations
Reweighted Alternating Direction Method of Multipliers for DNN weight pruning
AbstractAs Deep Neural Networks (DNNs) continue to grow in complexity and size, leading to a substantial computational burden, weight pruning techniques have emerged as an effective solution. This paper presents a novel method for dynamic regularization-...
Attention-based adaptive structured continuous sparse network pruning
AbstractDeep neural network models, especially CNNs, have a wide range of applications in many fields, but their high computational power requirements limit the deployment applications in many resource-constrained embedded devices. Pruning techniques ...
Efficient label-free pruning and retraining for Text-VQA Transformers
AbstractRecent advancements in Scene Text Visual Question Answering (Text-VQA) employ autoregressive Transformers, showing improved performance with larger models and pre-training datasets. Although various pruning frameworks exist to simplify ...
Highlights- We study a label-free importance score for structured pruning of autoregressive Transformers.
- We propose an adaptive retraining approach for pruned Transformer models of varying sizes.
- Our pruned model achieve up to 60% reduction ...
Comments
Information & Contributors
Information
Published In
- Chair:
- Pascal Meinerzhagen,
- Program Chair:
- Kapil Dev,
- Program Co-chair:
- Jerald Yoo
Sponsors
- SIGDA: ACM Special Interest Group on Design Automation
- IEEE CAS
- IEEE EDA
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Conference
Acceptance Rates
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 123Total Downloads
- Downloads (Last 12 months)123
- Downloads (Last 6 weeks)29
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in