Loading web-font TeX/Main/Regular
Constraints-Aware Trainable Pruning With System Optimization For The On-Demand Offloading Edge-Cloud Collaborative System | IEEE Conference Publication | IEEE Xplore

Constraints-Aware Trainable Pruning With System Optimization For The On-Demand Offloading Edge-Cloud Collaborative System


Abstract:

Edge-cloud collaborative systems are becoming essential for neural networks in edge applications. However, while applying pruning on the edge model is a typical process, ...Show More

Abstract:

Edge-cloud collaborative systems are becoming essential for neural networks in edge applications. However, while applying pruning on the edge model is a typical process, previous works have simply removed the networks' sparsity with a global strategy, neglecting the filters' traits, environmental constraints, and edge-cloud joint optimization. This may lead to sub-optimal performance. In this paper, we propose a trainable pruning method considering system optimization. We introduce trainable gates to achieve filter-wise optimization regarding computation-aware and bandwidth-aware training to achieve a better performance. In addition, we adopt a performance predictor and a two-stage training strategy to estimate the optimal constraints-accuracy trade-off to the entire system. Finally, we validate the proposed method on the CIFAR-100 and the Tiny-ImageNet-200 dataset, and the results show that our approach significantly reduces the computation complexity and bandwidth requirement by approximately 75\% and 70\% compared to the prior work.
Date of Conference: 17-20 September 2023
Date Added to IEEE Xplore: 23 October 2023
ISBN Information:

ISSN Information:

Conference Location: Rome, Italy

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.