Abstract:
Post-training quantization of neural networks consists of quantizing a model without retraining or hyperparameter search, while remaining fast and data-frugal. In this paper, we propose LatticeQ, a novel post-training weight quantization method designed for deep convolutional neural networks (DCNNs). In contrast to the scalar rounding widely used in state-of-the-art quantization methods, LatticeQ uses a quantizer based on lattices, which are discrete algebraic structures. LatticeQ exploits the inner correlations between model parameters to minimize quantization error. We achieve state-of-the-art results in post-training quantization. In particular, we achieve ImageNet classification results close to full precision on ResNet-18/50, with little to no accuracy drop for 4-bit models. Our code is available here, and a more thorough version of the paper here.
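To make the contrast between scalar rounding and lattice-based quantization concrete, the following is a minimal NumPy sketch, not the paper's actual LatticeQ algorithm. It uses Babai-style rounding with a hypothetical 2-D generator matrix B: the weight vector is expressed in the lattice basis, its coordinates are rounded, and the result is mapped back, so correlated weights are quantized jointly rather than coordinate by coordinate.

import numpy as np

def scalar_round(weights, step):
    # Baseline: round each weight independently to the nearest multiple of `step`.
    return step * np.round(weights / step)

def lattice_round(weights, basis):
    # Illustrative lattice quantization via Babai rounding:
    # express the weight vector in the lattice basis, round the coordinates,
    # then map back. The basis couples coordinates, so correlated weights
    # are rounded jointly instead of one value at a time.
    coords = np.linalg.solve(basis, weights)
    return basis @ np.round(coords)

# Toy example: a pair of correlated weights and a hypothetical generator matrix.
w = np.array([0.42, 0.37])
B = np.array([[0.25, 0.125],
              [0.0,  0.25]])
print(scalar_round(w, 0.25))   # per-coordinate rounding
print(lattice_round(w, B))     # joint rounding onto the lattice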
Date of Conference: 17-19 April 2023
Date Added to IEEE Xplore: 02 June 2023
Print on Demand (PoD) ISBN: 979-8-3503-9624-9