Cited By
View all- Agrawal AReddy SBhattamishra SNookala VVashishth VRong KTumanov A(2024)Inshrinkerator: Compressing Deep Learning Training Checkpoints via Dynamic QuantizationProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698553(1012-1031)Online publication date: 20-Nov-2024
- Byun WWoo JMukhopadhyay S(2024)Hessian-Aware KV Cache Quantization for LLMs2024 IEEE 67th International Midwest Symposium on Circuits and Systems (MWSCAS)10.1109/MWSCAS60917.2024.10658840(243-247)Online publication date: 11-Aug-2024
- Ahmadian ADash SChen HVenkitesh BGou SBlunsom PÜstün AHooker SOh ANaumann TGloberson ASaenko KHardt MLevine S(2023)Intriguing properties of quantization at scaleProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3667608(34278-34294)Online publication date: 10-Dec-2023
- Show More Cited By