Cited By
View all- Shubha SShen HIyer AGavrilovska ATerry D(2024)USHERProceedings of the 18th USENIX Conference on Operating Systems Design and Implementation10.5555/3691938.3691989(947-964)Online publication date: 10-Jul-2024
- Pan ZSan Miguel JWu DTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)Carat: Unlocking Value-Level Parallelism for Multiplier-Free GEMMsProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3620665.3640364(167-184)Online publication date: 27-Apr-2024
- Qi JXiao WLi MYang CLi YLin WYang HLuan ZQian D(2024)ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIGIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.343118935:10(1708-1720)Online publication date: Oct-2024
- Show More Cited By