Cited By
View all- Zhang BLi SLi Z(2024)MIGER: Integrating Multi-Instance GPU and Multi-Process Service for Deep Learning ClustersProceedings of the 53rd International Conference on Parallel Processing10.1145/3673038.3673089(504-513)Online publication date: 12-Aug-2024
- Strati FMa XKlimovic A(2024)Orion: Interference-aware, Fine-grained GPU Sharing for ML ApplicationsProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3629578(1075-1092)Online publication date: 22-Apr-2024
- Dhakal AKulkarni SRamakrishnan K(2024)D-STACK: High Throughput DNN Inference by Effective Multiplexing and Spatio-Temporal Scheduling of GPUsIEEE Transactions on Cloud Computing10.1109/TCC.2024.347621012:4(1344-1358)Online publication date: Oct-2024
- Show More Cited By