Cited By
View all- Cheng SLin SDiao LWu HWang SSi CLiu ZZhao XDu JLin WYou YEeckhout LSmaragdakis GLiang KSampson AKim MRossbach C(2025)Concerto: Automatic Communication Optimization and Scheduling for Large-Scale Deep LearningProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707223(198-213)Online publication date: 3-Feb-2025
- Li XGuo CQian KZhang MYang MXu M(2024)Near-Lossless Gradient Compression for Data-Parallel Distributed DNN TrainingProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698541(977-994)Online publication date: 20-Nov-2024
- Li BWang XWang JLiu YGong YLu HDang WZhang WHuang XChen MChen JHe CLiu YHu XLiu CJi XXia YLi XHe ZWang YZou X(2024)TCCL: Co-optimizing Collective Communication and Traffic Routing for GPU-centric ClustersProceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing10.1145/3672198.3673799(48-53)Online publication date: 4-Aug-2024
- Show More Cited By