Cited By
View all- Wang MHe KCao PDuan JLv DYu ZChen YHuang CDou WChen GTian C(2025)Reunion: Receiver-driven network load balancing mechanism in AI training clustersComputer Networks10.1016/j.comnet.2025.111088259(111088)Online publication date: Mar-2025
- Xie MQian CLitz H(2024)En4S: Enabling SLOs in Serverless Storage SystemsProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698529(160-177)Online publication date: 20-Nov-2024
- Qian KXi YCao JGao JXu YGuan YFu BShi XZhu FMiao RWang CWang PZhang PZeng XRuan EYao ZZhai ECai DSekar VYu MSeneviratne AVeitch D(2024)Alibaba HPN: A Data Center Network for Large Language Model TrainingProceedings of the ACM SIGCOMM 2024 Conference10.1145/3651890.3672265(691-706)Online publication date: 4-Aug-2024
- Show More Cited By