Cited By
View all- Wang HXia YYang DZhou XCheng D(2025)Harnessing Inter-GPU Shared Memory for Seamless MoE Communication-Computation FusionProceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming10.1145/3710848.3710868(170-182)Online publication date: 28-Feb-2025
- Pan XLin WZhang LShi STang ZWang RLi BChu XEeckhout LSmaragdakis GLiang KSampson AKim MRossbach C(2025)FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts ModelsProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707272(524-539)Online publication date: 30-Mar-2025