Cited By
- Wang W, Xia Y, Yang D, Zhou X, Cheng D (2024). Accelerating Distributed DLRM Training with Optimized TT Decomposition and Micro-Batching. SC24: International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-15. DOI: 10.1109/SC41406.2024.00055. Online publication date: 17-Nov-2024.
- Zhou G, Lan H, Xie Y, Tian W, Qian J, Su T (2024). CSIMD: Cross-Search Algorithm with Improved Multi-dimensional Dichotomy for Micro-Batch-Based Pipeline Parallel Training in DNN. Euro-Par 2024: Parallel Processing, pp. 288-301. DOI: 10.1007/978-3-031-69766-1_20. Online publication date: 26-Aug-2024.
- Jang H, Jung J, Song J, Yu J, Kim Y, Lee J (2023). Pipe-BD: Pipelined Parallel Blockwise Distillation. 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE), pp. 1-6. DOI: 10.23919/DATE56975.2023.10137044. Online publication date: Apr-2023.