Cited By
View all- Wei XJiang NYue HWang XZhao JLi GQiu M(2024)ApproxDup: Developing an Approximate Instruction Duplication Mechanism for Efficient SDC Detection in GPGPUsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2023.333082143:4(1051-1064)Online publication date: Apr-2024
- Xu DFeng YShin KKim DJeon HLi D(2024)Efficient Tensor Offloading for Large Deep-Learning Model Training based on Compute Express LinkSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00100(1-18)Online publication date: 17-Nov-2024
- Lu YZeng LWang TFu XLi WCheng HYang DJin ZCasas MLiu W(2024)AmgT: Algebraic Multigrid Solver on Tensor CoresSC24: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC41406.2024.00058(1-16)Online publication date: 17-Nov-2024
- Show More Cited By