Cited By
View all- Li CXu Y(2024)Foreseer: Knowledge-Driven Acceleration of Memory-Bound Matrix Multiplications for Large Language Model InferenceProceedings of the 17th ACM International Systems and Storage Conference10.1145/3688351.3689153(53-67)Online publication date: 16-Sep-2024
- Ning ZHuang JBian HTan Z(2024)Research on GRAPES Semi-Implicit Semi-Lagrangian Computation Optimization Based on CPU+GPU Heterogeneity2024 6th International Conference on Electronics and Communication, Network and Computer Technology (ECNCT)10.1109/ECNCT63103.2024.10704385(411-417)Online publication date: 19-Jul-2024
- Jangda AMaleki SDehnavi MMusuvathi MSaarikivi OGrosser TDubach CSteuwer MXue JOttoni GQuintão Pereira F(2024)A Framework for Fine-Grained Synchronization of Dependent GPU KernelsProceedings of the 2024 IEEE/ACM International Symposium on Code Generation and Optimization10.1109/CGO57630.2024.10444873(93-105)Online publication date: 2-Mar-2024
- Show More Cited By