Cited By
View all- Zhang YLi KYuan LCheng JZhang YCao TYang M(2024)LoRAStencil: Low-Rank Adaptation of Stencil Computation on Tensor CoresProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00059(1-17)Online publication date: 17-Nov-2024
- Naganawa YKamei HKanetaka YNogami HMaeda YFukushima N(2024)SIMD-Constrained Lookup Table for Accelerating Variable-Weighted Convolution on x86/64 CPUsIEEE Access10.1109/ACCESS.2024.335472012(15800-15819)Online publication date: 2024
- Tayeb HPaillat LBramas B(2023)Autovesk: Automatic Vectorized Code Generation from Unstructured Static Kernels Using Graph TransformationsACM Transactions on Architecture and Code Optimization10.1145/363170921:1(1-25)Online publication date: 9-Nov-2023
- Show More Cited By