Cited By
- Chitty-Venkata K, Mittal S, Emani M, Vishwanath V, Somani A (2023). A survey of techniques for optimizing transformer inference. Journal of Systems Architecture 144 (102990). doi:10.1016/j.sysarc.2023.102990. Online publication date: Nov-2023.
- Huang S, Liu N, Liang Y, Peng H, Li H, Xu D, Xie M, Ding C (2022). An Automatic and Efficient BERT Pruning for Edge AI Systems. 2022 23rd International Symposium on Quality Electronic Design (ISQED), pp. 1-6. doi:10.1109/ISQED54688.2022.9806197. Online publication date: 6-Apr-2022.
- Peng H, Gurevin D, Huang S, Geng T, Jiang W, Khan O, Ding C (2022). Towards Sparsification of Graph Neural Networks. 2022 IEEE 40th International Conference on Computer Design (ICCD), pp. 272-279. doi:10.1109/ICCD56317.2022.00048. Online publication date: Oct-2022.