Full-Stack Optimizing Transformer Inference on ARM Many-Core CPU | IEEE Journals & Magazine | IEEE Xplore