SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference | IEEE Conference Publication | IEEE Xplore