Balance Multi-Head Attention based on Software and Hardware Co-design | IEEE Conference Publication | IEEE Xplore