Enhancing Attention Models via Multi-head Collaboration | IEEE Conference Publication | IEEE Xplore