Auto-tuning TRSM with an asynchronous task assignment model on multicore, multi-GPU and coprocessor systems | IEEE Conference Publication