An O(n) Time-Complexity Matrix Transpose on Torus Array Processor | IEEE Conference Publication | IEEE Xplore