Transitive closure on the cell broadband engine: A study on self-scheduling in a multicore processor

Publisher: IEEE

Abstract:

In this paper, we present a mapping methodology and optimizations for solving transitive closure on the Cell multicore processor. Using our approach, it is possible to achieve near-peak performance for transitive closure on the Cell processor. We first parallelize the standard Floyd-Warshall algorithm and show, through analysis and experimental results, that data communication is a bottleneck for performance and scalability. We then parallelize a cache-optimized version of the Floyd-Warshall algorithm to remove the memory bottleneck. As is the case with several scientific computing and industrial applications on a multicore processor, synchronization and scheduling of the cores play a crucial role in determining the performance of this algorithm. We define a self-scheduling mechanism for the cores of a multicore processor and design a self-scheduler for the blocked Floyd-Warshall algorithm on the Cell multicore processor to remove the scheduling bottleneck. We also present optimizations in scheduling order to remove synchronization points. Our implementations achieved up to 78 GFLOPS.
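
For readers unfamiliar with the two algorithm variants named in the abstract, the following is a minimal sequential sketch in Python, not the authors' Cell implementation. It shows the standard Floyd-Warshall formulation of transitive closure next to a blocked (tiled) formulation of the kind usually meant by "cache-optimized". The function names, the block size parameter b, and the three-phase ordering are assumptions for illustration; the paper's self-scheduler and its Cell-specific data movement are not reproduced here.

```python
def transitive_closure(adj):
    """Standard Floyd-Warshall closure on a boolean adjacency matrix."""
    n = len(adj)
    reach = [row[:] for row in adj]  # work on a copy
    for k in range(n):
        for i in range(n):
            if reach[i][k]:
                for j in range(n):
                    if reach[k][j]:
                        reach[i][j] = True
    return reach


def _update_block(reach, bi, bj, bk, b, n):
    """Update block (bi, bj) using the intermediate vertices of block bk."""
    for k in range(bk * b, min((bk + 1) * b, n)):
        for i in range(bi * b, min((bi + 1) * b, n)):
            if reach[i][k]:
                for j in range(bj * b, min((bj + 1) * b, n)):
                    if reach[k][j]:
                        reach[i][j] = True


def blocked_transitive_closure(adj, b=2):
    """Blocked Floyd-Warshall closure; b is an illustrative block size."""
    n = len(adj)
    reach = [row[:] for row in adj]
    nb = (n + b - 1) // b  # number of blocks per dimension
    for bk in range(nb):
        # Phase 1: the diagonal block depends only on itself.
        _update_block(reach, bk, bk, bk, b, n)
        # Phase 2: blocks sharing a row or column with the diagonal block.
        for x in range(nb):
            if x != bk:
                _update_block(reach, bk, x, bk, b, n)
                _update_block(reach, x, bk, bk, b, n)
        # Phase 3: all remaining blocks; these are mutually independent.
        for bi in range(nb):
            for bj in range(nb):
                if bi != bk and bj != bk:
                    _update_block(reach, bi, bj, bk, b, n)
    return reach
```

In this sketch the phase 3 block updates within one iteration of bk do not depend on each other, which is the kind of work a scheduler could hand out to separate cores; the boundaries between phases are where synchronization would be needed in a parallel version.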
Date of Conference: 23-29 May 2009
Date Added to IEEE Xplore: 10 July 2009
ISBN Information:
Print ISSN: 1530-2075
Conference Location: Rome, Italy
