ABSTRACT
Multi-cores such as the Intel Core 2 Duo, AMD Barcelona and IBM POWER6 are becoming ubiquitous. The number of cores and the resulting hardware parallelism is poised to increase rapidly in the foreseeable future. Nested thread-level speculative parallelization has been proposed as a means to exploit the hardware parallelism of such systems. In this paper, we present a methodology to gauge the efficacy of nested thread-level speculation with increasing level of nesting.
- AMD Confirms 12-Core Opteron Production, Februrary, 2010. http://www.hpcwire.com/blogs/AMD-Confirms-12-Core-Opteron-Production-85118242.html#38593146.Google Scholar
- Multicore Watershed, March, 2010. http://www.hpcwire.com/blogs/Multicore-Watershed-86426622.htm.Google Scholar
- H. Sutter and J. Larus. Software and the concurrency revolution. ACM Queue, 3(7), 2005. Google ScholarDigital Library
- A. Kejariwal, X. Tian, W. Li, M. Girkar, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. On the performance potential of different types of speculative thread-level parallelism. In Proceedings of the 20th ACM ICS, pages 24--35, 2006. Google ScholarDigital Library
- A. Kejariwal, X. Tian, M. Girkar, W. Li, S. Kozhukhov, H. Saito, U. Banerjee, A. Nicolau, A. V. Veidenbaum, and C. D. Polychronopoulos. Tight analysis of the performance potential of thread speculation using SPEC CPU2006. In Proceedings of the 12th ACM SIGPLAN Symposium on PPoPP, 2007. Google ScholarDigital Library
- A. Kejariwal, M. Girkar, X. Tian, H. Saito, A. Nicolau, A. V. Veidenbaum, and U. Banerjee. On the efficacy of call graph-level thread-level speculation. In Proceedings of the First Joint WOSP/SIPEW ICPE, pages 247--248, 2010. Google ScholarDigital Library
- J. Renau, J. Tuck, W. Liu, L. Ceze, K. Strauss, and J. Torrellas. Tasking with out-of-order spawn in TLS chip multiprocessors: Microarchitecture and compilation. In Proceedings of the 19th ACM ICS, pages 179--188, 2005. Google ScholarDigital Library
- SPEC CFP2006. http://www.spec.org/cpu2006/CFP2006.Google Scholar
- P. Wu, A. Kejariwal, and C. Caşscaval. Compiler-driven dependence profiling to guide program parallelization. In Proceedings of the 21st International Workshop on LCPC, 2008. Google ScholarDigital Library
Index Terms
- Exploitation of nested thread-level speculative parallelism on multi-core systems
Recommendations
Speculative parallelization using software multi-threaded transactions
ASPLOS XV: Proceedings of the fifteenth International Conference on Architectural support for programming languages and operating systemsWith the right techniques, multicore architectures may be able to continue the exponential performance trend that elevated the performance of applications of all types for decades. While many scientific programs can be parallelized without speculative ...
Challenges in exploitation of loop parallelism in embedded applications
CODES+ISSS '06: Proceedings of the 4th international conference on Hardware/software codesign and system synthesisEmbedded processors have been increasingly exploiting hardware parallelism. Vector units, multiple processors or cores, hyper-threading, special-purpose accelerators such as DSPs or cryptographic engines, or a combination of the above have appeared in a ...
Converting thread-level parallelism to instruction-level parallelism via simultaneous multithreading
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruction-level parallelism (ILP) and thread-level parallelism (TLP). Wide-issue super-scalar processors exploit ILP by executing multiple instructions from a ...
Comments