Loading [a11y]/accessibility-menu.js
Boosting SIMD Benefits through a Run-time and Energy Efficient DLP Detection | IEEE Conference Publication | IEEE Xplore

Boosting SIMD Benefits through a Run-time and Energy Efficient DLP Detection


Abstract:

Data Level Parallelism has been improving performance-energy tradeoff of current processors by coupling SIMD engines, such as Intel AVX and ARM NEON. Special libraries an...Show More

Abstract:

Data Level Parallelism has been improving performance-energy tradeoff of current processors by coupling SIMD engines, such as Intel AVX and ARM NEON. Special libraries and compilers are used to support DLP execution on such engines. However, timing overhead on hand coding is inevitable since most software developers are not skilled to extract DLP using unfamiliar libraries. In addition, DLP detection through compiler, besides breaking software compatibility, is limited to static code analysis, which compromises performance gains. In this work, we propose a runtime DLP detection named as Dynamic SIMD Assembler, which transparently identifies vectorizable code regions to execute in the ARM NEON engine. Due to its dynamic fashion, DSA keeps software compatibility and avoids timing overhead on software developing process. Results have shown that DSA outperforms ARM NEON auto-vectorization compiler by 32% since it covers wider vectorized regions, such as Dynamic Range, Sentinel and Conditional Loops. In addition, DSA outperforms hand-vectorized code using ARM library by 26% reducing 45% of energy consumption with no penalties over software development time.
Date of Conference: 25-29 March 2019
Date Added to IEEE Xplore: 16 May 2019
ISBN Information:

ISSN Information:

Conference Location: Florence, Italy

References

References is not available for this document.