Loading [a11y]/accessibility-menu.js
Collaborative Accelerators for Streamlining MapReduce on Scale-up Machines With Incremental Data Aggregation | IEEE Journals & Magazine | IEEE Xplore

Collaborative Accelerators for Streamlining MapReduce on Scale-up Machines With Incremental Data Aggregation


Abstract:

The MapReduce programming paradigm has been increasingly adopted to implement data-intensive applications processing both small and large scale datasets. As most jobs in ...Show More

Abstract:

The MapReduce programming paradigm has been increasingly adopted to implement data-intensive applications processing both small and large scale datasets. As most jobs in data centers have a data footprint in the order of gigabytes, emerging high-end scale-up machines are capable of running most data center processing tasks, thus significantly improving power and server density. However, this approach provides limited performance and energy efficiency because of inefficient utilization of the memory subsystem and serial execution within the MapReduce programming model. Recent work has proposed a distributed hardware acceleration architecture, called CASM, which augments each core in a scale-up machine with a lightweight compute engine. The CASM's network of accelerators operates concurrently with the cores in executing MapReduce stages and reduces significantly traffic to/from storage. In this article, we study the benefits and applicability of CASM, by offering an extensive analysis of design parameters and of its scalable performance on a wide range of applications, and exploring its applicability to incremental data aggregation tasks. Our experimental evaluation indicates that CASM reduces off-chip traffic by four times on average over a chip multiprocessor solution, while scaling well with the number of cores in the system, and it is highly effective in providing incremental results that approximate final outcomes.
Published in: IEEE Transactions on Computers ( Volume: 69, Issue: 8, 01 August 2020)
Page(s): 1233 - 1247
Date of Publication: 22 June 2020

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.