Collaborative Accelerators for Streamlining MapReduce on Scale-up Machines With Incremental Data Aggregation

Abstract:

The MapReduce programming paradigm has been increasingly adopted to implement data-intensive applications that process both small- and large-scale datasets. Because most data center jobs have a data footprint on the order of gigabytes, emerging high-end scale-up machines are capable of running most data center processing tasks, thus significantly improving power and server density. However, this approach provides limited performance and energy efficiency because of inefficient utilization of the memory subsystem and serial execution within the MapReduce programming model. Recent work has proposed a distributed hardware acceleration architecture, called CASM, which augments each core in a scale-up machine with a lightweight compute engine. CASM's network of accelerators operates concurrently with the cores in executing MapReduce stages and significantly reduces traffic to and from storage. In this article, we study the benefits and applicability of CASM by offering an extensive analysis of its design parameters and of its scalable performance on a wide range of applications, and by exploring its applicability to incremental data aggregation tasks. Our experimental evaluation indicates that CASM reduces off-chip traffic by a factor of four on average compared with a chip multiprocessor solution, scales well with the number of cores in the system, and is highly effective in providing incremental results that approximate final outcomes.

Published in: IEEE Transactions on Computers (Volume: 69, Issue: 8, 01 August 2020)

Page(s): 1233 - 1247

Date of Publication: 22 June 2020

DOI: 10.1109/TC.2020.3004169
