A programmable shared-memory system for an array of processing-in-memory devices

Lee, Sangkuen; Sim, Hyogi; Kim, Youngjae; Vazhkudai, Sudharshan S.

doi:10.1007/s10586-018-2844-1

Title: A programmable shared-memory system for an array of processing-in-memory devices

Journal Article · Thu Aug 30 00:00:00 EDT 2018 · Cluster Computing

DOI:https://doi.org/10.1007/s10586-018-2844-1· OSTI ID:1468266

^[1];

^[1]; Kim, Youngjae ^[2];

^[1]

Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sogang Univ., Seoul (Republic of Korea)

Processing in memory (PIM), the concept of integrating processing directly with memory has been attracting a lot of attention, since PIM can assist in overcoming the throughput limitation caused by data movement between CPU and memory. The challenge, however, is that it requires the programmers to have a deep understanding of the PIM architecture to maximize the benefits such as data locality and parallel thread execution on multiple PIM devices. In this study, we present AnalyzeThat, a programmable shared-memory system for parallel data processing with PIM devices. Thematic to AnalyzeThat is a rich PIM-aware data structure (PADS), which is an encapsulation that integrally ties together the data, the analysis tasks and the runtime needed to interface with the PIM device array. The PADS abstraction provides (i) a sophisticated key-value data container that allows programmers to easily store data on multiple PIMs, (ii) a suite of parallel operations with which users can easily implement data analysis applications, and (iii) a runtime, hidden to programmers, which provides the mechanisms needed to overlay both the data and the tasks on the PIM device array in an intelligent fashion, based on PIM-specific information collected from the hardware. We have developed a PIM emulation framework called AnalyzeThat. In conclusion, our experimental evaluation with representative data analytics applications suggests that the proposed system can significantly reduce the PIM programming effort without losing its technology benefits.

View Accepted Manuscript (DOE)

Cite

Export

Save

Research Organization:: Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)

Sponsoring Organization:: USDOE Office of Science (SC)

Grant/Contract Number:: AC05-00OR22725

OSTI ID:: 1468266

Journal Information:: Cluster Computing, Vol. 22; ISSN 1386-7857

Publisher:: SpringerCopyright Statement

Country of Publication:: United States

Language:: English

References (20)

The missing memristor found Strukov, Dmitri B.; Snider, Gregory S.; Stewart, Duncan R. Nature, Vol. 453, Issue 7191 https://doi.org/10.1038/nature06932	journal	May 2008
MapReduce: simplified data processing on large clusters Dean, Jeffrey; Ghemawat, Sanjay; Mehta, Brijesh Communications of the ACM, Vol. 51, Issue 1 https://doi.org/10.1145/1327452.1327492	journal	January 2008
The International Exascale Software Project roadmap Dongarra, Jack; Beckman, Pete; Moore, Terry The International Journal of High Performance Computing Applications, Vol. 25, Issue 1 https://doi.org/10.1177/1094342010391989	journal	January 2011
Dynamo: amazon's highly available key-value store DeCandia, Giuseppe; Hastorun, Deniz; Jampani, Madan ACM SIGOPS Operating Systems Review, Vol. 41, Issue 6 https://doi.org/10.1145/1323293.1294281	journal	October 2007
FlashStore: high throughput persistent key-value store Debnath, Biplob; Sengupta, Sudipta; Li, Jin Proceedings of the VLDB Endowment, Vol. 3, Issue 1-2 https://doi.org/10.14778/1920841.1921015	journal	September 2010
SkewTune: mitigating skew in mapreduce applications Kwon, YongChul; Balazinska, Magdalena; Howe, Bill Proceedings of the 2012 international conference on Management of Data - SIGMOD '12 https://doi.org/10.1145/2213836.2213840	conference	January 2012
The architecture of the DIVA processing-in-memory chip Draper, Jeff; Kang, Chang Woo; Kim, Ihn Proceedings of the 16th international conference on Supercomputing - ICS '02 https://doi.org/10.1145/514191.514197	conference	January 2002
FlexRAM: Toward an advanced Intelligent Memory system Kang, Yi; Huang, Wei; Yoo, Seung-Moon 2012 IEEE 30th International Conference on Computer Design (ICCD 2012), 2012 IEEE 30th International Conference on Computer Design (ICCD) https://doi.org/10.1109/ICCD.2012.6378608	conference	September 2012
Phoenix++: modular MapReduce for shared-memory systems Talbot, Justin; Yoo, Richard M.; Kozyrakis, Christos Proceedings of the second international workshop on MapReduce and its applications - MapReduce '11 https://doi.org/10.1145/1996092.1996095	conference	January 2011
A low cost, multithreaded processing-in-memory system Brockman, Jay B.; Thoziyoor, Shyamkumar; Kuntz, Shannon K. Proceedings of the 3rd workshop on Memory performance issues in conjunction with the 31st international symposium on computer architecture - WMPI '04 https://doi.org/10.1145/1054943.1054946	conference	January 2004
NDC: Analyzing the impact of 3D-stacked memory+logic devices on MapReduce workloads Pugsley, Seth H.; Jestes, Jeffrey; Zhang, Huihui 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) https://doi.org/10.1109/ISPASS.2014.6844483	conference	March 2014
A new perspective on processing-in-memory architecture design Zhang, Dong Ping; Jayasena, Nuwan; Lyashevsky, Alexander Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and Correctness - MSPC '13 https://doi.org/10.1145/2492408.2492418	conference	January 2013
Processing-in-memory technology for knowledge discovery algorithms Adibi, Jafar; Barrett, Tim; Bhatt, Spundun Proceedings of the 2nd international workshop on Data management on new hardware - DaMoN '06 https://doi.org/10.1145/1140402.1140405	conference	January 2006
TOP-PIM: throughput-oriented programmable processing in memory Zhang, Dongping; Jayasena, Nuwan; Lyashevsky, Alexander Proceedings of the 23rd international symposium on High-performance parallel and distributed computing - HPDC '14 https://doi.org/10.1145/2600212.2600213	conference	January 2014
Phoenix rebirth: Scalable MapReduce on a large-scale shared-memory system Yoo, Richard M.; Romano, Anthony; Kozyrakis, Christos 2009 IEEE International Symposium on Workload Characterization (IISWC) https://doi.org/10.1109/IISWC.2009.5306783	conference	October 2009
Mars: a MapReduce framework on graphics processors He, Bingsheng; Fang, Wenbin; Luo, Qiong Proceedings of the 17th international conference on Parallel architectures and compilation techniques - PACT '08 https://doi.org/10.1145/1454115.1454152	conference	January 2008
AnalyzeThat: A Programmable Shared-Memory System for an Array of Processing-In-Memory Devices Lee, Sangkuen; Sim, Hyogi; Kim, Youngjae 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) https://doi.org/10.1109/CCGRID.2017.72	conference	May 2017
A Comprehensive Performance Comparison of CUDA and OpenCL Fang, Jianbin; Varbanescu, Ana Lucia; Sips, Henk 2011 International Conference on Parallel Processing (ICPP) https://doi.org/10.1109/ICPP.2011.45	conference	September 2011
Power-Law Distribution of the World Wide Web Adamic, Lada A.; Huberman, Bernardo A.; Barabási, A. -L. Science, Vol. 287, Issue 5461 https://doi.org/10.1126/science.287.5461.2115a	journal	March 2000
Comparing Implementations of Near-Data Computing with In-Memory MapReduce Workloads Pugsley, Seth H.; Jestes, Jeffrey; Balasubramonian, Rajeev IEEE Micro, Vol. 34, Issue 4 https://doi.org/10.1109/MM.2014.54	journal	July 2014

Similar Records

AnalyzeThat: A Programmable Shared-Memory System for an Array of Processing-In-Memory Devices

Conference · Mon May 01 00:00:00 EDT 2017 · OSTI ID:1468266

Lee, Sangkuen; Sim, Hyogi; Kim, Youngjae; +1 more

Data Locality Enhancement of Dynamic Simulations for Exascale Computing (Final Report)

Technical Report · Fri Nov 29 00:00:00 EST 2019 · OSTI ID:1468266

Shen, Xipeng

HPC-Colony: Services and Interfaces to Aupport Systems With Very Large Numbers of Processors

Technical Report · Wed Jan 31 00:00:00 EST 2007 · OSTI ID:1468266

Jones, T; Kale, L; Moreira, J; +4 more

Related Subjects

97 MATHEMATICS AND COMPUTING
Programmable devices
Storage systems
Processing-in-memory
Big data processing

Title: A programmable shared-memory system for an array of processing-in-memory devices

Citation Formats

References (20)

Similar Records

Related Subjects