DOI: 10.1145/2831244
DISCS '15: Proceedings of the 2015 International Workshop on Data-Intensive Scalable Computing Systems
ACM 2015 Proceeding
Publisher:
  • Association for Computing Machinery, New York, NY, United States
Conference:
SC15: The International Conference for High Performance Computing, Networking, Storage and Analysis, Austin, Texas, 15 November 2015
ISBN:
978-1-4503-3993-3
Published:
15 November 2015
Sponsors:
SIGHPC, SIGARCH, IEEE-CS/DATC

Abstract

Welcome to DISCS-2015! The Data Intensive Scalable Computing Systems (DISCS) workshop series facilitates dialogue about research at the intersection of data-intensive computing and traditional high performance computing (HPC). Traditional HPC systems were designed from a compute-centric perspective, with an emphasis on high floating-point performance. As scientific and analytics applications become more data intensive, there is a need to rethink HPC system architectures, programming models, runtime systems, and tools with a focus on data-intensive computing. Industry approaches to supporting data-intensive applications have been highly successful, leading many in the HPC community to explore ways to apply them. Conversely, the HPC community's expertise in designing, deploying, and using high performance systems is attractive to industry. The 2015 International Workshop on Data-Intensive Scalable Computing Systems provides a forum for researchers and practitioners in data-intensive computing and high performance parallel computing to exchange ideas and discuss approaches to the challenges of Big Data and data-intensive computing at large scale.

SESSION: New paradigms for data-intensive processing
research-article
SJM: an SCM-based journaling mechanism with write reduction for file systems

Considering the unique characteristics of storage class memory (SCM), such as non-volatility, fast access speed, byte-addressability, low-energy consumption, and in-place modification support, we investigated the features of over-write and append-write ...

research-article
Efficient disk-to-disk sorting: a case study in the decoupled execution paradigm

Many applications foreseen for the exascale era will need to process huge amounts of data. However, the I/O infrastructure of current supercomputing architectures cannot be generalized to deal with this amount of data due to the need for excessive data movement from ...

research-article
A low-cost adaptive data separation method for the flash translation layer of solid state drives

Solid state drives (SSDs) have shown great potential for data-intensive computing due to their much higher throughput and lower energy consumption compared to traditional hard disk drives. Within an SSD, its Flash Translation Layer (FTL) is responsible ...

research-article
Big data analytics on traditional HPC infrastructure using two-level storage

Data-intensive computing has become one of the major workloads on traditional high-performance computing (HPC) clusters. Currently, deploying data-intensive computing software frameworks on HPC clusters still faces performance and scalability issues. In ...

SESSION: Parallel I/O acceleration
research-article
Route-aware independent MPI I/O on the Blue Gene/Q

Scalable high-performance I/O is crucial for application performance on large-scale systems. With the growing complexity of the system interconnects, it has become important to consider the impact of network contention on I/O performance because the I/O ...

research-article
Experimental evaluation of a flexible I/O architecture for accelerating workflow engines in cloud environments

In the current scientific computing scenario, storage systems are one of the main bottlenecks in computing platforms. This issue affects both traditional high performance computing systems and modern systems based on cloud platforms. Accelerating the I/O ...

SESSION: Analytics frameworks
research-article
A case study of MapReduce speculation for failure recovery

MapReduce has become indispensable for big data analytics. As a representative implementation of MapReduce, Hadoop/YARN strives to provide outstanding performance in terms of job turnaround time, fault tolerance, etc. It is equipped with a speculation ...
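The core idea behind speculation is simple: when a task attempt looks like a straggler, launch a backup copy of it elsewhere and keep whichever attempt finishes first. A minimal sketch in Python, assuming a fixed straggler threshold (Hadoop/YARN's actual speculator is far more sophisticated, modeling per-task progress rates; all names and thresholds here are illustrative):

```python
# Toy sketch of MapReduce-style speculation: if a task attempt runs past a
# threshold, launch a backup attempt and take whichever copy finishes first.
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait

def run_with_speculation(task, arg, threshold=0.1):
    """Run task(arg); if it is still running after `threshold` seconds,
    start a backup attempt and return the first result to complete."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        attempts = {pool.submit(task, arg)}
        done, pending = wait(attempts, timeout=threshold,
                             return_when=FIRST_COMPLETED)
        if not done:                      # original attempt is a straggler
            attempts.add(pool.submit(task, arg))
            done, pending = wait(attempts, return_when=FIRST_COMPLETED)
        for f in pending:                 # best effort: drop the losing copy
            f.cancel()
        return next(iter(done)).result()

def square(x):
    return x * x

print(run_with_speculation(square, 7))  # fast task, no backup needed: 49
```

One limitation of the sketch: a losing attempt that has already started cannot be cancelled, so the pool still waits for it on shutdown; real frameworks kill the losing attempt's container instead.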

research-article
Supporting online analytics with user-defined estimation and early termination in a MapReduce-like framework

Online analytics based on runtime approximation has been widely adopted to meet time and/or resource constraints. Though MapReduce has been gaining popularity in both scientific and commercial sectors, there are several obstacles to implementing ...
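The pattern the title describes can be sketched outside any framework: process the input in batches, keep a running estimate, and terminate early once the estimate is good enough. A toy Python sketch, assuming a mean aggregate and a normal-approximation stopping rule (the function name, estimator, and thresholds are illustrative stand-ins for the user-defined versions the paper supports):

```python
# Toy sketch of online aggregation with early termination: consume input in
# batches, maintain a running mean, and stop once the confidence interval is
# narrow enough relative to the estimate.
import math
import random

def approximate_mean(stream, rel_error=0.01, z=1.96, min_items=30):
    """Consume `stream` batch by batch; return (estimate, items_seen) as soon
    as the z-score confidence half-width falls within rel_error * |mean|."""
    n, total, total_sq = 0, 0.0, 0.0
    for batch in stream:
        for x in batch:
            n += 1
            total += x
            total_sq += x * x
        if n < min_items:
            continue
        mean = total / n
        var = max(total_sq / n - mean * mean, 0.0)   # running variance
        half_width = z * math.sqrt(var / n)
        if mean and half_width / abs(mean) <= rel_error:
            return mean, n        # early termination: estimate is good enough
    return (total / n if n else 0.0), n  # fell through: exact over all input

random.seed(42)
batches = ([random.uniform(0, 100) for _ in range(1000)] for _ in range(100))
est, seen = approximate_mean(batches)
print(f"estimate={est:.2f} after {seen} of 100000 items")
```

With uniform data the stopping rule fires after roughly an eighth of the input, which is the payoff of runtime approximation: most of the work is skipped at a bounded, user-chosen accuracy cost.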

research-article
Performance evaluation and tuning of BioPig for genomic analysis

In this study, we aim to optimize Hadoop parameters to improve the performance of BioPig on Amazon Web Services (AWS). BioPig is a toolkit for large-scale sequencing data analysis built on Hadoop and Pig, which enable easy parallel programming and ...

Contributors
  • Oak Ridge National Laboratory

Index Terms

  1. Proceedings of the 2015 International Workshop on Data-Intensive Scalable Computing Systems
        Index terms have been assigned to the content through auto-classification.


Acceptance Rates

DISCS '15 paper acceptance rate: 9 of 15 submissions, 60%. Overall acceptance rate: 19 of 34 submissions, 56%.

Year         Submitted   Accepted   Rate
DISCS '15    15          9          60%
DISCS-2013   19          10         53%
Overall      34          19         56%