keynote

Algorithms and application for grids and clouds

Author:
Geoffrey C. Fox

Indiana University, Bloomington, IN, USA

Indiana University, Bloomington, IN, USA
View Profile

SPAA '10: Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architecturesJune 2010Pages 144https://doi.org/10.1145/1810479.1810507

Published:13 June 2010Publication History

SPAA '10: Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures

Pages 144

ABSTRACT

We discuss the impact of clouds and grid technology on scientific computing using examples from a variety of fields -- especially the life sciences. We cover the impact of the growing importance of data analysis and note that it is more suitable for these modern architectures than the large simulations (particle dynamics and partial differential equation solution) that are mainstream use of large scale "massively parallel" supercomputers. The importance of grids is seen in the support of distributed data collection and archiving while clouds are and will replace grids for the large scale analysis of the data.

We discuss the structure of algorithms (and the associated applications) that will run on current clouds and use either the basic "on-demand" computing paradigm or higher level frameworks based on MapReduce and its extensions. Looking at performance of MPI (mainstay of scientific computing) and MapReduce both theoretically and experimentally shows that current MapReduce implementations run well on algorithms that are a "Map" followed by a "Reduce" but perform poorly on algorithms that iterate over many such phases. Several important algorithms including parallel linear algebra falls into latter class. One can define MapReduce extensions to accommodate iterative map and reduce but these have less fault tolerance than basic MapReduce. We discuss clustering, dimension reduction and sequence assembly and annotation as example algorithms.

Index Terms

Algorithms and application for grids and clouds
1. Computing methodologies
  1. Distributed computing methodologies
    1. Distributed programming languages
  2. Parallel computing methodologies
    1. Parallel programming languages
2. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types
        Distributed programming languages
        Parallel programming languages

Recommendations

Large scale data analytics on clouds
CloudDB '12: Proceedings of the fourth international workshop on Cloud data management

We summarize important overall issues affecting use of clouds to support Data Science. We describe the mapping of different applications to HPCC and Cloud systems and the architecture that support data analytics that is interoperable between these ...
Read More
MapReduce in MPI for Large-scale graph algorithms

We describe a parallel library written with message-passing (MPI) calls that allows algorithms to be expressed in the MapReduce paradigm. This means the calling program does not need to include explicit parallel code, but instead provides ''map'' and ''...
Read More
Application Level Interoperability between Clouds and Grids
GPC '09: Proceedings of the 2009 Workshops at the Grid and Pervasive Computing Conference

SAGA is a high-level programming interface which provides the ability to develop distributed applications in aninfrastructure independent way. In an earlier paper, we discussed how SAGA was used to develop a version of MapReduce which provided the user ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SPAA '10: Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
June 2010
378 pages
ISBN:9781450300797
DOI:10.1145/1810479
General Chairs:
Friedhelm Meyer auf der Heide
University of Paderborn, Germany
,
Cynthia Phillips
Sandia National Laboratories, USA
Copyright © 2010 Copyright is held by the author/owner(s)
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 June 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
blast
clouds
clustering
data deluge
dimension reduction
grids
life sciences
mapreduce
mpi
Qualifiers
- keynote
Conference

Acceptance Rates
Overall Acceptance Rate447of1,461submissions,31%
Upcoming Conference
SPAA '24

Sponsor:

sigact

sigact

36th ACM Symposium on Parallelism in Algorithms and Architectures

June 17 - 21, 2024

Nantes , France
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 308
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Algorithms and application for grids and clouds

SPAA '10: Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures

ABSTRACT

Cited By

Index Terms

Recommendations

Large scale data analytics on clouds

MapReduce in MPI for Large-scale graph algorithms

Application Level Interoperability between Clouds and Grids