skip to main content
10.1145/1414558.1414599acmconferencesArticle/Chapter ViewAbstractPublication PagesiteConference Proceedingsconference-collections
research-article

Meeting the data challenge: curriculum development for parallel data systems

Published: 16 October 2008 Publication History

Abstract

The emergence of commodity-based high performance computing systems and low-cost storage systems in concert with the continued proliferation of data has created a significant need for technologists with expertise in parallel data systems. The training in this area, though, falls outside the traditional boundaries of the data management curriculum. In this paper, we describe our efforts in developing a new course focused on parallel data systems, which exploit the power of high performance computing and commodity hardware to deliver high throughput and well-scaled storage systems. We describe in detail the trends and forces driving the need for this course, the topics to be covered in this course, the data laboratory to be used with the course, assessment methods to measure student progress, and desired learning outcomes for the course.

References

[1]
V. A. Vyssotsky, F. J. Corbató, and R. M. Graham, "Structure of the Multics Supervisor." pp. 203--212.
[2]
G. Sanjay, G. Howard, and L. Shun-Tak, "The Google file system," in Proceedings of the nineteenth ACM symposium on Operating systems principles, Bolton Landing, NY, USA, 2003.
[3]
Council on Competitiveness. "Full Vehicle Design Optimization for Global Market Dominance," http://www.compete.org/pdf/HPC_Full_Design.pdf
[4]
Council on Competitiveness, "Keeping the Lifeblood Flowing: Boosting Oil and Gas Recovery from the Earth," 2005.
[5]
Council on Competitiveness, "Auto Crash Safety: It's Not Just for Dummies," 2005.
[6]
Council on Competitiveness, "Spin Fiber Faster to Gain a Competitive Edge for U.S. Textile Manufacturing," 2005.
[7]
Council on Competitiveness, "Customized Catalysts to Improve Crude Oil Yields: Getting More Bang from Each Barrel," 2005.
[8]
A. Ricadela, "File Systems That Fly," Information Week, June 20, 2005.
[9]
W. L. P. Carns, "PVFS: A Parallel File System for Linux Clusters," in Proceedings of the 4th Annual Linux Showcase & Conference, Atlanta, GA, 2000.
[10]
"Parallel Virtual File System 2 (PVFS2)," July 2, 2008; http://www.pvfs.org.
[11]
C. F. S. Inc., "Lustre: A Scalable, High-Performance File System," in http://www.lustre.org/docs/whitepaper.pdf 2006.
[12]
"IOZone," July 2, 2008; http://www.iozone.org.
[13]
"Transaction Processing Performance," July 2, 2008; http://www.tpc.org.
[14]
"Borealis Distributed Stream Processing Engine," July 2, 2008; http://www.cs.brown.edu/research/borealis/public/.
[15]
"Linear Road Benchmark," July 2, 2008; http://www.cs.brandeis.edu/%7Elinearroad/.

Cited By

View all
  • (2010)Work in progress — Integration of the scientific workflow paradigm into high performance computing and large scale data management curricula2010 IEEE Frontiers in Education Conference (FIE)10.1109/FIE.2010.5673235(F3F-1-F3F-2)Online publication date: Oct-2010

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGITE '08: Proceedings of the 9th ACM SIGITE conference on Information technology education
October 2008
280 pages
ISBN:9781605583297
DOI:10.1145/1414558
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 October 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. curriculum development
  2. high performance computing
  3. parallel data systems

Qualifiers

  • Research-article

Conference

SIGITE08
Sponsor:

Acceptance Rates

Overall Acceptance Rate 176 of 429 submissions, 41%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2010)Work in progress — Integration of the scientific workflow paradigm into high performance computing and large scale data management curricula2010 IEEE Frontiers in Education Conference (FIE)10.1109/FIE.2010.5673235(F3F-1-F3F-2)Online publication date: Oct-2010

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media