Skip to main content

Modeling a Leadership-Scale Storage System

  • Conference paper
Parallel Processing and Applied Mathematics (PPAM 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7203))

Abstract

Exascale supercomputers will have the potential for billion-way parallelism. While physical implementations of these systems are currently not available, HPC system designers can develop models of exascale systems to evaluate system design points. Modeling these systems and associated subsystems is a significant challenge. In this paper, we present the Co-design of Exascale Storage System (CODES) framework for evaluating exascale storage system design points. As part of our early work with CODES, we discuss the use of the CODES framework to simulate leadership-scale storage systems in a tractable amount of time using parallel discrete-event simulation. We describe the current storage system models and protocols included with the CODES framework and demonstrate the use of CODES through simulations of an existing petascale storage system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Overview of the IBM Blue Gene/P project. IBM Journal of Research and Development, 52(1.2), 199–220 (January 2008)

    Google Scholar 

  2. IOR benchmark (October 2011)

    Google Scholar 

  3. Ang, J., Doerfler, D., Dosanjh, S., Koch, K., Morrison, J., Vigil, M.: The alliance for computing at the extreme scale. In: Proceedings of the Cray Users Group Meeting (2010)

    Google Scholar 

  4. Bauer, D.W., Carothers, C.D., Holder, A.: Scalable time warp on Blue Gene supercomputers. In: Proc. ACM/IEEE Workshop on Principles of Advanced and Distributed Simulation (PADS 2009), Lake Placid, NY (2009)

    Google Scholar 

  5. Carns, P.H., Ligon III, W.B., Ross, R.B., Thakur, R.: PVFS: A Parallel File System for Linux Clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, pp. 317–327 (2000)

    Google Scholar 

  6. Lang, S., Carns, P., Latham, R., Ross, R., Harms, K., Allcock, W.: I/O performance challenges at leadership scale. In: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, p. 40. ACM (2009)

    Google Scholar 

  7. Liu, N., Carothers, C.D.: Modeling billion-node torus networks using massively parallel discrete-event simulation. In: Proceedings of the Workshop on Principles of Advanced and Distributed Simulation (PADS), pp. 1–8. IEEE, France (2011)

    Chapter  Google Scholar 

  8. Liu, Y., Figueiredo, R., Clavijo, D., Xu, Y., Zhao, M.: Towards simulation of parallel file system scheduling algorithms with PFSsim. In: Proceedings of the 7th IEEE International Workshop on Storage Network Architectures and Parallel I/O (May 2011)

    Google Scholar 

  9. Molina-Estolano, E., Maltzahn, C., Bent, J., Brandt, S.A.: Building a parallel file system simulator. Journal of Physics: Conference Series 180, 012050 (2009)

    Article  Google Scholar 

  10. Moreira, J., Brutman, M., Castaños, J., Engelsiepen, T., Giampapa, M., Gooding, T., Haskin, R., Inglett, T., Lieber, D., McCarthy, P., Mundy, M., Parker, J., Wallenfelt, B.: Designing a highly-scalable operating system: the blue gene/l story. In: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC 2006. ACM, New York (2006)

    Google Scholar 

  11. Nguyen, H.Q.: File system simulation: Hierachical performance measurement and modeling. PhD thesis, University of Arkansas (2011)

    Google Scholar 

  12. Nguyen, H.Q., Apon, A.W.: Hierarchical performance measurement and modeling of the linux file system. In: ICPE, pp. 73–84 (2011)

    Google Scholar 

  13. Perumalla, K.S.: μπ: a scalable and transparent system for simulating MPI programs. In: Proceedings of the 3rd International ICST Conference on Simulation Tools and Techniques, SIMUTools 2010, pp. 62:1–62:6. ICST, Brussels (2010)

    Google Scholar 

  14. Rodrigues, A.F., Hemmert, K.S., Barrett, B.W., Kersey, C., Oldfield, R., Weston, M., Risen, R., Cook, J., Rosenfeld, P., CooperBalls, E., Jacob, B.: The structural simulation toolkit. SIGMETRICS Perform. Eval. Rev. 38, 37–42 (2011)

    Article  Google Scholar 

  15. Schmuck, F., Haskin, R.: Gpfs: A shared-disk file system for large computing clusters. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies (2002)

    Google Scholar 

  16. Settlemyer, B.W.: A Study of Client-side Caching in Parallel File Systems. PhD thesis, Clemson University, Clemson, South Carolina, USA (2009)

    Google Scholar 

  17. Shipman, G., Dillow, D., Oral, S., Wang, F.: The spider center wide file system: From concept to reality. In: Proceedings, Cray User Group (CUG) Conference, Atlanta, GA (2009)

    Google Scholar 

  18. Yaun, G., Carothers, C.D., Kalyanaraman, S.: Large-scale TCP models using optimistic parallel simulation. In: Proceedings of the Seventeenth Workshop on Parallel and Distributed Simulation (PADS 2003), San Diego, CA (June 2003)

    Google Scholar 

  19. Yaun, G.R., Bauer, D.W., Bhutada, H.L., Carothers, C.D., Yuksel, M., Kalyanaraman, S.: Largescale network simulation techniques: Examples of TCP and OSPF models. SIGCOMM Computer Comunications Review Special Issue on Tools and Technologies for Research and Eduction 33(5), 27–41 (2004)

    Google Scholar 

  20. Zheng, G., Gupta, G., Bohm, E., Dooley, I., Kale, L.V.: Simulating Large Scale Parallel Applications using Statistical Models for Sequential Execution Blocks. In: Proceedings of the 16th International Conference on Parallel and Distributed Systems (ICPADS 2010), Shanghai, China, pp. 10–15 (December 2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, N. et al. (2012). Modeling a Leadership-Scale Storage System. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2011. Lecture Notes in Computer Science, vol 7203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31464-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31464-3_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31463-6

  • Online ISBN: 978-3-642-31464-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics