Skip to main content

A Simple Concept for the Performance Analysis of Cluster-Computing

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7905))

Abstract

There seems to be a lack of reliable thumb rules to estimate the size and performance of clusters with respect to applications. Since modern cluster architecture is based on multi-cores we follow a concept derived by S. Williams et. al. for the analysis of such systems. The performance is described by the dimensionless speed-up in dependence on important hardware and application parameters. The hardware parameters are the number and the theoretical performance of each processing unit and the bandwidth of the network. The application parameters are the total number of operations performed on a number of bytes and the total number of bytes communicated between the processing units. In order to test our theoretical concept we apply our model to the scalar product of vectors, matrix multiplication, Linpack and the TOP500-list.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kruse, H.G.: Leistungsbewertung bei Computer-Systemen. Springer (2009)

    Google Scholar 

  2. Kredel, H., Kruse, H.-G., Richling, S.: Zur Leistung von verteilten, homogenen Clustern. PIK (2), 166–171 (2010); English summary, see section V in [3]

    Google Scholar 

  3. Richling, S., Hau, S., Kredel, H., Kruse, H.-G.: Operating Two InfiniBand Grid Clusters over 28 km Distance. In: Proc. 3PGCIC 2010. IEEE (2010)

    Google Scholar 

  4. Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multi-core architectures. Commun. ACM 52(4), 65–76 (2009)

    Article  Google Scholar 

  5. Amdahl, G.: Validity of the Single Processor Approach to Achieving Large-Scale Computing Capabilities. In: AFIPS Conference Proceedings, vol. 30, pp. 483–485 (1967)

    Google Scholar 

  6. Gustafson, J.: Reevaluating Amdahl’s law. Commun. ACM 31(5), 532–533 (1988)

    Article  Google Scholar 

  7. Hockney, R.W.: Parametrization of computer performance. Parallel Computing 5(1-2), 97–103 (1987)

    Article  Google Scholar 

  8. Hockney, R.W., Jesshope, C.R.: Parallel Computers 2: architecture, programming and algorithms. Adam Hilger, Bristol (1988)

    MATH  Google Scholar 

  9. Hockney, R.W.: Computational similarity. Concurrency – Practice and Experience 7(2), 147–166 (1995)

    Article  Google Scholar 

  10. Kredel, H., Kruse, H.-G., Richling, S.: Einige Überlegungen zur Leistung von Cluster-Computern. PIK (3), 207–211 (2012); For a partial English summary and extensions see section 3 in [11]

    Google Scholar 

  11. Kredel, H., Kruse, H.G., Richling, S., Strohmaier, E.: Performance Analysis and Prediction for distributed homogenous clusters. In: Computer Science – Research and Development, Special Issue ISC 2012, Hamburg (May 2012)

    Google Scholar 

  12. LinPack and HPL, Linear Algebra Package and High Performance LinPack, http://www.netlib.org/benchmark/hpl/ (accessed January 2012)

  13. Dongarra, J., et al.: ScaLAPack documentation, http://www.netlib.org/scalapack/slug/node112.html (accessed January 2012)

  14. Luszczek, P., Dongarra, J.: Reducing the Time to Tune Parallel Dense Linear Algebra Routines with Partial Execution and Performance Modeling. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2011, Part I. LNCS, vol. 7203, pp. 730–739. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  15. Richling, S., Hau, S., Kredel, H., Kruse, H.-G.: A Long-distance InfiniBand Interconnection between two Clusters in Production use. In: Proc. Supercomputing, November 12-18. IEEE (2011)

    Google Scholar 

  16. Numrich, R.W.: Computational Force: A Unifying Concept for Scalability Analysis. In: Proc PARCO, pp. 107–112 (2007)

    Google Scholar 

  17. Numrich, R.W.: A metric space for computer programs and the principle of computational least action. Journal of Supercomputing 43(3), 281–298 (2008)

    Article  Google Scholar 

  18. Numrich, R.W.: Computer performance analysis and the Pi Theorem. Comput. Sci. Res. Dev. (2010)

    Google Scholar 

  19. bwGRiD, Member of the German D-Grid initiative, funded by the Ministry of Education and Research and the Ministry for Science, Research and Arts Baden-Württemberg, Universities of Baden-Württemberg, 2007-2010, 2007-2012, http://www.bw-grid.de/ (accessed December 2012)

  20. Meuer, H., Strohmaier, E., Dongarra, J., Simon, H.: Top 500 Supercomputer Sites, http://top500.org/ (accessed November 2012)

  21. Hopper Interconnect, http://www.nersc.gov/users/computational-systems/hopper/configuration/interconnect/ (accessed May 2012)

  22. Uno, A.: K computer system overview, http://www.fujitsu.com/downloads/TC/sc11/k-computer-system-overview-sc11.pdf (accessed May 2012)

  23. Graph 500 Steering Committee, Benchmarks for data intensive supercomputer applications, http://www.graph500.org/

  24. Kredel, H., Kruse, H.-G., Ott, I.: Performance analysis and performance modeling of Web-applications. In: Proc. 3PGCIC 2011, pp. 115–122. IEEE (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kredel, H., Richling, S., Kruse, J.P., Strohmaier, E., Kruse, HG. (2013). A Simple Concept for the Performance Analysis of Cluster-Computing. In: Kunkel, J.M., Ludwig, T., Meuer, H.W. (eds) Supercomputing. ISC 2013. Lecture Notes in Computer Science, vol 7905. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38750-0_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38750-0_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38749-4

  • Online ISBN: 978-3-642-38750-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics