Abstract
There seems to be a lack of reliable thumb rules to estimate the size and performance of clusters with respect to applications. Since modern cluster architecture is based on multi-cores we follow a concept derived by S. Williams et. al. for the analysis of such systems. The performance is described by the dimensionless speed-up in dependence on important hardware and application parameters. The hardware parameters are the number and the theoretical performance of each processing unit and the bandwidth of the network. The application parameters are the total number of operations performed on a number of bytes and the total number of bytes communicated between the processing units. In order to test our theoretical concept we apply our model to the scalar product of vectors, matrix multiplication, Linpack and the TOP500-list.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kruse, H.G.: Leistungsbewertung bei Computer-Systemen. Springer (2009)
Kredel, H., Kruse, H.-G., Richling, S.: Zur Leistung von verteilten, homogenen Clustern. PIK (2), 166–171 (2010); English summary, see section V in [3]
Richling, S., Hau, S., Kredel, H., Kruse, H.-G.: Operating Two InfiniBand Grid Clusters over 28 km Distance. In: Proc. 3PGCIC 2010. IEEE (2010)
Williams, S., Waterman, A., Patterson, D.: Roofline: an insightful visual performance model for multi-core architectures. Commun. ACM 52(4), 65–76 (2009)
Amdahl, G.: Validity of the Single Processor Approach to Achieving Large-Scale Computing Capabilities. In: AFIPS Conference Proceedings, vol. 30, pp. 483–485 (1967)
Gustafson, J.: Reevaluating Amdahl’s law. Commun. ACM 31(5), 532–533 (1988)
Hockney, R.W.: Parametrization of computer performance. Parallel Computing 5(1-2), 97–103 (1987)
Hockney, R.W., Jesshope, C.R.: Parallel Computers 2: architecture, programming and algorithms. Adam Hilger, Bristol (1988)
Hockney, R.W.: Computational similarity. Concurrency – Practice and Experience 7(2), 147–166 (1995)
Kredel, H., Kruse, H.-G., Richling, S.: Einige Überlegungen zur Leistung von Cluster-Computern. PIK (3), 207–211 (2012); For a partial English summary and extensions see section 3 in [11]
Kredel, H., Kruse, H.G., Richling, S., Strohmaier, E.: Performance Analysis and Prediction for distributed homogenous clusters. In: Computer Science – Research and Development, Special Issue ISC 2012, Hamburg (May 2012)
LinPack and HPL, Linear Algebra Package and High Performance LinPack, http://www.netlib.org/benchmark/hpl/ (accessed January 2012)
Dongarra, J., et al.: ScaLAPack documentation, http://www.netlib.org/scalapack/slug/node112.html (accessed January 2012)
Luszczek, P., Dongarra, J.: Reducing the Time to Tune Parallel Dense Linear Algebra Routines with Partial Execution and Performance Modeling. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds.) PPAM 2011, Part I. LNCS, vol. 7203, pp. 730–739. Springer, Heidelberg (2012)
Richling, S., Hau, S., Kredel, H., Kruse, H.-G.: A Long-distance InfiniBand Interconnection between two Clusters in Production use. In: Proc. Supercomputing, November 12-18. IEEE (2011)
Numrich, R.W.: Computational Force: A Unifying Concept for Scalability Analysis. In: Proc PARCO, pp. 107–112 (2007)
Numrich, R.W.: A metric space for computer programs and the principle of computational least action. Journal of Supercomputing 43(3), 281–298 (2008)
Numrich, R.W.: Computer performance analysis and the Pi Theorem. Comput. Sci. Res. Dev. (2010)
bwGRiD, Member of the German D-Grid initiative, funded by the Ministry of Education and Research and the Ministry for Science, Research and Arts Baden-Württemberg, Universities of Baden-Württemberg, 2007-2010, 2007-2012, http://www.bw-grid.de/ (accessed December 2012)
Meuer, H., Strohmaier, E., Dongarra, J., Simon, H.: Top 500 Supercomputer Sites, http://top500.org/ (accessed November 2012)
Hopper Interconnect, http://www.nersc.gov/users/computational-systems/hopper/configuration/interconnect/ (accessed May 2012)
Uno, A.: K computer system overview, http://www.fujitsu.com/downloads/TC/sc11/k-computer-system-overview-sc11.pdf (accessed May 2012)
Graph 500 Steering Committee, Benchmarks for data intensive supercomputer applications, http://www.graph500.org/
Kredel, H., Kruse, H.-G., Ott, I.: Performance analysis and performance modeling of Web-applications. In: Proc. 3PGCIC 2011, pp. 115–122. IEEE (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kredel, H., Richling, S., Kruse, J.P., Strohmaier, E., Kruse, HG. (2013). A Simple Concept for the Performance Analysis of Cluster-Computing. In: Kunkel, J.M., Ludwig, T., Meuer, H.W. (eds) Supercomputing. ISC 2013. Lecture Notes in Computer Science, vol 7905. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38750-0_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-38750-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38749-4
Online ISBN: 978-3-642-38750-0
eBook Packages: Computer ScienceComputer Science (R0)