Abstract
We discuss how power management development in multi-core processors to achieve higher performance using automatic frequency scaling can cause artifacts when doing performance comparisons and give pessimistic efficiency estimates for algorithms. Overclocking also causes underestimates of the theoretical peak performance of the CPU as can be seen in some cases on the TOP500 list. We show that overclocking capabilities, when available, must be taken into account in thread scheduling for better overall performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
AMD. AMD PhenomTM II Key Architectural Features, http://www.amd.com/us/products/desktop/processors/phenom-ii/Pages/phenom-ii-key-architectural-features.aspx (accessed February 16, 2012)
AMD. AMD PowerTune Technology. White Paper (2010)
Barker, K.J., Davis, K., Hoisie, A., Kerbyson, D.J., Lang, M., Pakin, S., Sancho, J.C.: A performance evaluation of the Nehalem quad-core processor for scientific computing. Parallel Processing Letters 18(4), 453–469 (2008)
Charles, J., Jassi, P., Ananth, N.S., Sadat, A., Fedorova, A.: Evaluation of the Intel® CoreTM i7 Turbo Boost feature. In: 2009 IEEE International Symposium on Workload Characterization (IISWC), pp. 188–197. IEEE (October 2009)
Goto, K., Van De Geijn, R.A.: Anatomy of high-performance matrix multiplication. ACM Trans. Math. Soft. 34(3), 12 (2008)
Goto, K., Van De Geijn, R.: High-performance implementation of the level-3 BLAS. ACM Transactions on Mathematical Software (TOMS) 35(1), 4 (2008)
Hess, B., Kutzner, C., van der Spoel, D., Lindahl, E.: Gromacs 4: Algorithms for highly efficient, load-balanced and scalable molecular simulation. Journal of Chemical Theory and Computation 4(3), 435–447 (2008)
Intel. Intel® Turbo Boost Technology in Intel® CoreTM Microarchitecture (Nehalem) Based Processors. White Paper (2008)
NVIDIA. Technology Overview - NVIDIA GeForce GTX 680. White Paper (2012)
Petitet, A., Whaley, C., Dongarra, J., Cleary, A.: HPL - A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Linde, M. (2013). Multi-core Scalability Measurements: Issues and Solutions. In: Manninen, P., Öster, P. (eds) Applied Parallel and Scientific Computing. PARA 2012. Lecture Notes in Computer Science, vol 7782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36803-5_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-36803-5_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36802-8
Online ISBN: 978-3-642-36803-5
eBook Packages: Computer ScienceComputer Science (R0)