ABSTRACT
High-end multicore processors can incur high power density, with significant spatial and temporal variability in how workloads use on-chip resources. This leads to power and temperature hotspots, which in turn may cause non-uniform ageing and accelerated chip failure. These drawbacks can be mitigated by tuning system performance online and by adopting closed-loop thermal and reliability management policies. Such policies cannot be developed and evaluated solely on real hardware, because of its observability and flexibility limitations, nor by relying only on trace-driven simulation, because power, thermal effects, reliability and performance depend on one another. We present a complete virtual platform to develop, simulate and evaluate power, temperature and reliability management control strategies for high-performance multicores. The accuracy and effectiveness of our solution are ensured by integrating an established system simulator (Simics) with models for power consumption, temperature distribution and ageing. The models are based on characterization of real hardware. Control strategies are explored and designed in the MATLAB/Simulink framework, which allows the use of control theory tools. Fast prototyping is achieved through a dedicated interface between Simics and MATLAB/Simulink that enables co-simulation of hardware platforms and controllers.
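The closed-loop idea described above can be illustrated with a small sketch. It is not the authors' platform and uses neither Simics nor MATLAB/Simulink: a single-node RC thermal model stands in for the simulated hardware, and a hysteresis rule stands in for the controller. All constants, the cubic power model, and the names `power_w`, `thermal_step`, `controller` and `run_cosim` are assumptions made purely for illustration.

```python
"""Toy closed-loop thermal management loop (illustrative assumptions only)."""

AMBIENT_C = 45.0      # assumed ambient temperature (deg C)
R_TH = 1.0            # assumed thermal resistance (K/W)
C_TH = 0.05           # assumed thermal capacitance (J/K)
DT_S = 0.001          # plant time step (s)
CONTROL_PERIOD = 10   # controller is invoked once every 10 plant steps
T_LIMIT_C = 80.0      # assumed temperature limit (deg C)
HYSTERESIS_C = 5.0    # speed up again only below T_LIMIT_C - HYSTERESIS_C
# Hypothetical frequency levels from 1.0 GHz to 3.0 GHz in 0.2 GHz steps.
FREQ_LEVELS_GHZ = [round(1.0 + 0.2 * i, 1) for i in range(11)]


def power_w(freq_ghz, utilization=1.0):
    """Toy power model: constant static part plus a cubic dynamic part."""
    return 5.0 + 2.0 * utilization * freq_ghz ** 3


def thermal_step(temp_c, power):
    """Advance the RC thermal model by one forward-Euler step."""
    heat_out = (temp_c - AMBIENT_C) / R_TH
    return temp_c + DT_S / C_TH * (power - heat_out)


def controller(temp_c, level):
    """Hysteresis policy: step down when too hot, step up when cool."""
    if temp_c > T_LIMIT_C and level > 0:
        return level - 1
    if temp_c < T_LIMIT_C - HYSTERESIS_C and level < len(FREQ_LEVELS_GHZ) - 1:
        return level + 1
    return level


def run_cosim(steps=1000, controlled=True):
    """Alternate plant and controller; return (peak temp, final frequency)."""
    temp_c = AMBIENT_C
    level = len(FREQ_LEVELS_GHZ) - 1   # start at the highest frequency
    peak_c = temp_c
    for step in range(steps):
        # Synchronization point: the controller observes the plant state.
        if controlled and step % CONTROL_PERIOD == 0:
            level = controller(temp_c, level)
        temp_c = thermal_step(temp_c, power_w(FREQ_LEVELS_GHZ[level]))
        peak_c = max(peak_c, temp_c)
    return peak_c, FREQ_LEVELS_GHZ[level]


if __name__ == "__main__":
    for mode in (False, True):
        peak, freq = run_cosim(controlled=mode)
        print(f"controlled={mode}: peak={peak:.1f} C, final freq={freq} GHz")
```

The loop alternates between the plant and the controller at a fixed synchronization period, which is the general pattern a co-simulation interface has to provide. In the sketch, performance (frequency) sets power, power drives temperature, and temperature feeds back into the frequency decision, so the quantities cannot be evaluated from a fixed trace.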
Index Terms
- A virtual platform environment for exploring power, thermal and reliability management control strategies in high-performance multicores