Conferences >2024 Design, Automation & Tes...

Multi-Agent Reinforcement Learning for Thermally-Restricted Performance Optimization on Manycores

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The problem of performance maximization under a thermal constraint has been tackled by means of dynamic voltage and frequency scaling (DVFS) in many system-level optimiza...Show More

Metadata

Abstract:

The problem of performance maximization under a thermal constraint has been tackled by means of dynamic voltage and frequency scaling (DVFS) in many system-level optimization techniques. State-of-the-art ones have exploited Su-pervised Learning (SL) to develop models that predict power and performance characteristics of applications and temperature of the cores. Such predictions enable proactive and efficient optimization decisions that exploit performance potentials under a temperature constraint. SL- based models are built at design time based on training data generated considering specific environment settings, i.e., processor architecture, cooling system, ambient temperature, etc. Hence, these models cannot adapt at runtime to different environment settings. In contrast, Reinforcement Learning (RL) employs an agent that explores and learns the environment at runtime, and hence can adapt to its potential changes. Nonetheless, using an RL agent to perform optimization on manycores is challenging because of the inherent large state/action spaces that might hinder the agent's ability to converge. To get the advantages of RL while tackling this challenge, we employ for the first time multi -agent RL to perform thermally-restricted performance optimization for manycores through DVFS. We investigated two RL algorithms-Table-based Q-Learning (TQL) and Deep Q-Learning (DQL)-and demonstrated that the latter outperforms the former. Compared to the state of the art, our DQL delivers a significant performance improvement of 34.96% on average, while also guaranteeing thermally -safe operation on the manycore. Our evaluation reveals the runtime adaptability of our DQL to varying workloads and ambient temperatures.

Published in: 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Date of Conference: 25-27 March 2024

Date Added to IEEE Xplore: 10 June 2024

ISBN Information:

ISSN Information:

DOI: 10.23919/DATE58400.2024.10546574

Conference Location: Valencia, Spain

Contents

References is not available for this document.

Multi-Agent Reinforcement Learning for Thermally-Restricted Performance Optimization on Manycores

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Multi-Agent Reinforcement Learning for Thermally-Restricted Performance Optimization on Manycores

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Keywords

Metrics

References

IEEE Account

Purchase Details

Profile Information

Need Help?