ABSTRACT
Learning classifier tables (LCTs) are lightweight, classifier-based, hardware-implemented reinforcement learning (RL) building blocks that enable self-adaptivity and self-optimization in multicore systems. LCTs are deployed per core to learn and optimize potentially conflicting objectives under constraints. Experience replay (ER) is a replay-memory technique in RL in which an agent's experiences are stored in a buffer and reused to improve the learning process. Implementing an ER buffer in hardware requires dedicated memory and is therefore expensive. We introduce LCT-DER: an LCT with dynamic-sized experience replay, in which the classifier population and the stored experiences share the same memory by exploiting the concept of macro-classifiers. Performing DVFS, LCT-DER achieves 44.5% fewer power-budget overshoots and a 4.5% lower IPS difference compared to a standard LCT, without requiring additional memory.
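The core idea of the abstract (classifier population and ER entries sharing one fixed-size memory, with macro-classifiers freeing slots by merging duplicates) can be illustrated with a minimal software sketch. This is an assumption-laden illustration, not the paper's hardware design: the class and method names (`SharedMemoryLCT`, `insert_classifier`, `store_experience`) and the eviction policy are hypothetical.

```python
import random

class MacroClassifier:
    """A (condition, action) rule; numerosity counts merged identical micro-classifiers."""
    def __init__(self, condition, action):
        self.condition = condition
        self.action = action
        self.numerosity = 1

class SharedMemoryLCT:
    """Illustrative sketch of LCT-DER's shared memory: the classifier population
    and the experience-replay entries occupy the same fixed-size table, so the
    ER buffer grows or shrinks with whatever slots the population leaves free."""
    def __init__(self, capacity):
        self.capacity = capacity          # total shared memory slots
        self.population = []              # macro-classifiers
        self.experiences = []             # (state, action, reward, next_state)

    def used_slots(self):
        return len(self.population) + len(self.experiences)

    def insert_classifier(self, condition, action):
        for cl in self.population:
            if cl.condition == condition and cl.action == action:
                cl.numerosity += 1        # macro-classifier absorbs the duplicate: no new slot
                return
        if self.used_slots() < self.capacity:
            self.population.append(MacroClassifier(condition, action))
        # else: a deletion/eviction policy would run here (omitted)

    def store_experience(self, exp):
        if self.used_slots() >= self.capacity and self.experiences:
            self.experiences.pop(0)       # shared memory full: evict the oldest experience
        if self.used_slots() < self.capacity:
            self.experiences.append(exp)

    def sample(self, k=1):
        """Draw a mini-batch of past experiences for replay-based updates."""
        return random.sample(self.experiences, min(k, len(self.experiences)))
```

Merging duplicates into one macro-classifier is what makes the replay buffer "dynamic-sized": every absorbed duplicate is a table slot that an experience tuple can occupy instead.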
Index Terms
- LCT-DER: Learning Classifier Table with Dynamic-Sized Experience Replay for Run-time SoC Performance-Power Optimization