research-article

Automatic adjustment of selection pressure based on range of reward in learning classifier system

Authors:
Takato Tatsumi

The University of Electro-Communications, Tokyo, Japan

The University of Electro-Communications, Tokyo, Japan
View Profile

,
Hiroyuki Sato

The University of Electro-Communications, Tokyo, Japan

The University of Electro-Communications, Tokyo, Japan
View Profile

,
Keiki Takadama

The University of Electro-Communications, Tokyo, Japan

The University of Electro-Communications, Tokyo, Japan
View Profile

GECCO '17: Proceedings of the Genetic and Evolutionary Computation ConferenceJuly 2017Pages 505–512https://doi.org/10.1145/3071178.3080531

Published:01 July 2017Publication History

GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference

Pages 505–512

ABSTRACT

XCS (Accuracy-based learning classifier system) can acquire accurate classifiers on the basis of consistent reward, but it does not always receive the consistent reward in real world problems even if it provides the same output for the same input. Such a situation prevents XCS from reducing the number of overspecific accurate classifiers by the subsumption mechanism. This means that XCS finds it hard to acquire the optimal classifiers. For this issue, our previous research proposed XCS-MR (XCS based on Mean of Reward) which can reduce the number of classifiers even in the environments where the size of the rewards is uncertain. However, XCS-MR requires a large amount of learning data to correctly determine the accuracy of classifiers because XCS-MR needs to record the average and variance of the rewards in all input-output space. To overcome this problem, this paper proposes a new XCS that can reduce the number of the classifiers even in the uncertain reward environments without recording the average and variance of the rewards in all input-output space. This paper shows the effectiveness of the proposed XCS through the experiments.

References

M. V. Butz, T. Kovacs, P. L. Lanzi, and S. W. Wilson. 2004. Toward a Theory of Generalization and Learning in XCS. Evolutionary Computation, IEEE Transactions on 8, 1 (2004), 28--46. Google ScholarDigital Library
M. V. Butz and O. Sigaud. 2012. XCSF with Local Deletion: Preventing Detrimental Forgetting. Evolutionary Intelligence 5, 2 (2012), 117--127.Google ScholarCross Ref
D. E. Goldberg. 1989. Genetic Algorithms in Search, Optimization and Machine Learning (1st ed.). Addison-Wesley Longman Publishing Co., Inc. Google ScholarDigital Library
J. H. Holland. 1986. Escaping Brittleness: The Possibilities of General-Purpose Learning Algorithms Applied to Parallel Rule-Based Systems. Machine learning (1986), 593--623.Google Scholar
P. L. Lanzi. 1999. An Analysis of Generalization in the XCS Classifier System. Evolutionary Computation Journal 7, 2 (1999), 125--149. Google ScholarDigital Library
P. L. Lanzi and M. Colombetti. 1999. An Extension to the XCS Classifier System for Stochastic Environments. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-99). 353--360. Google ScholarDigital Library
P. L. Lanzi and S. W. Wilson. 2000. Toward Optimal Classifier System Performance in Non-Markov Environments. Evol. Comput. 8, 4 (Dec. 2000), 393--418. Google ScholarDigital Library
R. S. Sutton. 1988. Learning to Predict by the Methods of Temporal Differences. Machine Learning 3, 1 (1988), 9--44. Google ScholarDigital Library
H. Takagi. 2001. Interactive evolutionary computation: Fusion of the capabilities of EC optimization and human evaluation. Proc. IEEE 89, 9 (2001), 1275--1296.Google ScholarCross Ref
T. Tatsumi, T. Komine, M. Nakata, H. Sato, T. Kovacs, and K. Takadama. 2016. Variance-based Learning Classifier System without Convergence of Reward Estimation. In Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion (GECCO '16 Companion). ACM, 67--68. Google ScholarDigital Library
A. Webb, E. Hart, P. Ross, and A. Lawson. 2003. Controlling a Simulated Khepera with an XCS Classifier System with Memory. Springer Berlin Heidelberg, Berlin, Heidelberg, 885--892.Google Scholar
S. W. Wilson. 1995. Classifier Fitness Based on Accuracy. Evol. Comput. 3, 2 (June 1995), 149--175. Google ScholarDigital Library
S. W. Wilson. 2000. Get Real! XCS with Continuous-Valued Inputs. Springer Berlin Heidelberg, 209--219. Google ScholarDigital Library

Index Terms

Automatic adjustment of selection pressure based on range of reward in learning classifier system
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Rule learning

Recommendations

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise
GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion

Accuracy based Learning Classifier System (XCS) prefers to generalize the classifiers that always acquire the same reward, because they make accurate reward predictions. However, real-world problems have noise, which means that classifiers may not ...
Read More
Learning classifier system with average reward reinforcement learning

In the family of Learning Classifier Systems, the classifier system XCS is most widely used and investigated. However, the standard XCS has difficulties solving large multi-step problems, where long action chains are needed to get delayed rewards. Up to ...
Read More
Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Learning classifier systems (LCSs), an established evolutionary computation technique, are over 30 years old with much empirical testing and foundations of theoretical understanding. XCS is a well-tested LCS model that generates optimal (i.e., maximally ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference
July 2017
1427 pages
ISBN:9781450349208
DOI:10.1145/3071178
General Chair:
Peter A. N. Bosman
Centrum Wiskunde & Informatica (CWI)
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 July 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
XCS
accuracy criteria
reward
sample standard deviation
Qualifiers
- research-article
Conference

Acceptance Rates
GECCO '17 Paper Acceptance Rate178of462submissions,39%Overall Acceptance Rate1,669of4,410submissions,38%
More
Upcoming Conference
GECCO '24

Sponsor:

sigevo

Genetic and Evolutionary Computation Conference

July 14 - 18, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 100
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Automatic adjustment of selection pressure based on range of reward in learning classifier system

GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise

Learning classifier system with average reward reinforcement learning

Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Automatic adjustment of selection pressure based on range of reward in learning classifier system

GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise

Learning classifier system with average reward reinforcement learning

Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media