research-article

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise

Authors:
Takato Tatsumi

The University of Electro-Communications, Japan

The University of Electro-Communications, Japan
View Profile

,
Tim Kovacs

University of Bristol, United Kingdom

University of Bristol, United Kingdom
View Profile

,
Keiki Takadama

The University of Electro-Communications, Japan

The University of Electro-Communications, Japan
View Profile

GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference CompanionJuly 2018Pages 1457–1464https://doi.org/10.1145/3205651.3208271

Published:06 July 2018Publication History

GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion

Pages 1457–1464

ABSTRACT

Accuracy based Learning Classifier System (XCS) prefers to generalize the classifiers that always acquire the same reward, because they make accurate reward predictions. However, real-world problems have noise, which means that classifiers may not receive the same reward even if they always take the correct action. For this case, since all classifiers acquire multiple values as the reward, XCS cannot identify accurate classifiers. In this paper, we study a single step environment with action noise, where XCS's action is sometimes changed at random. To overcome this problem, this paper proposes XCS based on Collective weighted Reward (XCS-CR) to identify the accurate classifiers. In XCS each rule predicts its next reward by averaging its past rewards. Instead, XCS-CR predicts its next reward by selecting a reward from the set of past rewards, by comparing the past rewards to the collective weighted average reward of the rules matching the current input for each action. This comparison helps XCS-CR identify rewards that result from action noise. In experiments, XCS-CR acquired the optimal generalized classifier subset in 6-Multiplexer problems with action noise, similar to the environment without noise, and judged those optimal generalized classifiers correctly accurate.

References

M. V. Butz, T. Kovacs, P. L. Lanzi, and S. W. Wilson. 2004. Toward a Theory of Generalization and Learning in XCS. Evolutionary Computation, IEEE Transactions on 8, 1 (2004), 28--46. Google ScholarDigital Library
M. V. Butz and S. W. Wilson. 2002. An algorithmic description of XCS. Soft Computing 6, 3--4 (2002), 144--153.Google ScholarCross Ref
D. E. Goldberg. 1989. Genetic Algorithms in Search, Optimization and Machine Learning (1st ed.). Addison-Wesley Longman Publishing Co., Inc. Google ScholarDigital Library
J. H. Holland. 1986. Escaping Brittleness: The Possibilities of General-Purpose Learning Algorithms Applied to Parallel Rule-Based Systems. Machine learning (1986), 593--623.Google Scholar
P. L. Lanzi. 1999. An Analysis of Generalization in the XCS Classifier System. Evolutionary Computation Journal 7, 2 (1999), 125--149. Google ScholarDigital Library
P. L. Lanzi and M. Colombetti. 1999. An Extension to the XCS Classifier System for Stochastic Environments. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-99). 353--360. Google ScholarDigital Library
P. L. Lanzi and S. W. Wilson. 2000. Toward Optimal Classifier System Performance in Non-Markov Environments. Evol. Comput. 8, 4 (Dec. 2000), 393--418. Google ScholarDigital Library
R. S. Sutton. 1988. Learning to Predict by the Methods of Temporal Differences. Machine Learning 3, 1 (1988), 9--44. Google ScholarDigital Library
H. Takagi. 2001. Interactive evolutionary computation: Fusion of the capabilities of EC optimization and human evaluation. Proc. IEEE 89, 9 (2001), 1275--1296.Google ScholarCross Ref
T. Tatsumi, T. Komine, M. Nakata, H. Sato, T. Kovacs, and K. Takadama. 2016. Variance-based Learning Classifier System without Convergence of Reward Estimation. In Proceedings of the 2016 on Genetic and Evolutionary Computation Conference Companion (GECCO '16 Companion). ACM, 67--68. Google ScholarDigital Library
A. Webb, E. Hart, P. Ross, and A. Lawson. 2003. Controlling a Simulated Khepera with an XCS Classifier System with Memory. Springer Berlin Heidelberg, Berlin, Heidelberg, 885--892.Google Scholar
S. W. Wilson. 1995. Classifier Fitness Based on Accuracy. Evol. Comput. 3, 2 (June 1995), 149--175. Google ScholarDigital Library
S. W. Wilson. 2000. Get Real! XCS with Continuous-Valued Inputs. Springer Berlin Heidelberg, 209--219. Google ScholarDigital Library

Index Terms

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Rule learning

Recommendations

XCS-CR for handling input, output, and reward noise
GECCO '19: Proceedings of the Genetic and Evolutionary Computation Conference Companion

To briefly represent a dataset, it is crucial to find common attributes among the data. Extended learning classifier system (XCS) finds common attributes of multiple data and acquires generalized rules that match multiple data. In real-world problems, ...
Read More
Automatic adjustment of selection pressure based on range of reward in learning classifier system
GECCO '17: Proceedings of the Genetic and Evolutionary Computation Conference

XCS (Accuracy-based learning classifier system) can acquire accurate classifiers on the basis of consistent reward, but it does not always receive the consistent reward in real world problems even if it provides the same output for the same input. Such ...
Read More
Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Learning classifier systems (LCSs), an established evolutionary computation technique, are over 30 years old with much empirical testing and foundations of theoretical understanding. XCS is a well-tested LCS model that generates optimal (i.e., maximally ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion
July 2018
1968 pages
ISBN:9781450357647
DOI:10.1145/3205651
Editor:
Hernan Aguirre
Shinshu University
,
General Chair:
Keiki Takadama
The University of Electro-Communications
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 July 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
XCS
accuracy criteria
alternative noise
reward
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,669of4,410submissions,38%
Upcoming Conference
GECCO '24

Sponsor:

sigevo

Genetic and Evolutionary Computation Conference

July 14 - 18, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 70
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise

GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion

ABSTRACT

References

Cited By

Index Terms

Recommendations

XCS-CR for handling input, output, and reward noise

Automatic adjustment of selection pressure based on range of reward in learning classifier system

Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

XCS-CR: determining accuracy of classifier by its collective reward in action set toward environment with action noise

GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion

ABSTRACT

References

Cited By

Index Terms

Recommendations

XCS-CR for handling input, output, and reward noise

Automatic adjustment of selection pressure based on range of reward in learning classifier system

Improving genetic search in XCS-based classifier systems through understanding the evolvability of classifier rules

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media