skip to main content
10.1145/2576768.2598351acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Complete action map or best action map in accuracy-based reinforcement learning classifier systems

Published:12 July 2014Publication History

ABSTRACT

We study two existing Learning Classifier Systems (LCSs): XCS, which has a complete map (which covers all actions in each state), and XCSAMm, which has a best action map (which covers only the highest-return action in each state). This allows XCSAM to learn with a smaller population size limit (but larger population size) and to learn faster than XCS on well-behaved tasks. However, many tasks have dif- ficulties like noise and class imbalances. XCS and XCSAM have not been compared on such problems before. This pa- per aims to discover which kind of map is more robust to these difficulties. We apply them to a classification problem (the multiplexer problem) with class imbalance, Gaussian noise or alternating noise (where we return the reward for a different action). We also compare them on real-world data from the UCI repository without adding noise. We analyze how XCSAM focuses on the best action map and introduce a novel deletion mechanism that helps to evolve classifiers towards a best action map. Results show the best action map is more robust (has higher accuracy and sometimes learns faster) in all cases except small amounts of alternat- ing noise.

References

  1. E. Bernado-mansilla and J. M.Garrell-Guiu. Accuracy-based Learning Classifier Systems: Models, Analysis and Applications to Classification Tasks. Evolutionary Computation, 11:209--238, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Larry Bull, Ester Bernado-Mansilla, and John H. Holmes, editors. Learning Classifier Systems in Data Mining, volume 125 of Studies in Computational Intelligence. Springer, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. V. Butz and S. W. Wilson. An algorithmic description of XCS. Journal of Soft Computing, 6(3--4):144--153, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  4. M. V. Butz, D. E. Goldberg, and P. L. Lanzi. Gradient descent methods in learning classifier systems: Improving XCS performance in multistep problems. IEEE Trans. on Evol. Comp., 9(5):452--473, October 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. V. Butz, P. L. Lanzi, and S. W. Wilson. Function approximation with XCS: Hyperellipsoidal conditions, recursive least squares, and compaction. IEEE Trans. on Evol. Comp., 12(3):355--376, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. E. Goldberg. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison Wesley, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. H. Holland. Escaping Brittleness: The Possibilities of General Purpose Learning Algorithms Applied to Parallel Rule-based system. Machine Learning, 2:593--623, 1986.Google ScholarGoogle Scholar
  8. F. Kharbat, L. Bull, and M. Odeh. Mining breast cancer data with XCS. Proc. of the Genetic and Evol. Comp. Conf. (GECCO2007), pages 2066--2073, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Kovacs. Deletion schemes for classifier systems. Proc. of the Genetic and Evol. Comp. Conf. (GECCO1999), pages 329--336, 1999.Google ScholarGoogle Scholar
  10. Pier Luca Lanzi. Learning classifier systems from a reinforcement learning perspective. Soft Computing - A Fusion of Foundations, Methodologies and Applications, 6(3):162--170, 2002.Google ScholarGoogle Scholar
  11. M. Nakata, P. L. Lanzi, and K. Takadama. Enhancing Learning Capabilities by XCS with Best Action Mapping. In PPSN XII, volume 7491 of LNCS, pages 256--265. Springer, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Nakata, P. L. Lanzi, and K. Takadama. XCS with Adaptive Action Mapping. In SEAL2012, volume 7673 of LNCS, pages 138--147. Springer, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Masaya Nakata, Pier Luca Lanzi, and Keiki Takadama. Selection strategy for XCS with adaptive action mapping. In GECCO2013, p. 1085--1092. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Orriols and E. Bernado-Mansilla. Class imbalance problem in UCS classifier system: fitness adaptation. In CEC2005, vol. 1, p. 604--611. IEEE, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  15. A. Orriols-Puig and E. Bernadó-Mansilla. Bounding XCS's parameters for unbalanced datasets. In GECCO2006, pages 1561--1568. ACM, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. UCIrvine Machine Learning Repository. http://archive.ics.uci.edu/ml/.Google ScholarGoogle Scholar
  17. C. Stone and L. Bull. Comparing XCS and ZCS on noisy continuous valued environments. In ETechnical Report. UWE, 2005.Google ScholarGoogle Scholar
  18. R. S. Sutton and A. G. Barto. Reinforcement Learning -- An Introduction. MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Fani A. Tzima and Pericles A. Mitkas. ZCS Revisited: Zeroth-level Classifier Systems for Data Mining. ICDM Workshops, pages 700--709, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. W. Wilson. ZCS: A Zeroth Level Classifier System. Evolutionary Computation, 2(1):1--18, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. W. Wilson. Classifier fitness based on accuracy. Evolutionary Computation, 3(2):149--175, June 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Complete action map or best action map in accuracy-based reinforcement learning classifier systems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '14: Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation
      July 2014
      1478 pages
      ISBN:9781450326629
      DOI:10.1145/2576768

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 July 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      GECCO '14 Paper Acceptance Rate180of544submissions,33%Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader