Abstract
This paper focuses on the study of the behavior of a genetic algorithm based classifier system, the Adapted Pittsburgh Classifier System (A.P.C.S), on maze type environments containing aliasing squares. This type of environment is often used in reinforcement learning literature to assess the performances of learning methods when facing problems containing non markovian situations.
Through this study, we discuss on the performance of the APCS upon two mazes (Woods 101 and Maze E2) and also on the efficiency of an improvement of the APCS learning method inspired from the XCS: the covering mechanism. We manage to show that, without any memory mechanism, the APCS is able to build and to keep accurate strategies to produce regular sub-optimal solution to these maze problems. This statement is shown through a comparison between the results obtained by the XCS on two specific maze problems and those obtained by the APCS.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bacardit, J., Garrell-Guiu, J.M.: Bloat control and generalization pressure using the minimum description length principle for a pittsburgh approach learning classifier system. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 4399, pp. 59–79. Springer, Heidelberg (2007)
Bagnall, A.J., Zatuchna, Z.: On the classification of maze problems. In: Bull, L., Kovacs, T. (eds.) Applications of Learning Classifier Systems. Studies in Fuzziness and Soft Computing, vol. 183, pp. 307–316. Springer, Heidelberg (2005)
Bernadó-Mansilla, E., Llorà, X., Garrell-Guiu, J.M.: Xcs and gale: A comparative study of two learning classifier systems on data mining. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2001. LNCS (LNAI), vol. 2321, pp. 115–132. Springer, Heidelberg (2002)
Bull, L.: Lookahead and latent learning in ZCS. In: GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, New York, July 9-13, pp. 897–904. Morgan Kaufmann Publishers, San Francisco (2002)
Butz, M., Wilson, S.W.: An algorithmic description of XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 253–272. Springer, Heidelberg (2001)
Butz, M.V.: Documentation of XCS+TS c-code 1.2. IlliGAL Report 2003023, Illinois Genetic Algorithms Laboratory (October 2003)
De Jong, K.A., Spears, W.M., Gordon, D.F.: Using Genetic Algorithms for Concept Learning. Machine Learning 13(3), 161–188
Énée, G.: Systèmes de Classeurs et Communication dans les Systèmes Multi-Agents. PhD thesis, Ecole Doctorale de STIC, Université de Nice Sophia-Antipolis, (Janvier 2003)
Énée, G., Barbaroux, P.: Adapted pittsburgh-style classifier-system: Case-study. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003. LNCS (LNAI), vol. 2661, pp. 30–45. Springer, Heidelberg (2003)
Holmes, J.H., Lanzi, P.L., Stolzmann, W., Wilson, S.W.: Learning classifier systems: New models, successful applications. Inf. Process. Lett. 82(1), 23–30 (2002)
Lanzi, P.L.: Adding Memory to XCS. In: Proceedings of the IEEE Conference on Evolutionary Computation (ICEC 1998), IEEE Press, Los Alamitos (1998), http://ftp.elet.polimi.it/people/lanzi/icec98.ps.gz
Lanzi, P.L.: An analysis of the memory mechanism of XCSM. In: Proceedings of the Third Genetic Programming Conference, pp. 643–651. Morgan Kaufmann, San Francisco (1998), http://ftp.elet.polimi.it/people/lanzi/gp98.ps.gz
Lanzi, P.L., Wilson, S.W.: Optimal classifier system performance in non-markovian environments. Technical Report 99.36, Illinois Genetic Algorithms Laboratory, Milan, Italy (1999)
Sigaud, O.: Les systèmes de classeurs: un état de lárt. Revue d’intelligence Artificielle RSTI série RIA,Lavoisier, vol. 21 (February 2007)
Sigaud, O., Wilson, S.W.: Learning classifier systems: a survey. Soft Comput. 11(11), 1065–1078 (2007)
Smith, S.F.: A Learning System based on Genetic Adaptive Algorithms. PhD thesis, University of Pittsburgh (1980)
Wilson, S.W.: Classifier fitness based on accuracy. Evolutionary Computation 3(2), 148–175 (1995)
Zatuchna, Z.V.: AgentP: A Learning Classifier System with Associative Perception in Maze Environments. PhD thesis, School of Computing Sciences, UEA (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gilles, É., Mathias, P. (2010). Building Accurate Strategies in Non Markovian Environments without Memory. In: Bacardit, J., Browne, W., Drugowitsch, J., Bernadó-Mansilla, E., Butz, M.V. (eds) Learning Classifier Systems. IWLCS IWLCS 2009 2008. Lecture Notes in Computer Science(), vol 6471. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17508-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-17508-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17507-7
Online ISBN: 978-3-642-17508-4
eBook Packages: Computer ScienceComputer Science (R0)