The Effects of Bounding Rationality on the Performance and Learning of CHREST Agents in Tileworld

Lloyd-Kelly, Martyn; Lane, Peter C. R.; Gobet, Fernand

doi:10.1007/978-3-319-12069-0_10

The Effects of Bounding Rationality on the Performance and Learning of CHREST Agents in Tileworld

Martyn Lloyd-Kelly³,
Peter C. R. Lane⁴ &
Fernand Gobet³

Conference paper
First Online: 30 October 2014

520 Accesses
2 Citations
3 Altmetric

Abstract

Learning in complex and complicated domains is fundamental to performing suitable and timely actions within them. The ability of chess masters to learn and recall huge numbers of board configurations to produce near-optimal actions provides evidence that chunking mechanisms are likely to underpin human learning. Cognitive theories based on chunking argue in favour for the notion of bounded rationality since relatively small chunks of information are learnt in comparison to the total information present in the environment. CHREST, a computational architecture that implements chunking theory, has previously been used to investigate learning in deterministic environments such as chess, where future states are solely dependent upon the actions of agents. In this paper, the CHREST architecture is implemented in agents situated in “Tileworld”, a stochastic environment whose future state depends on both the actions of agents and factors intrinsic to the environment which agents have no control over. The effects of bounding agents’ visual input on learning and performance in various scenarios where the complexity of Tileworld is altered is analysed using computer simulations. Our results show that interactions between independent variables are complex and have important implications for agents situated in stochastic environments where a balance must be struck between learning and performance.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
Complex in that there are many pieces capable of being moved, complicated in that there may be many possible solutions to a given position.
2.
With respect to both the number of positions and the amount of information in each position.
3.
Any Tileworld square can only be occupied by one artifact at most.
4.
The base 2 in the \(2^{n^{2}}\) term of the expression is derived from the fact that Tileworld squares may be empty or may be occupied by one instance of an artifact class. In Simari and Parsons’ version of Tileworld, there is only one artifact class: a hole. Therefore, in their version of Tileworld, a square will only ever be empty or occupied by a hole, which gives the base 2.
5.
MTT = “Move To Tile”.
6.
Note that the chunk retrieved from LTM is also placed into STM but this version of CHREST does not make use of STM chunks.
7.
The agent’s goals are implicit, i.e. goals are not explicitly represented in any data structure available to CHREST agents.
8.
All parameters concerning time are specified in seconds.
9.
Minimum “sight-radius” parameter value is 2 since agents must be able to see at least 1 square in front of a tile, so that its ability to be pushed can be determined.

References

Anderson, J.R., Bothell, D., Byrne, M.D., Douglass, S., Lebière, C., Qin, Y.L.: An integrated theory of the mind. Psychol. Rev. 111(4), 1036–1060 (2004)
Article Google Scholar
Bossomaier, T., Traish, J., Gobet, F., Lane, P.C.R.: Neuro-cognitive model of move location in the game of Go. In: Proceedings of the 2012 international joint conference on neural networks (2012)
Google Scholar
Chase, W.G., Simon, H.A.: Perception in chess. Cogn. Psychol. 4, 55–81 (1973)
Article Google Scholar
Cohn, D., Atlas, L., Ladner, R.: Improving generalization with active learning. Mach. Learn. 15, 201–221 (1994)
Google Scholar
de Groot, A.D., Gobet, F.: Perception and Memory in Chess: Heuristics of the Professional Eye. Van Gorcum, Assen (1996)
Google Scholar
Freudenthal, D., Pine, J.M., Gobet, F.: Simulating the referential properties of Dutch, German and English root infinitives in MOSAIC. Lang. Learn. Dev. 15, 1–29 (2009)
Article Google Scholar
Gobet, F., Lane, P.: Encyclopedia of the Science of Learning, Chapter Bounded Rationality and Learning. Springer, NY (2012)
Google Scholar
Gobet, F., Lane, P.C.R., Croker, S.J., Cheng, P.C.-H., Jones, G., Oliver, I., Pine, J.M.: Chunking mechanisms in human learning. Trends Cogn. Sci. 5, 236–243 (2001)
Article Google Scholar
Gobet, F., Simon, H.A.: Five seconds or sixty? Presentation time in expert memory. Cogn. Sci. 24, 651–682 (2000)
Article Google Scholar
Jones, G.A., Gobet, F., Pine, J.M.: Linking working memory and long-term memory: a computational model of the learning of new words. Dev. Sci. 10, 853–873 (2007)
Article Google Scholar
Jongman, R.W.: Het Oog Van De Meester. Van Gorcum, Assen (1968)
Google Scholar
Laird, J.E.: The Soar Cognitive Architecture. MIT Press, Cambridge (2012)
Google Scholar
Lane, P.C.R., Gobet, F.: Chrest models of implicit learning and board game interpetation. In: Bach, J., Goertzel, B., Ikle, M. (eds.) Proceedings of the Fifth Conference on Artificial General Intelligence, pp. 148–157. Springer, Berlin, Heidelberg (2012)
Chapter Google Scholar
Pollack, M., Ringuette, M.: Introducing the Tileworld: experimentally evaluating agent architectures. In: Eighth National Conference on Artificial Intelligence, pp. 183–189. AAAI Press, Menlo Park (1990)
Google Scholar
Shannon, C.E.: A chess-playing machine. Philos. Mag. 182, 41–51 (1950)
MathSciNet Google Scholar
Simari, G.I., Parsons, S.D.: On approximating the best decision for an autonomous agent. In: Sixth Workshop on Game Theoretic and Decision Theoretic Agents, Third Conference on Autonomous Agents and Multi-Agent Systems, pp. 91–100 (2004)
Google Scholar
Simon, H.A.: A behavioral model of rational choice. Q. J. Econ. 69, 99–118 (1955)
Article Google Scholar
Simon, H.A.: The Sciences of the Artificial. MIT Press, Cambridge, MA (1969)
Google Scholar
Simon, H.A., Gilmartin, K.J.: A simulation of memory for chess positions. Cogn. Psychol. 5, 29–46 (1973)
Article Google Scholar
Wooldridge, M., Parsons, S.: Intention reconsideration reconsidered. In: Intelligent Agents V, pp. 63–80. Springer, New York (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Psychological Sciences, University of Liverpool, Liverpool, L69 3BX, UK
Martyn Lloyd-Kelly & Fernand Gobet
School of Computer Science, University of Hertfordshire, College Lane, Hatfield, L69 3BX, UK
Peter C. R. Lane

Authors

Martyn Lloyd-Kelly
View author publications
You can also search for this author in PubMed Google Scholar
Peter C. R. Lane
View author publications
You can also search for this author in PubMed Google Scholar
Fernand Gobet
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martyn Lloyd-Kelly .

Editor information

Editors and Affiliations

University of Portsmouth, Portsmouth, United Kingdom
Max Bramer
School of Computing, Eng & Mathematics, University of Brighton, Brighton, West Sussex, United Kingdom
Miltos Petridis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lloyd-Kelly, M., Lane, P.C.R., Gobet, F. (2014). The Effects of Bounding Rationality on the Performance and Learning of CHREST Agents in Tileworld. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXXI. SGAI 2014. Springer, Cham. https://doi.org/10.1007/978-3-319-12069-0_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-12069-0_10
Published: 30 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12068-3
Online ISBN: 978-3-319-12069-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics