Abstract
The central question for autonomous agents is how to remain viable in varied and changing environments despite their bounded cognitive capacities. This question is therefore equivalent to asking how abduction, their semiotic capacity to guess viable solutions, emerges. The claim is that no learning can happen without a hedonic principle; this principle defines the hedonic level.
The hedonic level is presented as a cognitive paradigm: the hedonic agent can teach itself its hedonic and sensorimotor anticipations, together with the distinctions that are meaningful and useful for these anticipations. This makes possible the emergence of a job architecture in a constructivist way.
A model of the emergence of abductive capacities, both within an architecture of jobs and within individual jobs, is proposed. This model takes into account both the limited cognitive capacities of the agent and its need to continuously manage the trade-off between exploration and exploitation. The claim is that, inside its job architecture, the hedonic agent can use only forward policies because of its bounded cognitive capacities. The theory of bandit processes establishes the optimality of such policies based on the Gittins index, and their relevance to the trade-off between exploration and exploitation. A new reinforcement learning rule, the I-Learning rule, is proposed to evaluate this index.
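To make the notion of an index policy concrete, the following is a minimal sketch of how a Gittins index can be computed for a small bandit arm whose dynamics are already known. It is not the I-Learning rule, which the abstract does not specify. It assumes instead the standard "restart-in-state" characterisation of the index, in which the index of a state equals (1 - beta) times the value of a problem where the agent may, at every step, either continue or restart from that state. The names `gittins_indices`, `choose_arm`, `rewards`, `transitions` and `beta` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence


def gittins_indices(
    rewards: Sequence[float],
    transitions: Sequence[Sequence[float]],
    beta: float = 0.9,
    tol: float = 1e-12,
    max_iter: int = 100_000,
) -> List[float]:
    """Gittins index of every state of one arm (a finite Markov reward chain).

    Assumption: the arm's model (rewards, transitions) is known, and the
    restart-in-state formulation is solved by plain value iteration.
    """
    n = len(rewards)
    indices = []
    for start in range(n):
        values = [0.0] * n
        for _ in range(max_iter):
            # Value of continuing from each state for one more step.
            cont = [
                rewards[j]
                + beta * sum(p * v for p, v in zip(transitions[j], values))
                for j in range(n)
            ]
            # Restarting means behaving as if the arm were back in `start`.
            restart = cont[start]
            new_values = [max(c, restart) for c in cont]
            gap = max(abs(a - b) for a, b in zip(new_values, values))
            values = new_values
            if gap < tol:
                break
        # Scale the restart value back to a per-step reward rate.
        indices.append((1.0 - beta) * values[start])
    return indices


def choose_arm(arm_states: Sequence[int], arm_indices: Sequence[Sequence[float]]) -> int:
    """Index policy: play the arm whose current state has the largest index."""
    return max(range(len(arm_states)), key=lambda a: arm_indices[a][arm_states[a]])
```

For a two-state arm in which state 0 pays nothing but leads to an absorbing state 1 that pays 1 per step, the index of state 0 works out to `beta`: the arm is worth exploring even though its immediate reward is zero. This is the sense in which an index policy balances exploration against exploitation using only a forward-looking, per-arm computation.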
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bourgine, P. (1995). The hedonic agent: A constructivist approach of abductive capacities. In: Castelfranchi, C., Müller, JP. (eds) From Reaction to Cognition. MAAMAW 1993. Lecture Notes in Computer Science, vol 957. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0027065
DOI: https://doi.org/10.1007/BFb0027065
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60155-5
Online ISBN: 978-3-540-49532-1
eBook Packages: Springer Book Archive