An Adaptive Approach for the Exploration-Exploitation Dilemma for Learning Agents

Rejeb, Lilia; Guessoum, Zahia; M’Hallah, Rym

doi:10.1007/11559221_32

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3690)

Included in the following conference series:

International Central and Eastern European Conference on Multi-Agent Systems

Abstract

Learning agents have to deal with the exploration-exploitation dilemma. The choice between exploration and exploitation is very difficult in dynamic systems, particularly in large-scale ones such as economic systems. Recent research shows that there is neither an optimal nor a unique solution to this problem. In this paper, we propose an adaptive approach based on meta-rules to adapt the choice between exploration and exploitation. This new adaptive approach relies on the variations of the performance of the agents. To validate the approach, we apply it to economic systems and compare it to two adaptive methods, one local and one global. We adapt these two methods, which were originally proposed by Wilson, to economic systems. Moreover, we compare different exploration strategies and focus on their influence on the performance of the agents.
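
The abstract does not state the meta-rules themselves, so the following is only a rough illustration of the general idea of letting variations in an agent's performance drive its exploration rate. Everything here is an assumption made for illustration and not the authors' method: the function names `adapt_epsilon` and `choose_action`, the epsilon-greedy action selection, the fixed `step`, and the bounds `eps_min` and `eps_max` are all hypothetical.

```python
import random


def adapt_epsilon(epsilon, prev_perf, curr_perf,
                  step=0.05, eps_min=0.01, eps_max=0.5):
    """Hypothetical meta-rule (not from the paper): explore more when
    performance drops, explore less when it improves, and leave the
    rate unchanged when performance is flat."""
    if curr_perf < prev_perf:
        epsilon += step
    elif curr_perf > prev_perf:
        epsilon -= step
    # Keep the exploration rate inside the assumed bounds.
    return min(eps_max, max(eps_min, epsilon))


def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy selection over a list of action-value estimates."""
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.randrange(len(q_values))
    # Exploit: pick the action with the highest estimate.
    return max(range(len(q_values)), key=q_values.__getitem__)
```

In such a sketch an agent would call `adapt_epsilon` once per evaluation period with its previous and current performance, then pass the updated rate to `choose_action` at every decision step.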

References

Azoulay-Schwartz, R., Kraus, S., Wilkenfeld, J.: Exploration vs. exploitation: choosing a supplier in an environment of incomplete information. Elsevier Science, Amsterdam (2003)
Baum, J.A.C., Rao, H.: Handbook of Organizational Change and Development: Evolutionary Dynamics of Organizational Populations and Communities. Oxford University Press, Oxford (1999)
Butz, M.V., Wilson, S.W.: An algorithmic description of XCS. Journal of Soft Computing 6, 144–153 (2002)
Carmel, D., Markovitch, S.: Exploration Strategies for Model-Based Learning in Multi-agent Systems. In: Jennings, N., Sycara, K., Georgeff, M. (eds.) Autonomous Agents and Multi-agent systems, vol. 2(2), pp. 141–172 (1999)
Gittins, J.C.: Multi-armed bandit allocation indices. John Wiley and Sons, NY (1989)
Kaelbling, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Meuleau, N., Bourgine, P.: Exploration of multi-state environments: Local measure and back-propagation of uncertainty. Machine Learning 35(2), 117–154 (1999)
Miramontes Hercog, L., Fogarty, T.C.: Social Simulation Using a Multi-agent Model Based on Classifier Systems: The emergence of Vacillating Behavior in the "El Farol" Bar Problem. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2001. LNCS (LNAI), vol. 2321, pp. 88–111. Springer, Heidelberg (2002)
Rejeb, L., Guessoum, Z.: Adaptive Firms. In: Proc. AISTA 2004 International Conference on Advances in Intelligent Systems - Theory and Applications. IEEE Computer Society, Luxembourg (2004)
Penrose, E.T.: The theory of the growth of the firm. Basil Blackwell, Malden (1959)
Perez-Uribe, A., Hirsbrunner, B.: The risk of Exploration in multi-agent learning systems: a case study. In: Proc. Agents 2000 Joint workshop on learning agents, Barcelona, June 3–7, pp. 33–37 (2000)
Sutton, R.S., Barto, A.G.: Reinforcement learning, an introduction. The MIT Press, Cambridge (1998)
Thrun, S.B.: The role of exploration in learning control. In: Sofge, D.A. (ed.) Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, Van Nostrand Reinhold, Florence (1992)
Watkins, C., Dayan, P.: Q-Learning. Machine Learning 8, 279–292 (1992)
Wiering, M.: Explorations in Efficient Reinforcement Learning. Ph.D. thesis (February 1999)
Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary Computation 3(2), 149–175 (1995)
Wilson, S.W.: Explore/Exploit Strategies in Autonomy. In: Maes, P., Mataric, M., Pollack, J., Meyer, J.-A., Wilson, S. (eds.) From Animals to Animats 4, Proc. of the 4th International Conference of Adaptive Behavior, Cambridge (1996)

Author information

Authors and Affiliations

MODECO Team, CReSTIC, Rue des Crayères, Reims Cedex 2, France
Lilia Rejeb & Zahia Guessoum
LIP6, OASIS Team, Université de Paris-VI, 4 place Jussieu, 75252 cedex 5, France
Zahia Guessoum
Dep. of Statistics and Operations Research, Kuwait University, P.O. Box 5969, Safat, 13060
Rym M’Hallah

Editor information

Editors and Affiliations

Department of Cybernetics, Czech Technical University in Prague, Czech Republic
Michael Pěchouček
Austrian Research Institute for Artificial Intelligence, Freyung 6/6, A-1010, Vienna, Austria
Paolo Petta
Computer and Automation Research Institute, Hungarian Academy of Sciences, Kende u. 13-17., 1111, Budapest, Hungary
László Zsolt Varga

About this paper

Cite this paper

Rejeb, L., Guessoum, Z., M’Hallah, R. (2005). An Adaptive Approach for the Exploration-Exploitation Dilemma for Learning Agents. In: Pěchouček, M., Petta, P., Varga, L.Z. (eds) Multi-Agent Systems and Applications IV. CEEMAS 2005. Lecture Notes in Computer Science, vol 3690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11559221_32

DOI: https://doi.org/10.1007/11559221_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29046-9
Online ISBN: 978-3-540-31731-9
eBook Packages: Computer Science, Computer Science (R0)