A Design of Reward Function Based on Knowledge in Multi-agent Learning

Fan, Bo; Pu, Jiexin

doi:10.1007/978-3-540-88192-6_61

Bo Fan⁶ &
Jiexin Pu⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5139))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

2504 Accesses

Abstract

The design of reward function is the key to build reinforcement learning system. With the analysis and research of the reinforcement learning and Markov games, an improved reward function is presented, which includes both the goal information based on task and learner’s action information based on its domain knowledge. According with this reinforcement function, reinforcement learning integrates the external environment reward and the internal behavior reward so that learner can perform better. The results of the experiment illuminates the reward function involving domain knowledge is better than the traditional reward function in application.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Littman, M.L.: Value-function reinforcement learning in Markov games. Journal of Cognitive Systems Research 2, 55–66 (2001)
Article Google Scholar
Boutilier, C.: Planning, Learning and Coordination in Multi-agent Decision Processes. In: Shoham, Y. (ed.) Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge, pp. 195–210. Morgan Kaufmann, San Francisco (1996)
Google Scholar
Bowling, M., Veloso, M.: Existence of Multiagent Equilibria with Limited Agents. J of Artificial Intelligence Research 22(2), 353–384 (2004)
MATH MathSciNet Google Scholar
Watkons, C.J.C.H., Dayan, P.: Q-leanign. Machine Learning 8(3), 279–292 (1992)
Google Scholar
Matalic, M.J.: Reward Functions for Accelerated Learning. In: Proc. Int. Conf. on Machine learning, pp. 181–189 (1994)
Google Scholar
Mataric, M.J.: Learning in behavior-based multi-robot systems: policies, models, and other agents. Journal of Cognitive Systems Research 2, 81–93 (2001)
Article Google Scholar
Inoue, K., Ota, J., Katayama, T., Arai, T.: Acceleration of Reinforcement Learning by A Mobile Robot Using Generalized Rules. In: Proc. IEEE Int. Conf. Intelligent Robots and Systems, pp. 885–890 (2000)
Google Scholar
Calos, H.C.: Embedding a Priori Knowledge in Reinforcement Learning. Journal of Intelligent and Robotics Systems 21, 51–71 (1998)
Article Google Scholar
Maclin, R., Shavlik, J.W.: Creating Advice-Taking Reinforcement Learners. Machine Learning 22, 251–281 (1996)
Google Scholar
Smart, W.D., Kaelbling, L.P.: Effective reinforcement learning for mobile robots. In: Proceedings of the IEEE International Conference on Robotics and Automation (2002), www.ai.mit.edu/people/lpk/papers/icra2002.pdf
http://www.fira.net

Download references

Author information

Authors and Affiliations

Electronic Information Engineering College, Henan University of Science & Technology, 471003, Luoyang, P.R. China
Bo Fan & Jiexin Pu

Authors

Bo Fan
View author publications
You can also search for this author in PubMed Google Scholar
Jiexin Pu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Sichuan University, 610065, Chengdu, China
Changjie Tang
Department of Computer Science, The University of Western Ontario, Canada
Charles X. Ling
School of ITEE, The University of Queensland, Australia
Xiaofang Zhou
Faculty of Science & Engineering, York University, 355 Lumbers Building, M3J 1P3, Toronto, Ontario, Canada
Nick J. Cercone
School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, 4072, Queensland, Australia
Xue Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fan, B., Pu, J. (2008). A Design of Reward Function Based on Knowledge in Multi-agent Learning. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_61

Download citation

DOI: https://doi.org/10.1007/978-3-540-88192-6_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics