Abstract
The design of reward function is the key to build reinforcement learning system. With the analysis and research of the reinforcement learning and Markov games, an improved reward function is presented, which includes both the goal information based on task and learner’s action information based on its domain knowledge. According with this reinforcement function, reinforcement learning integrates the external environment reward and the internal behavior reward so that learner can perform better. The results of the experiment illuminates the reward function involving domain knowledge is better than the traditional reward function in application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Littman, M.L.: Value-function reinforcement learning in Markov games. Journal of Cognitive Systems Research 2, 55–66 (2001)
Boutilier, C.: Planning, Learning and Coordination in Multi-agent Decision Processes. In: Shoham, Y. (ed.) Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge, pp. 195–210. Morgan Kaufmann, San Francisco (1996)
Bowling, M., Veloso, M.: Existence of Multiagent Equilibria with Limited Agents. J of Artificial Intelligence Research 22(2), 353–384 (2004)
Watkons, C.J.C.H., Dayan, P.: Q-leanign. Machine Learning 8(3), 279–292 (1992)
Matalic, M.J.: Reward Functions for Accelerated Learning. In: Proc. Int. Conf. on Machine learning, pp. 181–189 (1994)
Mataric, M.J.: Learning in behavior-based multi-robot systems: policies, models, and other agents. Journal of Cognitive Systems Research 2, 81–93 (2001)
Inoue, K., Ota, J., Katayama, T., Arai, T.: Acceleration of Reinforcement Learning by A Mobile Robot Using Generalized Rules. In: Proc. IEEE Int. Conf. Intelligent Robots and Systems, pp. 885–890 (2000)
Calos, H.C.: Embedding a Priori Knowledge in Reinforcement Learning. Journal of Intelligent and Robotics Systems 21, 51–71 (1998)
Maclin, R., Shavlik, J.W.: Creating Advice-Taking Reinforcement Learners. Machine Learning 22, 251–281 (1996)
Smart, W.D., Kaelbling, L.P.: Effective reinforcement learning for mobile robots. In: Proceedings of the IEEE International Conference on Robotics and Automation (2002), www.ai.mit.edu/people/lpk/papers/icra2002.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fan, B., Pu, J. (2008). A Design of Reward Function Based on Knowledge in Multi-agent Learning. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_61
Download citation
DOI: https://doi.org/10.1007/978-3-540-88192-6_61
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)