Unique State and Automatical Action Abstracting Based on Logical MDPs with Negation

  • Conference paper
Advances in Natural Computation (ICNC 2006)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4222)

Abstract

In this paper we introduce negation into Logical Markov Decision Processes, a model of Relational Reinforcement Learning. In the resulting model, nLMDP, the abstract state space can be constructed in a simple way such that a desirable complementarity property holds. We also introduce prototype actions into the model. A distinctive feature of the model is that applicable abstract actions can be obtained automatically together with their valid substitutions. Given a complementary abstract state space and a set of prototype actions, a model-free Θ-learning method is implemented for evaluating the state-action-substitution value function.
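
To make the learning setup concrete, the sketch below illustrates one plausible reading of the Θ-learning loop over (abstract state, prototype action, substitution) triples, assuming a tabular, Q-learning-style temporal-difference update. The environment interface (reset, abstract_state, applicable_actions, step) and all names are hypothetical placeholders, not taken from the paper; the actual Θ-learning rule and the construction of the complementary abstract state space are defined in the paper itself.

    # A minimal sketch, not the paper's algorithm: tabular values indexed by
    # (abstract state, prototype action, substitution), updated with a
    # Q-learning-style temporal-difference rule. Substitutions are assumed to
    # be hashable (e.g. tuples of variable bindings). The environment methods
    # used here are hypothetical placeholders.
    import random
    from collections import defaultdict

    def theta_learning(env, episodes=100, alpha=0.1, gamma=0.9, epsilon=0.1):
        theta = defaultdict(float)  # (z, a, sub) -> value estimate
        for _ in range(episodes):
            s = env.reset()                    # ground start state
            done = False
            while not done:
                z = env.abstract_state(s)      # abstract state covering s
                # applicable abstract actions arrive as
                # (prototype action, substitution) pairs valid in z
                choices = env.applicable_actions(z)
                if random.random() < epsilon:
                    a, sub = random.choice(choices)
                else:
                    a, sub = max(choices, key=lambda c: theta[(z, c[0], c[1])])
                s_next, reward, done = env.step(s, a, sub)
                z_next = env.abstract_state(s_next)
                best_next = 0.0 if done else max(
                    theta[(z_next, a2, sub2)]
                    for a2, sub2 in env.applicable_actions(z_next))
                # temporal-difference update of the
                # state-action-substitution value
                theta[(z, a, sub)] += alpha * (
                    reward + gamma * best_next - theta[(z, a, sub)])
                s = s_next
        return theta

The abstract_state lookup stands in for the unique-state abstraction suggested by the paper's title; how that mapping and the complementary state space are actually computed is not shown here.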

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhiwei, S., Xiaoping, C. (2006). Unique State and Automatical Action Abstracting Based on Logical MDPs with Negation. In: Jiao, L., Wang, L., Gao, X., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4222. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881223_118

  • DOI: https://doi.org/10.1007/11881223_118

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45907-1

  • Online ISBN: 978-3-540-45909-5

  • eBook Packages: Computer Science (R0)
