Point-Based Bounded Policy Iteration for Decentralized POMDPs

Kim, Youngwook; Kim, Kee-Eung

doi:10.1007/978-3-642-15246-7_58

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6230)

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

Abstract

We present a memory-bounded approximate algorithm for solving infinite-horizon decentralized partially observable Markov decision processes (DEC-POMDPs). In particular, we improve upon the bounded policy iteration (BPI) approach, which searches for a locally optimal stochastic finite-state controller, by adding a reachability analysis of the controller nodes. As a result, the algorithm uses different optimization criteria for the reachable and the unreachable nodes, which makes its search for an optimal policy more effective. Through experiments on benchmark problems, we show that our algorithm is competitive with the recent nonlinear optimization approach in both solution time and policy quality.
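
To make the notion of reachable and unreachable controller nodes concrete, the following is a minimal Python sketch for a single agent's stochastic finite-state controller. It is not the authors' algorithm: the names `action_prob`, `node_trans` and `reachable_nodes`, the dictionary layout, and the toy three-node controller are all assumptions made for illustration, and the sketch treats every listed observation as possible, ignoring the environment model.

```python
from collections import deque


def reachable_nodes(action_prob, node_trans, start):
    """Return the set of controller nodes reachable from `start`.

    Hypothetical layout (an assumption, not taken from the paper):
      action_prob[q][a]       -> probability of choosing action a in node q
      node_trans[q][a][o][q2] -> probability of moving to node q2 after
                                 taking action a in node q and observing o
    """
    seen = {start}
    queue = deque([start])
    while queue:
        q = queue.popleft()
        for a, p_a in action_prob[q].items():
            if p_a <= 0.0:
                continue  # action never chosen in this node
            for o, successors in node_trans[q][a].items():
                for q2, p in successors.items():
                    if p > 0.0 and q2 not in seen:
                        seen.add(q2)
                        queue.append(q2)
    return seen


# Toy controller with three nodes; node 2 is never entered from node 0.
action_prob = {
    0: {"listen": 1.0},
    1: {"open": 1.0},
    2: {"listen": 1.0},
}
node_trans = {
    0: {"listen": {"hl": {1: 1.0}, "hr": {0: 1.0}}},
    1: {"open": {"hl": {0: 1.0}, "hr": {0: 1.0}}},
    2: {"listen": {"hl": {0: 1.0}, "hr": {2: 1.0}}},
}

reachable = reachable_nodes(action_prob, node_trans, start=0)
unreachable = set(action_prob) - reachable
```

In this toy controller, a breadth-first search from node 0 visits nodes 0 and 1, so node 2 ends up in `unreachable`. An algorithm of the kind the abstract describes could then apply a different improvement criterion to each of the two sets.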

Author information

Authors and Affiliations

Search Solutions, Seongnam-si, Korea
Youngwook Kim
Korea Advanced Institute of Science and Technology, Daejeon, Korea
Kee-Eung Kim

Editor information

Editors and Affiliations

School of Computer Science and Engineering, Seoul National University, 151-744, Seoul, Korea
Byoung-Tak Zhang
Department of Computing, Macquarie University, NSW, Sydney, Australia
Mehmet A. Orgun

About this paper

Cite this paper

Kim, Y., Kim, KE. (2010). Point-Based Bounded Policy Iteration for Decentralized POMDPs. In: Zhang, BT., Orgun, M.A. (eds) PRICAI 2010: Trends in Artificial Intelligence. PRICAI 2010. Lecture Notes in Computer Science, vol 6230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15246-7_58

DOI: https://doi.org/10.1007/978-3-642-15246-7_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15245-0
Online ISBN: 978-3-642-15246-7
eBook Packages: Computer Science, Computer Science (R0)