Network Distributed POMDP with Communication

Iwanari, Yuki; Yabu, Yuichi; Tasaki, Makoto; Yokoo, Makoto

doi:10.1007/978-3-642-00609-8_4

Yuki Iwanari²⁴,
Yuichi Yabu²⁴,
Makoto Tasaki²⁴ &
…
Makoto Yokoo²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5447))

Included in the following conference series:

Annual Conference of the Japanese Society for Artificial Intelligence

699 Accesses

Abstract

While Distributed POMDPs have become popular for modeling multiagent systems in uncertain domains, it is the Network Distributed POMDPs (ND-POMDPs) model that has begun to scale-up the number of agents. The ND-POMDPs can utilize the locality in agents’ interactions. However, prior work in ND-POMDPs has failed to address communication. Without communication, the size of a local policy at each agent within the ND-POMDPs grows exponentially in the time horizon. To overcome this problem, we extend existing algorithms so that agents periodically communicate their observation and action histories with each other. After communication, agents can start from new synchronized belief state. Thus, we can avoid the exponential growth in the size of local policies at agents. Furthermore, we introduce an idea that is similar the Point-based Value Iteration algorithm to approximate the value function with a fixed number of representative points. Our experimental results show that we can obtain much longer policies than existing algorithms as long as the interval between communications is small.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bernstein, D.S., Zilberstein, S., Immerman, N.: The complexity of decentralized control of markov decision processes. In: Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence (UAI 2000), pp. 32–37 (2000)
Google Scholar
Szer, D., Francois Charpillet, S.Z.: MAA*: A heuristic search algorithm for solving decentralized POMDPs. In: Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI 2005), pp. 576–590 (2005)
Google Scholar
Nair, R., Roth, M., Yokoo, M., Tambe, M.: Communication for improving policy computation in distributed pomdps. In: Proceedings of the Third International joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), pp. 1096–1103 (2004)
Google Scholar
Nair, R., Varakantham, P., Tambe, M., Yokoo, M.: Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In: Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI 2005), pp. 133–139 (2005)
Google Scholar
Varakantham, P., Marecki, J., Yabu, Y., Tambe, M., Yokoo, M.: Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In: Proceedings of the 6th International Joint Conference on Autonomous Agents and Multi-agent Systems (AAMAS 2007), pp. 822–829 (May 2007)
Google Scholar
Goldman, C.V., Zilberstein, S.: Optimizing information exchange in cooperative multi-agent systems. In: Proceedings of the Second International Joint Conference on Autonomous Agents and Multi-agent Systems (AAMAS 2003), pp. 137–144 (2003)
Google Scholar
Roth, M., Simmons, R., Veloso, M.: Exploiting factored representations for decentralized execution in multiagent teams. In: Proceedings of the 6th International joint conference on Autonomous agents and Multi-agent Systems (AAMAS 2007), pp. 457–463 (2007)
Google Scholar
Shen, J., Becker, R., Lesser, V.: Agent interaction in distributed pomdps and its implications on complexity. In: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems (AAMAS 2006), pp. 529–536 (2006)
Google Scholar
Pineau, J., Gordon, G., Thrun, S.: Anytime point-based approximations for large POMDPs. Journal of Artificial Intelligence Research 227, 335–380 (2006)
MATH Google Scholar
Yokoo, M., Hirayama, K.: Distributed breakout algorithm for solving distributed constraint satisfaction problems. In: Proceeding of the Second International Conference on Multiagent Systems (ICMAS 1996), pp. 401–408 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Kyushu University, Fukuoka, 819-0395, Japan
Yuki Iwanari, Yuichi Yabu, Makoto Tasaki & Makoto Yokoo

Authors

Yuki Iwanari
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Yabu
View author publications
You can also search for this author in PubMed Google Scholar
Makoto Tasaki
View author publications
You can also search for this author in PubMed Google Scholar
Makoto Yokoo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, 606-8501, Kyoto, Japan
Hiromitsu Hattori
Toshiba Corp., Research & Development Center, 1, Komukai-Toshiba-cho, Saiwai-ku, 212-8582, Kawasaki, Japan
Takahiro Kawamura
Tokyo Research Laboratory, IBM Research, 1623-14 Shimo-tsuruma, Yamato, 242-8502, Kanagawa, Japan
Tsuyoshi Idé
Faculty of Information Science and Electrical Engineering, Kyushu University, 744 Motooka, Nishi-ku, 819-0395, Fukuoka, Japan
Makoto Yokoo
National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Soraku-gun, 619-0289, Kyoto, Japan
Yohei Murakami

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Iwanari, Y., Yabu, Y., Tasaki, M., Yokoo, M. (2009). Network Distributed POMDP with Communication. In: Hattori, H., Kawamura, T., Idé, T., Yokoo, M., Murakami, Y. (eds) New Frontiers in Artificial Intelligence. JSAI 2008. Lecture Notes in Computer Science(), vol 5447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00609-8_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-00609-8_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00608-1
Online ISBN: 978-3-642-00609-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics