A Fast Elimination Method for Pruning in POMDPs

Özgen, Selim; Demirekler, Mübeccel

doi:10.1007/978-3-319-46073-4_5


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9904)

Included in the following conference series:

Joint German/Austrian Conference on Artificial Intelligence (Künstliche Intelligenz)

Abstract

This paper aims to speed up the pruning procedure encountered in exact value iteration for POMDPs. The value function of a POMDP can be represented by a finite set of vectors over the state space. In each step of the exact value iteration algorithm, the number of candidate vectors grows linearly with the cardinality of the action set and exponentially with the cardinality of the observation set. This set must be pruned to a minimal subset that retains the same value function over the state space, which makes pruning, in general, the bottleneck in finding the optimal policy for POMDPs. This paper analyses two linear programming methods for detecting the useless vectors: the classical Lark’s algorithm and the recently proposed Skyline algorithm. We claim that both algorithms can be drastically improved by using information about the support regions of the vectors that have already been processed. We present comparative experiments on both randomly generated problems and POMDP benchmarks.
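
To make the quantities in the abstract concrete, the following is a minimal Python sketch, not the authors' method. It evaluates a value function stored as a set of vectors, counts the candidate vectors produced by one exact backup (assuming the standard enumeration of one vector per action and per assignment of a previous vector to each observation), and applies a simple pointwise-dominance filter. The function names are hypothetical. The filter is only a cheap pre-check: a vector that is beaten everywhere by a combination of other vectors, but by no single one, still has to be found with a linear program, as in Lark’s algorithm or Skyline.

```python
def value(belief, vectors):
    """Value at a belief point: the maximum dot product over all vectors."""
    return max(sum(b * a for b, a in zip(belief, vec)) for vec in vectors)


def candidate_count(num_actions, num_observations, num_vectors):
    """Candidate vectors after one exact backup, before pruning.

    Assumes the standard enumeration |A| * |V|^|O|: linear in the number
    of actions, exponential in the number of observations.
    """
    return num_actions * num_vectors ** num_observations


def pointwise_prune(vectors):
    """Drop every vector that a single other vector dominates in all states.

    This is a cheap pre-filter only (an illustrative assumption, not the
    LP-based pruning discussed in the abstract). Exact duplicates are
    reduced to their first occurrence.
    """
    kept = []
    for i, v in enumerate(vectors):
        dominated = False
        for j, w in enumerate(vectors):
            if i == j:
                continue
            covers = all(wk >= vk for wk, vk in zip(w, v))
            # For identical vectors, only the later copy counts as dominated.
            if covers and (w != v or j < i):
                dominated = True
                break
        if not dominated:
            kept.append(v)
    return kept
```

For example, with 2 actions, 3 observations and 4 vectors from the previous step, candidate_count(2, 3, 4) gives 128 candidates, and adding one more observation raises this to 512. Applying pointwise_prune to a set of vectors leaves value unchanged at every belief, because each removed vector is nowhere better than one that is kept.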

References

Cassandra, A.: Tony’s POMDP file repository page (1999). http://www.cs.brown.edu/research/ai/pomdp/examples/index.html
Cassandra, A., Littman, M.L., Zhang, N.L.: Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes. In: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, pp. 54–61. Morgan Kaufmann Publishers Inc. (1997)
Cassandra, A.R.: Exact and approximate algorithms for partially observable Markov decision processes. Brown University (1998)
Feng, Z., Zilberstein, S.: Region-based incremental pruning for POMDPs. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 146–153. AUAI Press (2004)
Harris, P.M.: Pivot selection methods of the Devex LP code. Math. Program. 5(1), 1–28 (1973)
Hauskrecht, M.: Value-function approximations for partially observable Markov decision processes. J. Artif. Intell. Res. 13, 33–94 (2000)
Hauskrecht, M., Fraser, H.: Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artif. Intell. Med. 18(3), 221–244 (2000)
Hero, A.O., Castanon, D., Cochran, D., Kastella, K.: Foundations and Applications of Sensor Management. Springer Science & Business Media, New York (2007)
Hoey, J., Poupart, P., von Bertoldi, A., Craig, T., Boutilier, C., Mihailidis, A.: Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process. Comput. Vis. Image Underst. 114(5), 503–519 (2010)
Littman, M.L.: The Witness algorithm: solving partially observable Markov decision processes. Brown University, Providence (1994)
Mallick, M., Krishnamurthy, V., Vo, B.N.: Integrated Tracking, Classification, and Sensor Management: Theory and Applications. Wiley, Hoboken (2012)
Monahan, G.E.: State of the art - a survey of partially observable Markov decision processes: theory, models, and algorithms. Manage. Sci. 28(1), 1–16 (1982)
Raphael, C., Shani, G.: The Skyline algorithm for POMDP value function pruning. Ann. Math. Artif. Intell. 65(1), 61–77 (2012)
Smallwood, R.D., Sondik, E.J.: The optimal control of partially observable Markov processes over a finite horizon. Oper. Res. 21(5), 1071–1088 (1973)
Temizer, S., Kochenderfer, M.J., Kaelbling, L.P., Lozano-Pérez, T., Kuchar, J.K.: Collision avoidance for unmanned aircraft using Markov decision processes. In: AIAA Guidance, Navigation, and Control Conference, Toronto, Canada (2010)
Zhang, N.L., Liu, W.: Planning in stochastic domains: problem characteristics and approximation. Technical report HKUST-CS96-31, Department of Computer Science, Hong Kong University of Science and Technology (1996)
Zhang, N.L., Zhang, W.: Speeding up the convergence of value iteration in partially observable Markov decision processes. J. Artif. Intell. Res. 14, 29–51 (2001)

Author information

Authors and Affiliations

Department of Electrical and Electronics Engineering, Middle East Technical University, Ankara, Turkey
Selim Özgen & Mübeccel Demirekler

Corresponding author

Correspondence to Selim Özgen.

Editor information

Editors and Affiliations

Alpen-Adria Universität Klagenfurt, Klagenfurt, Austria
Gerhard Friedrich
University of Basel, Basel, Switzerland
Malte Helmert
Technische Universität Graz, Graz, Austria
Franz Wotawa

About this paper

Cite this paper

Özgen, S., Demirekler, M. (2016). A Fast Elimination Method for Pruning in POMDPs. In: Friedrich, G., Helmert, M., Wotawa, F. (eds) KI 2016: Advances in Artificial Intelligence. KI 2016. Lecture Notes in Computer Science, vol 9904. Springer, Cham. https://doi.org/10.1007/978-3-319-46073-4_5

DOI: https://doi.org/10.1007/978-3-319-46073-4_5
Published: 08 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46072-7
Online ISBN: 978-3-319-46073-4
eBook Packages: Computer Science, Computer Science (R0)