The Skyline algorithm for POMDP value function pruning

Raphael, Christopher; Shani, Guy

doi:10.1007/s10472-012-9302-1

The Skyline algorithm for POMDP value function pruning

Published: 21 June 2012

Volume 65, pages 61–77, (2012)
Cite this article

Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Christopher Raphael¹ &
Guy Shani²

148 Accesses
9 Citations
Explore all metrics

Abstract

We address the pruning or filtering problem, encountered in exact value iteration in POMDPs and elsewhere, in which a collection of linear functions is reduced to the minimal subset retaining the same maximal surface. We introduce the Skyline algorithm, which traces the graph corresponding to the maximal surface. The algorithm has both a complete and an iterative version, which we present, along with the classical Lark’s algorithm, in terms of the basic dictionary-based simplex iteration from linear programming. We discuss computational complexity results, and present comparative experiments on both randomly-generated and well-known POMDP benchmarks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Article 09 April 2023

Optimal classification trees

Article 03 April 2017

Metaheuristics: a comprehensive overview and classification along with bibliometric analysis

Article 16 March 2021

References

Bellman, R.E.: Dynamic Programming. Princeton University Press, Princeton (1962)
MATH Google Scholar
Cassandra, A.R., Littman, M.L., Zhang, N.: Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes. In: Proceedings of the Conference in Uncertainty in Artificial Intelligence (UAI’97), pp. 54–61 (1997)
Chvatal, V.: Linear Programming (Series of Books in the Mathematical Sciences). Freeman, San Francisco (1983)
Google Scholar
Doshi, F., Roy, N.: The permutable POMDP: fast solutions to POMDPs for preference elicitation. In: Proceddings of the International Conference on autonomous Agents and Multiagent Systems (AAMAS), pp. 493–500 (2008)
Feng, Z., Zilberstein, S.: Region-based incremental pruning for POMDPs. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, UAI ’04, pp. 146–153 (2004)
Hansen, E.A.: Solving POMDPs by searching in policy space. In: Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 211–219 (1998)
Hauskrecht, M., Fraser, H.S.F.: Planning treatment of ischemic heart disease with partially observable markov decision processes. Artif. Intell. Med. 18(3), 221–244 (2000)
Article Google Scholar
Hsiao, K., Kaelbling, L.P., Lozano-Pérez, T.: Grasping POMDPs. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 4685–4692 (2007)
Huynh, V.A., Roy, N.: icLQG: combining local and global optimization for control in information space. In: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), pp. 2851–2858 (2009)
Kalai, G.: Linear programming, the simplex algorithm and simple polytopes. Math. Program. (Ser. B) 79, 217–234 (1997)
MathSciNet MATH Google Scholar
Raphael, C., Nichols, E.: Linear dynamic programming and the training of sequence estimators. In: Chinneck, J.W., Kristjansson, B., Saltzman, M. (eds.) Operations Research and Cyber-Infrastructure, vol. 47, pp. 219–231. Springer, New York (2009)
Chapter Google Scholar
Shani, G., Meek, C.: Improving existing fault recovery policies. In: Proceddings of the Neural Information Processing Systems (NIPS), pp. 1642–1650 (2009)
Shani, G., Heckerman, D., Brafman, R.I.: An MDP-based recommender system. J. Mach. Learn. Res. 6, 1265–1295 (2005)
MathSciNet MATH Google Scholar
Shani, G., Brafman, R.I., Shimony, S.E.: Forward search value iteration for POMDPs. In: Proceedings of the Interantional Joint Conference on Artificial Intelligence (IJCAI), pp. 2619–2624 (2007)
Shani, G., Brafman, R.I., Shimony, S.E.: Prioritizing point-based POMDP solvers. IEEE Trans. Syst. Man Cybern., Part B 38(6), 1592–1605 (2008)
Article Google Scholar
Smallwood, R., Sondik, E.: The optimal control of partially observable processes over a finite horizon. OR 21, 1071–1088 (1973)
Article MATH Google Scholar
Smith, T., Simmons, R.: Point-based POMDP algorithms: improved analysis and implementation. In: Proceedings of the 21st Conference in Uncertainty in Artificial Intelligence (UAI) (2005)
Sondik, E.J.: The optimal control of partially observable Markov processes. Ph.D. thesis, Stanford University (1971)
White, C.C.: Partially observed Markov decision processes: a survey. Ann. Oper. Res. 32, 215–230 (1991)
Article MathSciNet MATH Google Scholar
Williams, J.D., Young, S.: Partially observable Markov decision processes for spoken dialog systems. Comput. Speech Lang. 21(2), 393–422 (2007)
Article Google Scholar
Zhang, N., Liu, W.: Planning in stochastic domains: problem characteristics and approximation. Tech. Rep. HKUST-CS96-31, Dept. of Comp. Sci., Hong Kong Univ. of Sci. and Tech. (1997)

Download references

Author information

Authors and Affiliations

School of Informatics, Indiana University, Bloomington, IN, USA
Christopher Raphael
Information Systems Engineering, Ben Gurion University, Beer Sheva, Israel
Guy Shani

Authors

Christopher Raphael
View author publications
You can also search for this author in PubMed Google Scholar
Guy Shani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Guy Shani.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Raphael, C., Shani, G. The Skyline algorithm for POMDP value function pruning. Ann Math Artif Intell 65, 61–77 (2012). https://doi.org/10.1007/s10472-012-9302-1

Download citation

Published: 21 June 2012
Issue Date: May 2012
DOI: https://doi.org/10.1007/s10472-012-9302-1

Keywords

Mathematics Subject Classification (2010)

90C40

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The Skyline algorithm for POMDP value function pruning

Abstract

Access this article

Similar content being viewed by others

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Optimal classification trees

Metaheuristics: a comprehensive overview and classification along with bibliometric analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification (2010)

Navigation

The Skyline algorithm for POMDP value function pruning

Abstract

Access this article

Similar content being viewed by others

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Optimal classification trees

Metaheuristics: a comprehensive overview and classification along with bibliometric analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation