Optimal Threshold Policies for Multivariate Stopping-Time POMDPs

Krishnamurthy, Vikram

doi:10.1007/978-3-642-02906-6_73

Optimal Threshold Policies for Multivariate Stopping-Time POMDPs

Vikram Krishnamurthy²¹

Conference paper

1306 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5590))

Abstract

This paper deals with the solving multivariate partially observed Markov decision process (POMDPs). We give sufficient conditions on the cost function, dynamics of the Markov chain target and observation probabilities so that the optimal scheduling policy has a threshold structure with respect to the multivariate TP2 ordering. We present stochastic approximation algorithms to estimate the parameterized threshold policy.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Krishnamurthy, V., Djonin, D.: Structured threshold policies for dynamic sensor scheduling–a partially observed Markov decision process approach. IEEE Trans. Signal Proc. 55(10), 4938–4957 (2007)
Article Google Scholar
Moran, W., Suvorova, S., Howard, S.: Application of sensor scheduling concepts to radar. In: Hero, A., Castanon, D., Cochran, D., Kastella, K. (eds.) Foundations and Applications for Sensor Management, pp. 221–256. Springer, Heidelberg (2006)
Google Scholar
Evans, R., Krishnamurthy, V., Nair, G.: Networked sensor management and data rate control for tracking maneuvering targets. IEEE Trans. Signal Proc. 53(6), 1979–(1991)
Article Google Scholar
Lovejoy, W.: Some monotonicity results for partially observed Markov decision processes. Operations Research 35(5), 736–743 (1987)
Article MATH Google Scholar
Rieder, U.: Structural results for partially observed control models. Methods and Models of Operations Research 35, 473–490 (1991)
Article MATH Google Scholar
Krishnamurthy, V.: Algorithms for optimal scheduling and management of hidden Markov model sensors. IEEE Trans. Signal Proc. 50(6), 1382–1397 (2002)
Article Google Scholar
Krishnamurthy, V., Wahlberg, B.: POMDP multiarmed bandits – structural results. Mathematics of Operations Research (May 2009)
Google Scholar
Lovejoy, W.: On the convexity of policy regions in partially observed systems. Operations Research 35(4), 619–621 (1987)
Article MATH Google Scholar
Spall, J.: Introduction to Stochastic Search and Optimization. Wiley, Chichester (2003)
Book MATH Google Scholar
Gantmacher, F.: Matrix Theory, vol. 2. Chelsea Publishing Company, New York (1960)
Google Scholar
Karlin, S., Rinott, Y.: Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions. Journal of Multivariate Analysis 10, 467–498 (1980)
Article MATH Google Scholar
Topkis, D.: Supermodularity and Complementarity. Princeton University Press, Princeton (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, V6T 1Z4, Canada
Vikram Krishnamurthy

Authors

Vikram Krishnamurthy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISIB-CNR, Corso Stati Uniti 4, 35127, Padova, Italy
Claudio Sossai
ISIB-CNR, Corso Stati Uniti, 4, 35127, Padova, Italy
Gaetano Chemello

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Krishnamurthy, V. (2009). Optimal Threshold Policies for Multivariate Stopping-Time POMDPs. In: Sossai, C., Chemello, G. (eds) Symbolic and Quantitative Approaches to Reasoning with Uncertainty. ECSQARU 2009. Lecture Notes in Computer Science(), vol 5590. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02906-6_73

Download citation

DOI: https://doi.org/10.1007/978-3-642-02906-6_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02905-9
Online ISBN: 978-3-642-02906-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics