Skip to main content
Log in

Adaptive control for discrete-time Markov processes with unbounded costs: Average criterion

  • Published:
Mathematical Methods of Operations Research Aims and scope Submit manuscript

Abstract.

The paper deals with a class of discrete-time Markov control processes with Borel state and action spaces, and possibly unbounded one-stage costs. The processes are given by recurrent equations x t +1=F(x t ,a t t ), t=1,2,… with i.i.d. ℜk– valued random vectors ξ t whose density ρ is unknown. Assuming observability of ξ t , and taking advantage of the procedure of statistical estimation of ρ used in a previous work by authors, we construct an average cost optimal adaptive policy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Author information

Authors and Affiliations

Authors

Additional information

Received March/Revised version October 1997

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gordienko, E., Minjárez-Sosa, J. Adaptive control for discrete-time Markov processes with unbounded costs: Average criterion. Mathematical Methods of OR 48, 37–55 (1998). https://doi.org/10.1007/PL00003993

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/PL00003993

Navigation