Approximation and estimation in Markov control processes under a discounted criterion

J. Adolfo Minjárez-Sosa

Displaying similar documents to “Approximation and estimation in Markov control processes under a discounted criterion”

Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion

Evgueni I. Gordienko, J. Adolfo Minjárez-Sosa (1998)

Kybernetika

Similarity:

We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by recurrent equations $x_{t + 1} = F (x_{t}, a_{t}, ξ_{t}), t = 0, 1, ...$ with i.i.d. $ℜ^{k}$ -valued random vectors $ξ_{t}$ whose density $ρ$ is unknown. Assuming observability of $ξ_{t}$ we propose the procedure of statistical estimation of $ρ$ that allows us to prove discounted asymptotic optimality of two types of adaptive policies used early for the processes with bounded...

Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion

Juan González-Hernández, Raquiel R. López-Martínez, J. Adolfo Minjárez-Sosa (2009)

Kybernetika

Similarity:

The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate, and possibly unbounded costs. The state process $\{x_{t}\}$ and the discount process $\{α_{t}\}$ evolve according to the coupled difference equations $x_{t + 1} = F (x_{t}, α_{t}, a_{t}, ξ_{t}),$ $α_{t + 1} = G (α_{t}, η_{t})$ where the state and discount disturbance processes ${ξ_{t}}$ and ${η_{t}}$ are sequences of i.i.d. random variables with densities $ρ^{ξ}$ and $ρ^{η}$ respectively. The main objective is to introduce approximation algorithms...

Approximation and adaptive control of Markov processes: Average reward criterion

Onésimo Hernández-Lerma (1987)

Kybernetika

Similarity: