On self-optimizing control of Markov processes
Petr Mandl (1985)
Banach Center Publications
Similarity:
Petr Mandl (1985)
Banach Center Publications
Similarity:
Evgueni I. Gordienko, Francisco Salem-Silva (2000)
Kybernetika
Similarity:
For a discrete-time Markov control process with the transition probability , we compare the total discounted costs and , when applying the optimal control policy and its approximation . The policy is optimal for an approximating process with the transition probability . A cost per stage for considered processes can be unbounded. Under certain ergodicity assumptions we establish the upper bound for the relative stability index . This bound does not depend...
Łukasz Stettner (1993)
Applicationes Mathematicae
Similarity:
Optimal control with long run average cost functional of a partially observed Markov process is considered. Under the assumption that the transition probabilities are equivalent, the existence of the solution to the Bellman equation is shown, with the use of which optimal strategies are constructed.
Evgueni I. Gordienko, J. Adolfo Minjárez-Sosa (1998)
Kybernetika
Similarity:
We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by recurrent equations with i.i.d. -valued random vectors whose density is unknown. Assuming observability of we propose the procedure of statistical estimation of that allows us to prove discounted asymptotic optimality of two types of adaptive policies used early for the processes with bounded...