Necessary and sufficient optimality conditions for average reward of controlled Markov chains

Karel Sladký

Displaying similar documents to “Necessary and sufficient optimality conditions for average reward of controlled Markov chains”

On the set of optimal controls for Markov chains with rewards

Karel Sladký (1974)

Kybernetika

Similarity:

Discrete-time Markov control processes with discounted unbounded costs: Optimality criteria

Onésimo Hernández-Lerma, Myriam Muñoz de Ozak (1992)

Kybernetika

Similarity:

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

V. Borkar, S. Associate (1998)

Applicationes Mathematicae

Similarity:

This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffusions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis suggests a "cost-biased" estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense.

On the variance in controlled Markov chains

Petr Mandl (1971)

Kybernetika

Similarity: