The value function in ergodic control of diffusion processes with partial observations II

Vivek Borkar

Documents similar to “The value function in ergodic control of diffusion processes with partial observations II”

Controlled diffusion processes.

Borkar, Vivek S. (2005)

Probability Surveys [electronic only]

Ergodic control of Markov processes with mixed observation structure

Łukasz Stettner

Contents: 1. Introduction. 2. Preliminary results and assumptions. 3. Approximation of the invariant measure. 4. Construction of nearly optimal control functions. 4.1. Approximation of admissible control functions...

Degenerate variance control in the one-dimensional stationary case.

Ocone, Daniel, Weerasinghe, Ananda (2003)

Electronic Journal of Probability [electronic only]

Risk-sensitive control of stochastic hybrid systems on infinite time horizon.

Runolfsson, Thordur (2000)

Mathematical Problems in Engineering

Maximum process problems in optimal control theory.

Peskir, Goran (2005)

Journal of Applied Mathematics and Stochastic Analysis

On the uniqueness of optimal controls

Masatoshi Fujisaki (1979)

Séminaire de probabilités de Strasbourg

Aspects of control for the normal Markov processes.

Saebi, Nasrollah (2004)

Bulletin of the Malaysian Mathematical Sciences Society. Second Series

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

V. S. Borkar, S. M. Mundra (1998)

Applicationes Mathematicae

This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffusions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis suggests a "cost-biased" estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense.

Deterministic optimal policies for Markov control processes with pathwise constraints

Armando F. Mendoza-Pérez, Onésimo Hernández-Lerma (2012)

Applicationes Mathematicae

This paper deals with discrete-time Markov control processes in Borel spaces with unbounded rewards. Under suitable hypotheses, we show that a randomized stationary policy is optimal for a certain expected constrained problem (ECP) if and only if it is optimal for the corresponding pathwise constrained problem (pathwise CP). Moreover, we show that a certain parametric family of unconstrained optimality equations yields convergence properties that lead to an approximation scheme which...

A singular control problem with an expected and a pathwise ergodic performance criterion.

Jack, Andrew, Zervos, Mihail (2006)

Journal of Applied Mathematics and Stochastic Analysis

Necessary and sufficient optimality conditions for average reward of controlled Markov chains

Karel Sladký (1973)

Kybernetika
