Currently displaying 1 – 4 of 4

Showing per page

Order by Relevance | Title | Year of publication

Recursive self-tuning control of finite Markov chains

Vivek Borkar — 1997

Applicationes Mathematicae

A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these.

The value function in ergodic control of diffusion processes with partial observations II

Vivek Borkar — 2000

Applicationes Mathematicae

The problem of minimizing the ergodic or time-averaged cost for a controlled diffusion with partial observations can be recast as an equivalent control problem for the associated nonlinear filter. In analogy with the completely observed case, one may seek the value function for this problem as the vanishing discount limit of value functions for the associated discounted cost problems. This passage is justified here for the scalar case under a stability hypothesis, leading in particular to a "martingale"...

Page 1

Download Results (CSV)