A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these two recursions.
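A minimal sketch of one recursive step of such a scheme, under assumed notation that is not taken from the paper: a finite state space, a parametrized transition kernel supplied as transition(theta), a user-provided gradient grad_log_lik of the one-step log-likelihood, a stepsize pair (a_n, b_n) for the two recursions, and a certainty-equivalence greedy control. All names here (self_tuning_step, grad_log_lik, transition, cost, ref_state) are hypothetical illustrations, not the paper's construction.

import numpy as np

def self_tuning_step(theta, h, x, u, x_next,
                     grad_log_lik, transition, cost,
                     a_n, b_n, ref_state=0):
    """One recursive step: stochastic-approximation update of the parameter
    estimate, then a relative value iteration update of the relative value
    function, then a certainty-equivalence greedy control choice."""
    # Stochastic-approximation ascent on the log-likelihood of the
    # observed transition (x, u) -> x_next at the current estimate theta.
    theta = theta + a_n * grad_log_lik(theta, x, u, x_next)

    # Relative value iteration for the average-cost problem under the
    # current estimate: P[u, x, y] is the transition kernel at theta,
    # cost[x, u] the running cost, h the relative value function.
    P = transition(theta)
    Q = cost + np.einsum('uxy,y->xu', P, h)   # one-step cost-to-go per (x, u)
    h_new = Q.min(axis=1)
    # Span-normalise by a fixed reference state and move h a small step.
    h = h + b_n * (h_new - h_new[ref_state] - h)

    # Certainty-equivalence control: greedy action at the new state.
    u_next = int(Q[x_next].argmin())
    return theta, h, u_next

In practice the two stepsize sequences would be chosen on different timescales, so that the parameter estimate and the value iteration track their respective limiting o.d.e.s; the sketch above only fixes the shape of the recursion.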
The problem of minimizing the ergodic or time-averaged cost for a controlled diffusion with partial observations can be recast as an equivalent control problem for the associated nonlinear filter. In analogy with the completely observed case, one may seek the value function for this problem as the vanishing discount limit of value functions for the associated discounted cost problems. This passage is justified here for the scalar case under a stability hypothesis, leading in particular to a "martingale"...
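In illustrative notation (not the paper's), the vanishing discount passage takes the familiar form: with V_alpha the value function of the alpha-discounted problem for the nonlinear filter pi and pi_0 a fixed reference filter state, one considers

\[
\bar V_\alpha(\pi) \;=\; V_\alpha(\pi) - V_\alpha(\pi_0), \qquad
\alpha V_\alpha(\pi_0) \;\longrightarrow\; \beta, \qquad
\bar V_\alpha(\pi) \;\longrightarrow\; V(\pi) \quad \text{as } \alpha \downarrow 0,
\]

where beta is the optimal ergodic cost and the limit pair (V, beta) serves as the candidate for the average-cost dynamic programming (here "martingale") formulation.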