EUDML

Currently displaying 1 – 3 of 3

Order by Relevance | Title | Year of publication

Correction to "Recursive self-tuning control of finite Markov chains" (Applicationes Math. 24 (2) (1996), 169-188

V. Borkar — 1997

Applicationes Mathematicae

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

V. Borkar; S. Associate — 1998

Applicationes Mathematicae

This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffusions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis suggests a "cost-biased" estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense.

Correction to "The value function in ergodic control of diffusion processes with partial observations II" (Applicationes Math. 27 (2000), 455-464)

V. S. Borkar — 2001

Applicationes Mathematicae

Page 1

Download Results (CSV)

Advanced Search

Formula preview

Currently displaying 1 – 3 of 3

Correction to "Recursive self-tuning control of finite Markov chains" (Applicationes Math. 24 (2) (1996), 169-188

Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost

Correction to "The value function in ergodic control of diffusion processes with partial observations II" (Applicationes Math. 27 (2000), 455-464)