Page 1

Displaying 1 – 4 of 4

Showing per page

On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms

Ewa Drabik (1996)

Applicationes Mathematicae

Two kinds of strategies for a multiarmed Markov bandit problem with controlled arms are considered: a strategy with forcing and a strategy with randomization. The choice of arm and control function in both cases is based on the current value of the average cost per unit time functional. Some simulation results are also presented.

On the discrete time-varying JLQG problem

Adam Czornik, Andrzej Świerniak (2002)

International Journal of Applied Mathematics and Computer Science

In the present paper optimal time-invariant state feedback controllers are designed for a class of discrete time-varying control systems with Markov jumping parameter and quadratic performance index. We assume that the coefficients have limits as time tends to infinity and the boundary system is absolutely observable and stabilizable. Moreover, following the same line of reasoning, an adaptive controller is proposed in the case when system parameters are unknown but their strongly consistent estimators...

Currently displaying 1 – 4 of 4

Page 1