Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion
J. Minjárez-Sosa (1999)
Applicationes Mathematicae
Similarity:
We introduce average cost optimal adaptive policies in a class of discrete-time Markov control processes with Borel state and action spaces, allowing unbounded costs. The processes evolve according to the system equations , t=1,2,..., with i.i.d. -valued random vectors , which are observable but whose density ϱ is unknown.