On adaptive control of Markov chains using nonparametric estimation

Ewa Drabik; Łukasz Stettner

On adaptive control of Markov chains using nonparametric estimation

Ewa Drabik; Łukasz Stettner

Applicationes Mathematicae (2000)

Volume: 27, Issue: 2, page 143-152
ISSN: 1233-7234

Access Full Article

top

Access to full text

Full (PDF)

Abstract

top

Two adaptive procedures for controlled Markov chains which are based on a nonparametric window estimation are shown.

How to cite

top

MLA
BibTeX
RIS

Drabik, Ewa, and Stettner, Łukasz. "On adaptive control of Markov chains using nonparametric estimation." Applicationes Mathematicae 27.2 (2000): 143-152. <http://eudml.org/doc/219263>.

@article{Drabik2000,
abstract = {Two adaptive procedures for controlled Markov chains which are based on a nonparametric window estimation are shown.},
author = {Drabik, Ewa, Stettner, Łukasz},
journal = {Applicationes Mathematicae},
keywords = {controlled Markov chain; estimation; adaptive control},
language = {eng},
number = {2},
pages = {143-152},
title = {On adaptive control of Markov chains using nonparametric estimation},
url = {http://eudml.org/doc/219263},
volume = {27},
year = {2000},
}

TY - JOUR
AU - Drabik, Ewa
AU - Stettner, Łukasz
TI - On adaptive control of Markov chains using nonparametric estimation
JO - Applicationes Mathematicae
PY - 2000
VL - 27
IS - 2
SP - 143
EP - 152
AB - Two adaptive procedures for controlled Markov chains which are based on a nonparametric window estimation are shown.
LA - eng
KW - controlled Markov chain; estimation; adaptive control
UR - http://eudml.org/doc/219263
ER -

References

top

[1] R. Agraval, The continuum-armed bandit problem, SIAM J. Control Optim. 33 (1995), 1926-1951. Zbl0848.93069
[2] V. S. Borkar, Recursive self-tuning of finite Markov chains, Appl. Math. (Warsaw) 24 (1996), 169-188. Zbl0951.93537
[3] E. Drabik, On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms, ibid. 23 (1996), 449-473. Zbl0848.93068
[4] T. Duncan, B. Pasik-Duncan and Ł. Stettner, Discretized maximum likelihood and almost optimal adaptive control of ergodic adaptive models, SIAM J. Control Optim. 36 (1998), 422-446. Zbl0914.93076
[5] T. Duncan, B. Pasik-Duncan and Ł. Stettner, Adaptive control of discrete Markov processes by the method of large deviations, in: Proc. 35th IEEE CDC, Kobe 1996, IEEE, 360-365. Zbl1006.93071
[6] O. Hernández-Lerma and R. Cavazos-Cadena, Density estimation and adaptive control of Markov processes; average and discounted criteria, Acta Appl. Math. 20 (1990), 285-307. Zbl0717.93066
[7] A. Nowak, A generalization of Ueno's inequality for n-step transition probabilities, Appl. Math. (Warsaw) 25 (1998), 295-299. Zbl0998.60068

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Language to use for this widget.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Number of notes per page

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.