Estimation and adaptive control of span-contracting Markov decision processes

Gerhard Hübner

Estimation and adaptive control of span-contracting Markov decision processes

Gerhard Hübner

Kybernetika (1991)

Volume: 27, Issue: 1, page 66-71
ISSN: 0023-5954

Access Full Article

top

Access to full text

Full (PDF)

How to cite

top

MLA
BibTeX
RIS

Hübner, Gerhard. "Estimation and adaptive control of span-contracting Markov decision processes." Kybernetika 27.1 (1991): 66-71. <http://eudml.org/doc/28807>.

@article{Hübner1991,
author = {Hübner, Gerhard},
journal = {Kybernetika},
keywords = {adaptive control; discrete time Markov decision process; finite state space; successive approximation},
language = {eng},
number = {1},
pages = {66-71},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Estimation and adaptive control of span-contracting Markov decision processes},
url = {http://eudml.org/doc/28807},
volume = {27},
year = {1991},
}

TY - JOUR
AU - Hübner, Gerhard
TI - Estimation and adaptive control of span-contracting Markov decision processes
JO - Kybernetika
PY - 1991
PB - Institute of Information Theory and Automation AS CR
VL - 27
IS - 1
SP - 66
EP - 71
LA - eng
KW - adaptive control; discrete time Markov decision process; finite state space; successive approximation
UR - http://eudml.org/doc/28807
ER -

References

top

R. S. Acosta-Abreu, O. Hernandez-Lerma, Iterative adaptive control of denumerable state average-cost Markov systems, Control Cybernet. 14 (1985), 313 - 322. (1985) MR0842780
V. V. Baranov, Recursive algorithms of adaptive control in stochastic systems, Cybernetics 17 (1981), 815-824. (1981) MR0689427
A. Federgruen, Markovian Control Problems, Math. Centre Tracts 97, Amsterdam 1983. (1983) Zbl0541.90068 MR0745450
A. Federgruen, P. J. Schweitzer, Nonstationary Markov decision problems with converging parameters, J. optim. Theory Appl. 34 (1981), 207-241. (1981) Zbl0426.90091 MR0625228
A. Federgruen P. J. Schweitzer, H. C Tijms, Contraction mappings underlying undiscounted Markov decision problems, J. Math. Anal. Appl. 65 (1978), 711 - 730. (1978) MR0510481
A. Federgruen, H. C Tijms, The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms, J. Appl. Probab. 15 (1978), 356-373. (1978) Zbl0386.90060 MR0475896
O. Hernandez-Lerma, Adaptive Control Processes, Springer-Verlag, Berlin-Heidelberg- New York 1989. (1989) MR0995463
K. Hinderer, On approximate solutions of finite-stage dynamic programs, In: Dynamic Programming and its applications (M. L. Puterman, ed.), Academic Press, New York 1978, pp. 289-317. (1978) Zbl0461.90075 MR0537885
G. Hiibner, Contraction properties of Markov decision models with applications to the elimination of non-optimal actions, In: Dynamische optimierung, Bonner Math. Schriften 98 (1977), 57-65. (1977) MR0524411
G. Hiibner, A unified approach to adaptive control of average reward Markov decision processes, OR Spektrum 10 (1988), 161-166. (1988) MR0961229
M. Kurano, Discrete-time Markovian decision processes with an unknown parameter - average return criterion, J. oper. Res. Soc. Japan 15 (1972), 67-76. (1972) Zbl0238.90006 MR0343942
M. Kurano, Adaptive policies in Markov decision processes with uncertain matrices, J. Inf. Optim. 4 (1983), 21-40. (1983) MR0697991
M. Kurano, Learning algorithms for Markov decision processes, J. Appl. Probab. 24 (1987), 270-276. (1987) Zbl0631.90085 MR0876190
P. Mandl, Estimation and control of Markov chains, Adv. in Appl. Probab. 6 (1974), 40-60. (1974) MR0339876
P. Mandl, On the adaptive control of countable Markov chains, In: Probability Theory, Banach Centre Publications, Warsaw 1979, pp. 159-173. (1979) Zbl0439.60069 MR0561478
W. Whitt, Approximations of dynamic programs, Math. Oper. Res. 3 (1978), 231 - 243. (1978) Zbl0393.90094 MR0506661

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Language to use for this widget.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Number of notes per page

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.