Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs
Kybernetika (1989)
- Volume: 25, Issue: 3, page 145-156
- ISSN: 0023-5954
Access Full Article
topHow to cite
topCavazos-Cadena, Rolando. "Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs." Kybernetika 25.3 (1989): 145-156. <http://eudml.org/doc/28240>.
@article{Cavazos1989,
author = {Cavazos-Cadena, Rolando},
journal = {Kybernetika},
keywords = {Average cost Markov decision chains; discrete time parameter; single stationary policy; finite average cost; average cost optimal stationary policy},
language = {eng},
number = {3},
pages = {145-156},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs},
url = {http://eudml.org/doc/28240},
volume = {25},
year = {1989},
}
TY - JOUR
AU - Cavazos-Cadena, Rolando
TI - Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs
JO - Kybernetika
PY - 1989
PB - Institute of Information Theory and Automation AS CR
VL - 25
IS - 3
SP - 145
EP - 156
LA - eng
KW - Average cost Markov decision chains; discrete time parameter; single stationary policy; finite average cost; average cost optimal stationary policy
UR - http://eudml.org/doc/28240
ER -
References
top- A. Hordijk, Dynamic Programming and Potential Theory, (Mathematical Centre Tract 51.) Mathematisch Centrum, Amsterdam 1974. (1974) MR0432227
- D. P. Heyman, M. J. Sobel, Stochastic Models in Operations Research, Vol. II, McGraw-Hill, New York 1984. (1984) Zbl0531.90062
- J. Dugundji, Topology, Allyn and Bacon, Boston 1966. (1966) Zbl0144.21501MR0193606
- L. C. Thomas, Connectedness conditions for denumerable state Markov decision processes, In: Recent Developments in Markov Decision Processes (Hartley, Thomas, White, eds.), Academic Press, New York 1981, pp. 181-204. (1981)
- L. I. Sennott, A new condition for the existence of optimal stationary policies in average cost Markov decision processes, Oper. Res. Lett. 5 (1986), 17-23. (1986) Zbl0593.90083MR0845763
- L. I. Sennott, A new condition for the existence of optimum stationary policies in average cost Markov decision processes -- unbounded cost case, Proceedings of the 25th IEEE Conf. on Dec. and Control, Athens, Greece 1986, pp. 1719-1721. (1986)
- L. I. Sennott, Average cost optimal stationary policies in infinite state Markov decision processes -- Existence and an algorithm, Submitted (1987). (1987)
- M. Loève, Probability Theory I, Springer-Verlag, New York--Berlin--Heidelberg 1977. (1977) MR0651017
- P. Nain, K. W. Ross, Optimal priority assignement with hard constraints, Submitted to IEEE Trans. Automat. Control (1986). (1986) MR0855542
- R. Cavazos-Cadena, Necessary conditions for the optimality equation in average-reward Markov decision processes, Appl. Math. Optim. 19 (1989), 1, 97-112. (1989) Zbl0663.90094MR0955092
- R. Cavazos-Cadena, Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains, System Control Lett. 10 (1988), 71-78. (1988) Zbl0645.90099MR0920807
- R. B. Ash, Real Analysis and Probability, Academic Press, New York 1972. (1972) MR0435320
- S. M. Ross, Applied Probability Models with Optimization Applications, Holden-Day, San Francisco 1970. (1970) Zbl0213.19101MR0264792
- S. M. Ross, Introduction to Stochastic Dynamic Programming, Academic Press, New York 1983. (1983) Zbl0567.90065MR0749232
Citations in EuDML Documents
topNotesEmbed ?
topTo embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.