Existence of average optimal policies in Markov control processes with strictly unbounded costs

Onésimo Hernández-Lerma

Existence of average optimal policies in Markov control processes with strictly unbounded costs

Onésimo Hernández-Lerma

Kybernetika (1993)

Volume: 29, Issue: 1, page 1-17
ISSN: 0023-5954

Access Full Article

top

Access to full text

Full (PDF)

How to cite

top

MLA
BibTeX
RIS

Hernández-Lerma, Onésimo. "Existence of average optimal policies in Markov control processes with strictly unbounded costs." Kybernetika 29.1 (1993): 1-17. <http://eudml.org/doc/27703>.

@article{Hernández1993,
author = {Hernández-Lerma, Onésimo},
journal = {Kybernetika},
keywords = {dynamic programming equation; discrete-time Markov control processes},
language = {eng},
number = {1},
pages = {1-17},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Existence of average optimal policies in Markov control processes with strictly unbounded costs},
url = {http://eudml.org/doc/27703},
volume = {29},
year = {1993},
}

TY - JOUR
AU - Hernández-Lerma, Onésimo
TI - Existence of average optimal policies in Markov control processes with strictly unbounded costs
JO - Kybernetika
PY - 1993
PB - Institute of Information Theory and Automation AS CR
VL - 29
IS - 1
SP - 1
EP - 17
LA - eng
KW - dynamic programming equation; discrete-time Markov control processes
UR - http://eudml.org/doc/27703
ER -

References

top

D. P. Bertsekas, Dynamic Programming: Deterministic and Stochastic Models, Prentice-Hall, Englewood Cliffs, N. J. 1987. (1987) Zbl0649.93001 MR0896902
D. P. Bertsekas, S. E. Shreve, Stochastic Optimal Control: The Discrete Time Case, Academic Press, New York 1978. (1978) Zbl0471.93002 MR0511544
P. Billingsley, Convergence of Probability Measures, Wiley, New York 1968. (1968) Zbl0172.21201 MR0233396
D. Blackwell, Memoryless strategies in finite-stage dynamic programming, Ann. Math. Statist. 35 (1964), 863-865. (1964) Zbl0127.36406 MR0162642
D. Blackwell, Discounted dynamic programming, Ann. Math. Statist. 36 (1965), 226-235. (1965) Zbl0133.42805 MR0173536
V. S. Borkar, Control of Markov chains with long-run average cost criterion: the dynamic programming equations, SIAM J. Control Optim. 27 (1989), 642-657. (1989) Zbl0668.60059 MR0993291
R. Cavazos-Cadena, Solution to the optimality equation in a class of average Markov decision chains with unbounded costs, Kybernetika 27 (1991), 23-37. (1991) MR1099512
J. Diebolt, D. Guegan, Probabilistic properties of the general nonlinear markovian process of order one and applications to time series modelling, Rapport Technique No. 125, Laboratoire de Statistique Theorique et Appliquee, CNR-URA 1321, Universite Paris VI, 1990. (1990)
J. L. Doob, Stochastic Processes, Wiley, New York 1953. (1953) Zbl0053.26802 MR0058896
M. Duflo, Methodes Recursives Aleatoires, Masson, Paris 1990. (1990) Zbl0703.62084 MR1082344
E. B. Dynkin, A. A. Yushkevich, Controlled Markov Processes, Springer - Verlag, Berlin 1979. (1979) MR0554083
R. Hartley, Dynamic programming and an undiscounted, infinite horizon, convex stochastic control problem, In: Recent Developments in Markov Decision Processes (R. Hartley, L. C. Thomas and D.J. White, eds.). Academic Press, London 1980, pp. 277-300. (1980)
O. Hernandez-Lerma, Lyapunov criteria for stability of differential equations with Markov parameters, Boletin Soc. Mat. Mexicana 24 (1979), 27-48. (1979) Zbl0486.60051 MR0579667
O. Hernandez-Lerma, Adaptive Markov Control Processes, Springer - Verlag, New York 1989. (1989) Zbl0698.90053 MR0995463
O. Hernandez-Lerma, Average optimality in dynamic programming on Borel spaces - unbounded costs and controls, Syst. Control Lett. 17 (1991), 237-242. (1991) Zbl0771.90098 MR1125975
O. Hernandez-Lerma, J. B. Lasserre, Average cost optimal policies for Markov control processes with Borel state space and unbounded costs, Syst. Control Lett. 15 (1990), 349-356. (1990) Zbl0723.93080 MR1078813
O. Hernandez-Lerma, J. B. Lasserre, Linear programming and average optimality of Markov control processes on Borel spaces - unbounded costs, Rapport LAAS, LAAS-CNRS, Toulouse 1992. To appear in SIAM J. Control Optim. (1992) MR1261150
O. Hernandez-Lerma R. Montes de Oca, R. Cavazos-Cadena, Recurrence conditions for Markov decision processes with Borel state space: a survey, Ann. Oper. Res. 28 (1991), 29-46. (1991) MR1105165
K. Hinderer, Foundations of Non-Stationary Dynamic Programming with Discrete Time Parameter, Springer-Verlag, Berlin 1970. (1970) Zbl0202.18401 MR0267890
M. Yu. Kitayev, Semi-Markov and jump Markov control models: average cost criterion, Theory Probab. Appl. 30 (1985), 272-288. (1985) MR0792619
M. Kurano, The existence of a minimum pair of state and policy for Markov decision processes under the hypothesis of Doeblin, SIAM J. Control Optim. 27 (1989), 296-307. (1989) Zbl0677.90085 MR0984830
H. J. Kushner, Introduction to Stochastic Control, Holt, Rinehart and Winston, New York 1971. (1971) Zbl0293.93018 MR0280248
A. Leizarowitz, Optimal controls for diffusions in $R^{n}$ , J. Math. Anal. Appl. 149 (1990), 180-209, (1990) MR1054802
S. P. Meyn, Ergodic theorems for discrete time stochastic systems using a stochastic Lyapunov function, SIAM J. Control Optim. 27 (1989), 1409-1439. (1989) Zbl0681.60067 MR1022436
A. Mokkadem, Sur un modele autoregressif nonlineaire. Ergodicite et ergodicite geometrique, J. Time Series Anal. 8 (1987), 195-205. (1987) MR0886138
D. Revuz, Markov Chains, Second edition. North-Holland, Amsterdam 1984. (1984) Zbl0539.60073 MR0758799
U. Rieder, Measurable selection theorems for optimization problems, Manuscripta Math. 24 (1978), 507-518. (1978) Zbl0385.28005 MR0493590
V. I. Rotar, T. A. Konyuhova, Two papers on asymptotic optimality in probability and almost surely, Preprint, Central Economic Mathematical Institute (CEMI), Moscow 1991. (1991)
R. H. Stockbridge, Time-average control of martingale problems: a linear programming formulation, Ann. Probab. 18 (1990), 206-217. (1990) Zbl0699.49019 MR1043944
J. Wijngaard, Existence of average optimal strategies in markovian decision problems with strictly unbounded costs, In: Dynamic Programming and Its Applications (M. L. Puterman, ed.), Academic Press, New York 1978, pp. 369-386. (1978) Zbl0458.90081 MR0537889
K. Yosida, Functional Analysis, Fifth edition. Springer-Verlag, Berlin 1978. (1978) Zbl0365.46001 MR0500055

Citations in EuDML Documents

top

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Language to use for this widget.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Number of notes per page

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.