Uniform value in dynamic programming

Jérôme Renault

Journal of the European Mathematical Society (2011)

  • Volume: 013, Issue: 2, page 309-330
  • ISSN: 1435-9855

Abstract

top
We consider dynamic programming problems with a large time horizon, and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are uniformly continuous and the transition correspondence is non expansive. In the same spirit, we give an existence result for the limit value. We also apply our results to Markov decision processes and obtain a few generalizations of existing results.

How to cite

top

Renault, Jérôme. "Uniform value in dynamic programming." Journal of the European Mathematical Society 013.2 (2011): 309-330. <http://eudml.org/doc/277283>.

@article{Renault2011,
abstract = {We consider dynamic programming problems with a large time horizon, and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are uniformly continuous and the transition correspondence is non expansive. In the same spirit, we give an existence result for the limit value. We also apply our results to Markov decision processes and obtain a few generalizations of existing results.},
author = {Renault, Jérôme},
journal = {Journal of the European Mathematical Society},
keywords = {uniform value; dynamic programming; Markov decision processes; limit value; Blackwell optimality; average payoffs; long-run values; precompact state space; non expansive correspondence},
language = {eng},
number = {2},
pages = {309-330},
publisher = {European Mathematical Society Publishing House},
title = {Uniform value in dynamic programming},
url = {http://eudml.org/doc/277283},
volume = {013},
year = {2011},
}

TY - JOUR
AU - Renault, Jérôme
TI - Uniform value in dynamic programming
JO - Journal of the European Mathematical Society
PY - 2011
PB - European Mathematical Society Publishing House
VL - 013
IS - 2
SP - 309
EP - 330
AB - We consider dynamic programming problems with a large time horizon, and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are uniformly continuous and the transition correspondence is non expansive. In the same spirit, we give an existence result for the limit value. We also apply our results to Markov decision processes and obtain a few generalizations of existing results.
LA - eng
KW - uniform value; dynamic programming; Markov decision processes; limit value; Blackwell optimality; average payoffs; long-run values; precompact state space; non expansive correspondence
UR - http://eudml.org/doc/277283
ER -

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.