Time-discretization for controlled Markov processes. I. General approximation results
Nico M. van Dijk, Arie Hordijk (1996)
Kybernetika
Similarity:
The search session has expired. Please query the service again.
Nico M. van Dijk, Arie Hordijk (1996)
Kybernetika
Similarity:
Evgueni Gordienko, Onésimo Hernández-Lerma (1995)
Applicationes Mathematicae
Similarity:
This paper considers discrete-time Markov control processes on Borel spaces, with possibly unbounded costs, and the long run average cost (AC) criterion. Under appropriate hypotheses on weighted norms for the cost function and the transition law, the existence of solutions to the average cost optimality inequality and the average cost optimality equation are shown, which in turn yield the existence of AC-optimal and AC-canonical policies respectively.
Onésimo Hernández-Lerma (1987)
Kybernetika
Similarity:
Zhu, Quanxin, Guo, Xianping (2006)
Journal of Applied Mathematics and Stochastic Analysis
Similarity:
Nico M. van Dijk, Arie Hordijk (1996)
Kybernetika
Similarity:
H. J. Girlich (1985)
Banach Center Publications
Similarity:
Saebi, Nasrollah (2004)
Bulletin of the Malaysian Mathematical Sciences Society. Second Series
Similarity:
Anna Jaśkiewicz (2009)
Applicationes Mathematicae
Similarity:
We establish the average cost optimality equation and show the existence of an (ε-)optimal stationary policy for semi-Markov control processes without compactness and continuity assumptions. The only condition we impose on the model is the V-geometric ergodicity of the embedded Markov chain governed by a stationary policy.
R. Israel Ortega-Gutiérrez, Raúl Montes-de-Oca, Enrique Lemus-Rodríguez (2016)
Kybernetika
Similarity:
Many examples in optimization, ranging from Linear Programming to Markov Decision Processes (MDPs), present more than one optimal solution. The study of this non-uniqueness is of great mathematical interest. In this paper the authors show that in a specific family of discounted MDPs, non-uniqueness is a “fragile” property through Ekeland's Principle for each problem with at least two optimal policies; a perturbed model is produced with a unique optimal policy. This result not only supersedes...