Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs
Rolando Cavazos-Cadena (1989)
Kybernetika
Similarity:
Rolando Cavazos-Cadena (1989)
Kybernetika
Similarity:
Zhu, Quanxin, Guo, Xianping (2006)
Journal of Applied Mathematics and Stochastic Analysis
Similarity:
Oscar Vega-Amaya, Fernando Luque-Vásquez (2000)
Applicationes Mathematicae
Similarity:
We deal with semi-Markov control processes (SMCPs) on Borel spaces with unbounded cost and mean holding time. Under suitable growth conditions on the cost function and the mean holding time, together with stability properties of the embedded Markov chains, we show the equivalence of several average cost criteria as well as the existence of stationary optimal policies with respect to each of these criteria.
Onésimo Hernández-Lerma, Myriam Muñoz de Ozak (1992)
Kybernetika
Similarity:
Oscar Vega-Amaya (1999)
Applicationes Mathematicae
Similarity:
We study the existence of sample path average cost (SPAC-) optimal policies for Markov control processes on Borel spaces with strictly unbounded costs, i.e., costs that grow without bound on the complement of compact subsets. Assuming only that the cost function is lower semicontinuous and that the transition law is weakly continuous, we show the existence of a relaxed policy with 'minimal' expected average cost and that the optimal average cost is the limit of discounted programs. Moreover,...