EUDML

Currently displaying 1 – 4 of 4

Order by Relevance | Title | Year of publication

Estimates of stability of Markov control processes with unbounded costs

Evgueni I. Gordienko; Francisco Salem-Silva — 2000

Kybernetika

For a discrete-time Markov control process with the transition probability $p$ , we compare the total discounted costs $V_{β}$ $(π_{β})$ and $V_{β} ({\tilde{π}}_{β})$ , when applying the optimal control policy $π_{β}$ and its approximation ${\tilde{π}}_{β}$ . The policy ${\tilde{π}}_{β}$ is optimal for an approximating process with the transition probability $\tilde{p}$ . A cost per stage for considered processes can be unbounded. Under certain ergodicity assumptions we establish the upper bound for the relative stability index $[V_{β} ({\tilde{π}}_{β}) - V_{β} (π_{β})] / V_{β} (π_{β})$ . This bound does not depend on a discount...

Estimates for perturbations of general discounted Markov control chains

Raúl Montes-de-Oca; Alexander Sakhanenko; Francisco Salem-Silva — 2003

Applicationes Mathematicae

We extend previous results of the same authors ([11]) on the effects of perturbation in the transition probability of a Markov cost chain for discounted Markov control processes. Supposing valid, for each stationary policy, conditions of Lyapunov and Harris type, we get upper bounds for the index of perturbations, defined as the difference of the total expected discounted costs for the original Markov control process and the perturbed one. We present examples that satisfy our conditions.

Estimates for perturbations of discounted Markov chains on general spaces

Raúl Montes-de-Oca; Alexander Sakhanenko; Francisco Salem-Silva — 2003

Applicationes Mathematicae

We analyse a Markov chain and perturbations of the transition probability and the one-step cost function (possibly unbounded) defined on it. Under certain conditions, of Lyapunov and Harris type, we obtain new estimates of the effects of such perturbations via an index of perturbations, defined as the difference of the total expected discounted costs between the original Markov chain and the perturbed one. We provide an example which illustrates our analysis.

Estimates for perturbations of average Markov decision processes with a minimal state and upper bounded by stochastically ordered Markov chains

Raúl Montes-de-Oca; Francisco Salem-Silva — 2005

Kybernetika

This paper deals with Markov decision processes (MDPs) with real state space for which its minimum is attained, and that are upper bounded by (uncontrolled) stochastically ordered (SO) Markov chains. We consider MDPs with (possibly) unbounded costs, and to evaluate the quality of each policy, we use the objective function known as the average cost. For this objective function we consider two Markov control models $ℙ$ and $ℙ_{1}$ . $ℙ$ and $ℙ_{1}$ have the same components except for the transition laws. The transition...

Page 1

Download Results (CSV)

Advanced Search

Formula preview

Currently displaying 1 – 4 of 4

Estimates of stability of Markov control processes with unbounded costs

Estimates for perturbations of general discounted Markov control chains

Estimates for perturbations of discounted Markov chains on general spaces

Estimates for perturbations of average Markov decision processes with a minimal state and upper bounded by stochastically ordered Markov chains