New continuity estimates of geometric sums.
Let be two sequences of i.i.d. random vectors with values in and , , . Assuming that , , and the existence of a density of satisfying the certain conditions we prove the following inequalities: where and are the total variation and Zolotarev’s metrics, respectively.
The upper bounds of the uniform distance between two sums of a random number of independent random variables are given. The application of these bounds is illustrated by stability (continuity) estimating in models in queueing and risk theory.
This paper considers discrete-time Markov control processes on Borel spaces, with possibly unbounded costs, and the long run average cost (AC) criterion. Under appropriate hypotheses on weighted norms for the cost function and the transition law, the existence of solutions to the average cost optimality inequality and the average cost optimality equation are shown, which in turn yield the existence of AC-optimal and AC-canonical policies respectively.
This paper shows the convergence of the value iteration (or successive approximations) algorithm for average cost (AC) Markov control processes on Borel spaces, with possibly unbounded cost, under appropriate hypotheses on weighted norms for the cost function and the transition law. It is also shown that the aforementioned convergence implies strong forms of AC-optimality and the existence of forecast horizons.
For a discrete-time Markov control process with the transition probability , we compare the total discounted costs and , when applying the optimal control policy and its approximation . The policy is optimal for an approximating process with the transition probability . A cost per stage for considered processes can be unbounded. Under certain ergodicity assumptions we establish the upper bound for the relative stability index . This bound does not depend on a discount...
We deal with a stochastic programming problem that can be inconsistent. To overcome the inconsistency we apply Tikhonov's regularization technique, and, using recent results on the convergence rate of empirical measures in Wasserstein metric, we treat the following two related problems: 1. A choice of regularization parameters that guarantees the convergence of the minimization procedure. 2. Estimation of the rate of convergence in probability. Considering both light and heavy tail distributions...
We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by recurrent equations with i.i.d. -valued random vectors whose density is unknown. Assuming observability of we propose the procedure of statistical estimation of that allows us to prove discounted asymptotic optimality of two types of adaptive policies used early for the processes with bounded costs.
We study the limit behavior of certain classes of dependent random sequences (processes) which do not possess the Markov property. Assuming these processes depend on a control parameter we show that the optimization of the control can be reduced to a problem of nonlinear optimization. Under certain hypotheses we establish the stability of such optimization problems.
We study the stability of the classical optimal sequential probability ratio test based on independent identically distributed observations when testing two simple hypotheses about their common density : versus . As a functional to be minimized, it is used a weighted sum of the average (under ) sample number and the two types error probabilities. We prove that the problem is reduced to stopping time optimization for a ratio process generated by with the density . For being the corresponding...
Page 1