A weak convergence approach to hybrid LQG problems with infinite control weights.

Yin, G.George; Yong, Jiongmin

Displaying similar documents to “A weak convergence approach to hybrid LQG problems with infinite control weights.”

Estimates for perturbations of general discounted Markov control chains

Raúl Montes-de-Oca, Alexander Sakhanenko, Francisco Salem-Silva (2003)

Applicationes Mathematicae

Similarity:

We extend previous results of the same authors ([11]) on the effects of perturbation in the transition probability of a Markov cost chain for discounted Markov control processes. Supposing valid, for each stationary policy, conditions of Lyapunov and Harris type, we get upper bounds for the index of perturbations, defined as the difference of the total expected discounted costs for the original Markov control process and the perturbed one. We present examples that satisfy our conditions. ...

Recursive self-tuning control of finite Markov chains

Vivek Borkar (1997)

Applicationes Mathematicae

Similarity:

A recursive self-tuning control scheme for finite Markov chains is proposed wherein the unknown parameter is estimated by a stochastic approximation scheme for maximizing the log-likelihood function and the control is obtained via a relative value iteration algorithm. The analysis uses the asymptotic o.d.e.s associated with these.

Estimates for perturbations of average Markov decision processes with a minimal state and upper bounded by stochastically ordered Markov chains

Raúl Montes-de-Oca, Francisco Salem-Silva (2005)

Kybernetika

Similarity:

This paper deals with Markov decision processes (MDPs) with real state space for which its minimum is attained, and that are upper bounded by (uncontrolled) stochastically ordered (SO) Markov chains. We consider MDPs with (possibly) unbounded costs, and to evaluate the quality of each policy, we use the objective function known as the average cost. For this objective function we consider two Markov control models $ℙ$ and $ℙ_{1}$ . $ℙ$ and $ℙ_{1}$ have the same components except for the transition laws....

Ergodic control of partially observed Markov processes with equivalent transition probabilities

Łukasz Stettner (1993)

Applicationes Mathematicae

Similarity:

Optimal control with long run average cost functional of a partially observed Markov process is considered. Under the assumption that the transition probabilities are equivalent, the existence of the solution to the Bellman equation is shown, with the use of which optimal strategies are constructed.

A generalization of Ueno's inequality for n-step transition probabilities

Andrzej Nowak (1998)

Applicationes Mathematicae

Similarity:

We provide a generalization of Ueno's inequality for n-step transition probabilities of Markov chains in a general state space. Our result is relevant to the study of adaptive control problems and approximation problems in the theory of discrete-time Markov decision processes and stochastic games.

Regeneration and general Markov chains.

Kalashnikov, Vladimir V. (1994)

Journal of Applied Mathematics and Stochastic Analysis

Similarity:

The tail structure of nonhomogeneous finite state Markov chains: survey

Marius Losifescu (1979)

Banach Center Publications

Similarity:

From shuffling cards to walking around the building: An introduction to modern Markov chain theory.

Diaconis, Persi (1998)

Documenta Mathematica

Similarity: