Page 1 Next

Displaying 1 – 20 of 25

Showing per page

Calculating the variance in Markov-processes with random reward.

Francisco Benito (1982)

Trabajos de Estadística e Investigación Operativa

In this article we present a generalization of Markov Decision Processes with discreet time where the immediate rewards in every period are not deterministic but random, with the two first moments of the distribution given.Formulas are developed to calculate the expected value and the variance of the reward of the process, formulas which generalize and partially correct other results. We make some observations about the distribution of rewards for processes with limited or unlimited horizon and...

Collisions of random walks

Martin T. Barlow, Yuval Peres, Perla Sousi (2012)

Annales de l'I.H.P. Probabilités et statistiques

A recurrent graph G has the infinite collision property if two independent random walks on G , started at the same point, collide infinitely often a.s. We give a simple criterion in terms of Green functions for a graph to have this property, and use it to prove that a critical Galton–Watson tree with finite variance conditioned to survive, the incipient infinite cluster in d with d 19 and the uniform spanning tree in 2 all have the infinite collision property. For power-law combs and spherically symmetric...

Computing the Stackelberg/Nash equilibria using the extraproximal method: Convergence analysis and implementation details for Markov chains games

Kristal K. Trejo, Julio B. Clempner, Alexander S. Poznyak (2015)

International Journal of Applied Mathematics and Computer Science

In this paper we present the extraproximal method for computing the Stackelberg/Nash equilibria in a class of ergodic controlled finite Markov chains games. We exemplify the original game formulation in terms of coupled nonlinear programming problems implementing the Lagrange principle. In addition, Tikhonov's regularization method is employed to ensure the convergence of the cost-functions to a Stackelberg/Nash equilibrium point. Then, we transform the problem into a system of equations in the...

Convergence of iterates of a transfer operator, application to dynamical systems and to Markov chains

Jean-Pierre Conze, Albert Raugi (2003)

ESAIM: Probability and Statistics

We present a spectral theory for a class of operators satisfying a weak “Doeblin–Fortet” condition and apply it to a class of transition operators. This gives the convergence of the series k 0 k r P k f , r , under some regularity assumptions and implies the central limit theorem with a rate in n - 1 2 for the corresponding Markov chain. An application to a non uniformly hyperbolic transformation on the interval is also given.

Currently displaying 1 – 20 of 25

Page 1 Next