Dynamic Programming Principle for tug-of-war games with noise
Juan J. Manfredi; Mikko Parviainen; Julio D. Rossi
ESAIM: Control, Optimisation and Calculus of Variations (2012)
- Volume: 18, Issue: 1, page 81-90
- ISSN: 1292-8119
Access Full Article
topAbstract
topHow to cite
topManfredi, Juan J., Parviainen, Mikko, and Rossi, Julio D.. "Dynamic Programming Principle for tug-of-war games with noise." ESAIM: Control, Optimisation and Calculus of Variations 18.1 (2012): 81-90. <http://eudml.org/doc/272799>.
@article{Manfredi2012,
abstract = {We consider a two-player zero-sum-game in a bounded open domain Ω described as follows: at a point x ∈ Ω, Players I and II play an ε-step tug-of-war game with probability α, and with probability β (α + β = 1), a random point in the ball of radius ε centered at x is chosen. Once the game position reaches the boundary, Player II pays Player I the amount given by a fixed payoff function F. We give a detailed proof of the fact that the value functions of this game satisfy the Dynamic Programming Principle\begin\{equation*\} u(x) = \frac\{\alpha \}\{2\} \left\lbrace \sup \_\{y\in B\_\{\}(x)\} u (y) + \inf \_\{ y \in B\_\{\}(x)\} u (y) \right\rbrace + \beta \_\{ B\_\{\}(x)\} u(y) y, \end\{equation*\}u ( x ) = α 2 ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ sup y ∈ B ε ( x ) u ( y ) + inf y ∈ B ε ( x ) u ( y ) ⎫ ⎪ ⎪ ⎬ ⎪ ⎪ ⎭ + β ∫ B ε ( x ) u ( y ) d y, forx ∈ Ω with u(y) = F(y) when y ∉ Ω. This principle implies the existence of quasioptimal Markovian strategies.},
author = {Manfredi, Juan J., Parviainen, Mikko, Rossi, Julio D.},
journal = {ESAIM: Control, Optimisation and Calculus of Variations},
keywords = {Dirichlet boundary conditions; dynamic programming principle; p-laplacian; stochastic games; two-player zero-sum games; zero sum game; tug-of-war game; dynamic programming; quasioptimal strategies; Markovian strategies},
language = {eng},
number = {1},
pages = {81-90},
publisher = {EDP-Sciences},
title = {Dynamic Programming Principle for tug-of-war games with noise},
url = {http://eudml.org/doc/272799},
volume = {18},
year = {2012},
}
TY - JOUR
AU - Manfredi, Juan J.
AU - Parviainen, Mikko
AU - Rossi, Julio D.
TI - Dynamic Programming Principle for tug-of-war games with noise
JO - ESAIM: Control, Optimisation and Calculus of Variations
PY - 2012
PB - EDP-Sciences
VL - 18
IS - 1
SP - 81
EP - 90
AB - We consider a two-player zero-sum-game in a bounded open domain Ω described as follows: at a point x ∈ Ω, Players I and II play an ε-step tug-of-war game with probability α, and with probability β (α + β = 1), a random point in the ball of radius ε centered at x is chosen. Once the game position reaches the boundary, Player II pays Player I the amount given by a fixed payoff function F. We give a detailed proof of the fact that the value functions of this game satisfy the Dynamic Programming Principle\begin{equation*} u(x) = \frac{\alpha }{2} \left\lbrace \sup _{y\in B_{}(x)} u (y) + \inf _{ y \in B_{}(x)} u (y) \right\rbrace + \beta _{ B_{}(x)} u(y) y, \end{equation*}u ( x ) = α 2 ⎧ ⎪ ⎪ ⎨ ⎪ ⎪ ⎩ sup y ∈ B ε ( x ) u ( y ) + inf y ∈ B ε ( x ) u ( y ) ⎫ ⎪ ⎪ ⎬ ⎪ ⎪ ⎭ + β ∫ B ε ( x ) u ( y ) d y, forx ∈ Ω with u(y) = F(y) when y ∉ Ω. This principle implies the existence of quasioptimal Markovian strategies.
LA - eng
KW - Dirichlet boundary conditions; dynamic programming principle; p-laplacian; stochastic games; two-player zero-sum games; zero sum game; tug-of-war game; dynamic programming; quasioptimal strategies; Markovian strategies
UR - http://eudml.org/doc/272799
ER -
References
top- [1] E. Le Gruyer, On absolutely minimizing Lipschitz extensions and PDE Δ∞(u) = 0. NoDEA14 (2007) 29–55. Zbl1154.35055MR2346452
- [2] E. Le Gruyer and J.C. Archer, Harmonious extensions. SIAM J. Math. Anal.29 (1998) 279–292. Zbl0915.46002MR1617186
- [3] A.P. Maitra and W.D. Sudderth, Borel stochastic games with limsup payoff. Ann. Probab.21 (1993) 861–885. Zbl0803.90142MR1217569
- [4] A.P. Maitra and W.D. Sudderth, Discrete gambling and stochastic games, Applications of Mathematics 32. Springer-Verlag (1996). Zbl0864.90148MR1382657
- [5] J.J. Manfredi, M. Parviainen and J.D. Rossi, An asymptotic mean value property characterization of p-harmonic functions. Proc. Am. Math. Soc.138 (2010) 881–889. Zbl1187.35115MR2566554
- [6] J.J. Manfredi, M. Parviainen and J.D. Rossi, On the definition and properties of p-harmonious functions. Preprint (2009). Zbl1252.91014MR3011990
- [7] A. Oberman, A convergent difference scheme for the infinity Laplacian : construction of absolutely minimizing Lipschitz extensions. Math. Comp.74 (2005) 1217–1230. Zbl1094.65110MR2137000
- [8] Y. Peres and S. Sheffield, Tug-of-war with noise : a game theoretic view of the p-Laplacian. Duke Math. J.145 (2008) 91–120. Zbl1206.35112MR2451291
- [9] Y. Peres, O. Schramm, S. Sheffield and D. Wilson, Tug-of-war and the infinity Laplacian. J. Am. Math. Soc.22 (2009) 167–210. Zbl1206.91002MR2449057
- [10] S.R.S. Varadhan, Probability theory, Courant Lecture Notes in Mathematics 7. Courant Institute of Mathematical Sciences, New York University/AMS (2001). Zbl0980.60002MR1852999
NotesEmbed ?
topTo embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.