Nash -equilibria for stochastic games with total reward functions: an approach through Markov decision processes
Francisco J. González-Padilla; Raúl Montes-de-Oca
Kybernetika (2019)
- Volume: 55, Issue: 1, page 152-165
- ISSN: 0023-5954
Access Full Article
topAbstract
topHow to cite
topGonzález-Padilla, Francisco J., and Montes-de-Oca, Raúl. "Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes." Kybernetika 55.1 (2019): 152-165. <http://eudml.org/doc/294587>.
@article{González2019,
abstract = {The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an $\epsilon $-equilibrium. To reach this goal, the results of Markov decision processes are used to find $\epsilon $-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s Fixed Point Theorem to obtain the $\epsilon $-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.},
author = {González-Padilla, Francisco J., Montes-de-Oca, Raúl},
journal = {Kybernetika},
keywords = {stochastic games; Nash equilibrium; Markov decision processes; total rewards},
language = {eng},
number = {1},
pages = {152-165},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes},
url = {http://eudml.org/doc/294587},
volume = {55},
year = {2019},
}
TY - JOUR
AU - González-Padilla, Francisco J.
AU - Montes-de-Oca, Raúl
TI - Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes
JO - Kybernetika
PY - 2019
PB - Institute of Information Theory and Automation AS CR
VL - 55
IS - 1
SP - 152
EP - 165
AB - The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an $\epsilon $-equilibrium. To reach this goal, the results of Markov decision processes are used to find $\epsilon $-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s Fixed Point Theorem to obtain the $\epsilon $-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.
LA - eng
KW - stochastic games; Nash equilibrium; Markov decision processes; total rewards
UR - http://eudml.org/doc/294587
ER -
References
top- Aliprantis, C. D., Border, K. C., Infinite Dimensional Analysis., Springer 2006. Zbl1156.46001MR2378491
- Ash, R. B., Real Analysis and Probability., Academic Press, New York 1972. MR0435320
- Bartle, R., 10.1002/zamm.19650450519, John Wiley and Sons, Inc. 1964. MR0393369DOI10.1002/zamm.19650450519
- Cavazos-Cadena, R., Montes-de-Oca, R., 10.1007/978-1-4613-0265-0_11, In: Markov Processes and Controlled Markov Chains 2002 (Z. Hou, J. A. Filar and A. Chen, eds.), Kluwer Academic Publishers, pp. 189-221. MR2022426DOI10.1007/978-1-4613-0265-0_11
- Filar, J., Vrieze, K., Competitive Markov Decision Processes., Springer-Verlag, New York 1997. MR1418636
- Habil, E. D., Double sequences and double series., The Islamic Univ. J., Series of Natural Studies and Engineering 14 (2006), 1-32. (This reference is available at the Islamic University Journal's site: http://journal.iugaza.edu.ps/index.php/IUGNS/article/view/1594/1525.)
- Hernández-Lerma, O., Lasserre, J. B., 10.1007/978-1-4612-0729-0, Springer-Verlag, New York 1996. Zbl0840.93001MR1363487DOI10.1007/978-1-4612-0729-0
- Hordijk, A., Dynamic Programming and Markov Potential Theory., Mathematical Centre Tracts 51, Amsterdam 1974. MR0432227
- Jaśkiewicz, A., Nowak, A. S., 10.1007/s13235-011-0013-8, Dyn. Games Appl. 1 (2011), 2, 253-279. MR2804096DOI10.1007/s13235-011-0013-8
- Kakutani, S., 10.1215/s0012-7094-41-00838-4, Duke Math. J. 8 (1942), 457-459. MR0004776DOI10.1215/s0012-7094-41-00838-4
- Kelley, J. L., General Topology., Springer, New York 1955. MR0070144
- Köthe, G., Topological Vector Spaces I., Springer-Verlag, 1969. MR0248498
- Puterman, M., Markov Decision Processes., John Wiley and Sons, Inc. New Jersey 1994. Zbl1184.90170MR1270015
- Shapley, L. S., 10.1073/pnas.39.10.1095, Proc. Nat. Acad. Sci. U. S. A. 39 (1953), 1095-1100. Zbl1180.91042MR0061807DOI10.1073/pnas.39.10.1095
- Thuijsman, F., Optimality and Equilibria in Stochastic Games., CW1 Tract-82, Amsterdam 1992. MR1171220
- Zeidler, E., Nonlinear Functional Analysis and its Applications., Springer-Verlag, New York Inc. 1988. MR0816732
NotesEmbed ?
topTo embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.