Nash ϵ -equilibria for stochastic games with total reward functions: an approach through Markov decision processes

Francisco J. González-Padilla; Raúl Montes-de-Oca

Kybernetika (2019)

  • Volume: 55, Issue: 1, page 152-165
  • ISSN: 0023-5954

Abstract

top
The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an ϵ -equilibrium. To reach this goal, the results of Markov decision processes are used to find ϵ -optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s Fixed Point Theorem to obtain the ϵ -equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.

How to cite

top

González-Padilla, Francisco J., and Montes-de-Oca, Raúl. "Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes." Kybernetika 55.1 (2019): 152-165. <http://eudml.org/doc/294587>.

@article{González2019,
abstract = {The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an $\epsilon $-equilibrium. To reach this goal, the results of Markov decision processes are used to find $\epsilon $-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s Fixed Point Theorem to obtain the $\epsilon $-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.},
author = {González-Padilla, Francisco J., Montes-de-Oca, Raúl},
journal = {Kybernetika},
keywords = {stochastic games; Nash equilibrium; Markov decision processes; total rewards},
language = {eng},
number = {1},
pages = {152-165},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes},
url = {http://eudml.org/doc/294587},
volume = {55},
year = {2019},
}

TY - JOUR
AU - González-Padilla, Francisco J.
AU - Montes-de-Oca, Raúl
TI - Nash $\epsilon $-equilibria for stochastic games with total reward functions: an approach through Markov decision processes
JO - Kybernetika
PY - 2019
PB - Institute of Information Theory and Automation AS CR
VL - 55
IS - 1
SP - 152
EP - 165
AB - The main objective of this paper is to find structural conditions under which a stochastic game between two players with total reward functions has an $\epsilon $-equilibrium. To reach this goal, the results of Markov decision processes are used to find $\epsilon $-optimal strategies for each player and then the correspondence of a better answer as well as a more general version of Kakutani’s Fixed Point Theorem to obtain the $\epsilon $-equilibrium mentioned. Moreover, two examples to illustrate the theory developed are presented.
LA - eng
KW - stochastic games; Nash equilibrium; Markov decision processes; total rewards
UR - http://eudml.org/doc/294587
ER -

References

top
  1. Aliprantis, C. D., Border, K. C., Infinite Dimensional Analysis., Springer 2006. Zbl1156.46001MR2378491
  2. Ash, R. B., Real Analysis and Probability., Academic Press, New York 1972. MR0435320
  3. Bartle, R., 10.1002/zamm.19650450519, John Wiley and Sons, Inc. 1964. MR0393369DOI10.1002/zamm.19650450519
  4. Cavazos-Cadena, R., Montes-de-Oca, R., 10.1007/978-1-4613-0265-0_11, In: Markov Processes and Controlled Markov Chains 2002 (Z. Hou, J. A. Filar and A. Chen, eds.), Kluwer Academic Publishers, pp. 189-221. MR2022426DOI10.1007/978-1-4613-0265-0_11
  5. Filar, J., Vrieze, K., Competitive Markov Decision Processes., Springer-Verlag, New York 1997. MR1418636
  6. Habil, E. D., Double sequences and double series., The Islamic Univ. J., Series of Natural Studies and Engineering 14 (2006), 1-32. (This reference is available at the Islamic University Journal's site: http://journal.iugaza.edu.ps/index.php/IUGNS/article/view/1594/1525.) 
  7. Hernández-Lerma, O., Lasserre, J. B., 10.1007/978-1-4612-0729-0, Springer-Verlag, New York 1996. Zbl0840.93001MR1363487DOI10.1007/978-1-4612-0729-0
  8. Hordijk, A., Dynamic Programming and Markov Potential Theory., Mathematical Centre Tracts 51, Amsterdam 1974. MR0432227
  9. Jaśkiewicz, A., Nowak, A. S., 10.1007/s13235-011-0013-8, Dyn. Games Appl. 1 (2011), 2, 253-279. MR2804096DOI10.1007/s13235-011-0013-8
  10. Kakutani, S., 10.1215/s0012-7094-41-00838-4, Duke Math. J. 8 (1942), 457-459. MR0004776DOI10.1215/s0012-7094-41-00838-4
  11. Kelley, J. L., General Topology., Springer, New York 1955. MR0070144
  12. Köthe, G., Topological Vector Spaces I., Springer-Verlag, 1969. MR0248498
  13. Puterman, M., Markov Decision Processes., John Wiley and Sons, Inc. New Jersey 1994. Zbl1184.90170MR1270015
  14. Shapley, L. S., 10.1073/pnas.39.10.1095, Proc. Nat. Acad. Sci. U. S. A. 39 (1953), 1095-1100. Zbl1180.91042MR0061807DOI10.1073/pnas.39.10.1095
  15. Thuijsman, F., Optimality and Equilibria in Stochastic Games., CW1 Tract-82, Amsterdam 1992. MR1171220
  16. Zeidler, E., Nonlinear Functional Analysis and its Applications., Springer-Verlag, New York Inc. 1988. MR0816732

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

    
                

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.