Bi-personal stochastic transient Markov games with stopping times and total reward criterion

Martínez-Cortés Victor Manuel (2021)

Kybernetika

The article is devoted to a class of bi-personal (players 1 and 2), zero-sum Markov games evolving in discrete time on transient Markov reward chains. At each decision time, the second player can stop the system by paying a terminal reward to the first player. If the system is not stopped, the first player selects a decision and two things happen: the Markov chain moves to the next state according to the known transition law, and the second player must pay a reward to the first player. The first player...
