Currently displaying 1 – 1 of 1

Showing per page

Order by Relevance | Title | Year of publication

A stopping rule for discounted Markov decision processes with finite action sets

In a Discounted Markov Decision Process (DMDP) with finite action sets the Value Iteration Algorithm, under suitable conditions, leads to an optimal policy in a finite number of steps. Determining an upper bound on the necessary number of steps till gaining convergence is an issue of great theoretical and practical interest as it would provide a computationally feasible stopping rule for value iteration as an algorithm for finding an optimal policy. In this paper we find such a bound depending only...

Page 1

Download Results (CSV)