Currently displaying 1 – 6 of 6

Showing per page

Order by Relevance | Title | Year of publication

First passage risk probability optimality for continuous time Markov decision processes

Haifeng HuoXian Wen — 2019

Kybernetika

In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerable state space, a Borel action space, unbounded transition rates and nonnegative reward function. The optimality criterion to be considered is the first passage risk probability criterion. To ensure the non-explosion of the state processes, we first introduce a so-called drift condition, which is weaker than the well known regular condition for semi-Markov decision processes (SMDPs). Furthermore, under some...

Risk probability optimization problem for finite horizon continuous time Markov decision processes with loss rate

Haifeng HuoXian Wen — 2021

Kybernetika

This paper presents a study the risk probability optimality for finite horizon continuous-time Markov decision process with loss rate and unbounded transition rates. Under drift condition, which is slightly weaker than the regular condition, as detailed in existing literature on the risk probability optimality Semi-Markov decision processes, we prove that the value function is the unique solution of the corresponding optimality equation, and demonstrate the existence of a risk probability optimization...

The exponential cost optimality for finite horizon semi-Markov decision processes

Haifeng HuoXian Wen — 2022

Kybernetika

This paper considers an exponential cost optimality problem for finite horizon semi-Markov decision processes (SMDPs). The objective is to calculate an optimal policy with minimal exponential costs over the full set of policies in a finite horizon. First, under the standard regular and compact-continuity conditions, we establish the optimality equation, prove that the value function is the unique solution of the optimality equation and the existence of an optimal policy by using the minimum nonnegative...

Minimizing risk probability for infinite discounted piecewise deterministic Markov decision processes

Haifeng HuoJinhua CuiXian Wen — 2024

Kybernetika

The purpose of this paper is to study the risk probability problem for infinite horizon piecewise deterministic Markov decision processes (PDMDPs) with varying discount factors and unbounded transition rates. Different from the usual expected total rewards, we aim to minimize the risk probability that the total rewards do not exceed a given target value. Under the condition of the controlled state process being non-explosive is slightly weaker than the corresponding ones in the previous literature,...

Page 1

Download Results (CSV)