Currently displaying 1 – 8 of 8


Nonparametric adaptive control for discrete-time Markov processes with unbounded costs under average criterion

J. Minjárez-Sosa — 1999

Applicationes Mathematicae

We introduce average cost optimal adaptive policies in a class of discrete-time Markov control processes with Borel state and action spaces, allowing unbounded costs. The processes evolve according to the system equations x_{t+1} = F(x_t, a_t, ξ_t), t = 1, 2, ..., with i.i.d. ℝ^k-valued random vectors ξ_t, which are observable but whose density ϱ is unknown.
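The setup in this abstract can be sketched in a few lines: the controller observes each disturbance ξ_t and uses the accumulated observations to estimate the unknown density. Below is a minimal illustration assuming a scalar system, a placeholder linear policy, and a Gaussian kernel density estimator; the dynamics F, the policy, and the true density are illustrative choices, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dynamics x_{t+1} = F(x_t, a_t, xi_t); F and the policy
# below are illustrative stand-ins, not the paper's model.
def F(x, a, xi):
    return 0.5 * x + a + xi

def kernel_density(samples, grid, h):
    """Gaussian kernel estimate of the unknown density from observed xi's."""
    diffs = (grid[:, None] - samples[None, :]) / h
    weights = np.exp(-0.5 * diffs**2) / np.sqrt(2.0 * np.pi)
    return weights.sum(axis=1) / (len(samples) * h)

# Simulate the controlled process; since xi_t is observable, the
# controller can refine its density estimate as data accumulate.
x = 0.0
xi_obs = []
for t in range(500):
    a = -0.5 * x                   # placeholder stationary policy
    xi = rng.normal(0.0, 1.0)      # true density: standard normal (unknown to the controller)
    x = F(x, a, xi)
    xi_obs.append(xi)

grid = np.linspace(-3.0, 3.0, 61)
h = len(xi_obs) ** (-1.0 / 5.0)    # classical n^(-1/5) bandwidth rate
rho_hat = kernel_density(np.array(xi_obs), grid, h)
print(round(grid[np.argmax(rho_hat)], 2))  # mode of the estimate, near 0
```

The estimated density would then feed into the average-cost optimality machinery the paper develops; that part is not reproduced here.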

Approximation and estimation in Markov control processes under a discounted criterion

J. Adolfo Minjárez-Sosa — 2004

Kybernetika

We consider a class of discrete-time Markov control processes with Borel state and action spaces, and ℝ^k-valued i.i.d. disturbances with unknown density ρ. Supposing possibly unbounded costs, we combine suitable density estimation methods of ρ with approximation procedures of the optimal cost function, to show the existence of a sequence {f̂_t} of minimizers converging to an optimal stationary policy f.
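The combination of estimation and cost-function approximation described here can be illustrated with discounted value iteration on a finite grid, replacing the integral against the unknown density by an average over the observed disturbances. Everything concrete below (grid, dynamics, quadratic cost, discount factor) is an assumed toy instance, not the paper's construction.

```python
import numpy as np

rng = np.random.default_rng(4)

# Observed disturbances stand in for the estimated density rho:
# expectations are approximated by sample averages (a hedged simplification).
xi_obs = rng.normal(0.0, 1.0, size=300)
states = np.linspace(-2.0, 2.0, 21)
actions = np.linspace(-1.0, 1.0, 11)
beta = 0.9                                  # discount factor (illustrative)

def step(x, a, xi):
    return np.clip(0.5 * x + a + xi, -2.0, 2.0)

def cost(x, a):
    return x**2 + a**2                      # unbounded on the real line

V = np.zeros(len(states))
for _ in range(60):                         # value iteration to near-fixed-point
    Q = np.empty((len(states), len(actions)))
    for i, x in enumerate(states):
        for j, a in enumerate(actions):
            nxt = step(x, a, xi_obs)        # next states under each observed xi
            idx = np.abs(states[None, :] - nxt[:, None]).argmin(axis=1)
            Q[i, j] = cost(x, a) + beta * V[idx].mean()
    V = Q.min(axis=1)

f_hat = actions[Q.argmin(axis=1)]           # minimizers, playing the role of f-hat_t
print(round(f_hat[len(states) // 2], 2))    # near-zero action at x = 0
```

As more disturbances are observed and the grids are refined, such minimizers are the kind of sequence {f̂_t} the abstract says converges to an optimal stationary policy.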

Bayesian estimation of the mean holding time in average semi-Markov control processes

J. Adolfo Minjárez-Sosa, José A. Montoya — 2015

Applicationes Mathematicae

We consider semi-Markov control models with Borel state and action spaces, possibly unbounded costs, and holding times with a generalized exponential distribution with unknown mean θ. Assuming that such a distribution does not depend on the state-action pairs, we introduce a Bayesian estimation procedure for θ, which combined with a variant of the vanishing discount factor approach yields average cost optimal policies.
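The Bayesian estimation step for the mean holding time can be sketched with a conjugate analysis. As a hedged stand-in, the example below uses plain exponential holding times with an inverse-gamma prior on the mean θ (the paper's generalized exponential family and its actual prior are not reproduced).

```python
import numpy as np

rng = np.random.default_rng(1)

# Assumed model: holding times ~ Exponential(mean theta), theta unknown,
# with a conjugate inverse-gamma IG(alpha0, beta0) prior on theta.
alpha0, beta0 = 2.0, 2.0
theta_true = 3.0
holding_times = rng.exponential(theta_true, size=200)

# Conjugate update: alpha_n = alpha0 + n, beta_n = beta0 + sum of observations.
n = len(holding_times)
alpha_n = alpha0 + n
beta_n = beta0 + holding_times.sum()

theta_hat = beta_n / (alpha_n - 1.0)   # posterior mean of theta
print(round(theta_hat, 2))             # close to theta_true as n grows
```

In the paper, an estimate of this kind is plugged into a vanishing-discount argument to produce average cost optimal policies; crucially, the holding-time distribution is assumed not to depend on the state-action pair, so a single posterior suffices.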

Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion

We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by the recurrence x_{t+1} = F(x_t, a_t, ξ_t), t = 0, 1, ..., with i.i.d. ℝ^k-valued random vectors ξ_t whose density ρ is unknown. Assuming observability of ξ_t, we propose a statistical estimation procedure for ρ that allows us to prove discounted asymptotic optimality of two types of adaptive policies used earlier for processes with bounded costs.

Empirical approximation in Markov games under unbounded payoff: discounted and average criteria

This work deals with a class of discrete-time zero-sum Markov games whose state process x_t evolves according to the equation x_{t+1} = F(x_t, a_t, b_t, ξ_t), where a_t and b_t represent the actions of players 1 and 2, respectively, and {ξ_t} is a sequence of independent and identically distributed random variables with unknown distribution θ. Assuming a possibly unbounded payoff, and using the empirical distribution to estimate θ, we introduce approximation schemes for the value of the game as well as for optimal strategies, considering both...
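The core device named in this abstract, replacing the unknown law θ by the empirical distribution of observed disturbances, amounts to approximating expectations under θ by sample averages. A minimal sketch, assuming an illustrative disturbance law and payoff function (neither is specified in the abstract):

```python
import numpy as np

rng = np.random.default_rng(2)

# Empirical-measure idea: substitute the empirical distribution of
# xi_1, ..., xi_n for the unknown theta when computing expectations.
xi_samples = rng.exponential(1.0, size=1000)  # illustrative theta: Exp(1)

def expected_payoff(u, samples):
    """Empirical-measure approximation of E_theta[u(xi)]."""
    return np.mean(u(samples))

# Illustrative one-stage payoff; the game's actual payoff r(x, a, b)
# is not given in the abstract.
u = lambda xi: np.minimum(xi, 2.0)
approx = expected_payoff(u, xi_samples)
print(round(approx, 2))   # true value E[min(xi, 2)] = 1 - e^{-2} ≈ 0.86
```

In the game setting, the same substitution is applied inside the Shapley/value operators, which is what yields approximations of both the value and near-optimal strategies.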

Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion

The paper deals with a class of discrete-time stochastic control processes under a discounted optimality criterion with random discount rate and possibly unbounded costs. The state process x_t and the discount process α_t evolve according to the coupled difference equations x_{t+1} = F(x_t, α_t, a_t, ξ_t), α_{t+1} = G(α_t, η_t), where the state and discount disturbance processes {ξ_t} and {η_t} are sequences of i.i.d. random variables with densities ρ_ξ and ρ_η, respectively. The main objective is to introduce approximation algorithms of the optimal...
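The coupled recursions can be simulated directly: the discount applied to the one-stage cost at time t is the running product of the realized random rates α_0, ..., α_{t-1}. The functions F and G, the policy, the cost, and both disturbance laws below are illustrative assumptions, not the paper's model.

```python
import numpy as np

rng = np.random.default_rng(3)

# Illustrative coupled dynamics x_{t+1} = F(x_t, alpha_t, a_t, xi_t),
# alpha_{t+1} = G(alpha_t, eta_t); clamped so the rate stays in [0, 0.99).
def F(x, alpha, a, xi):
    return 0.9 * x + a + xi

def G(alpha, eta):
    return min(max(0.5 * alpha + eta, 0.0), 0.99)

x, alpha = 1.0, 0.9
disc, total = 1.0, 0.0
for t in range(200):
    a = -0.5 * x                   # placeholder policy
    stage_cost = x * x + a * a     # illustrative one-stage cost
    total += disc * stage_cost     # cost discounted by the random rates so far
    disc *= alpha                  # accumulate the current random rate
    xi = rng.normal(0.0, 0.1)
    eta = rng.uniform(0.0, 0.45)
    x, alpha = F(x, alpha, a, xi), G(alpha, eta)

print(round(total, 2))             # realized randomized discounted cost
```

Because the realized rates stay bounded below 1, the discounted sum is finite; the paper's algorithms approximate the optimal value of exactly this kind of accumulated cost.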

Partially observable Markov decision processes with partially observable random discount factors

This paper deals with a class of partially observable discounted Markov decision processes defined on Borel state and action spaces, under unbounded one-stage cost. The discount rate is a stochastic process evolving according to a difference equation, which is also assumed to be partially observable. Introducing a suitable control model and filtering processes, we prove the existence of optimal control policies. In addition, we illustrate our results in a class of GI/GI/1 queueing systems where...
