Bayesian estimation of the mean holding time in average semi-Markov control processes

J. Adolfo Minjárez-Sosa; José A. Montoya

Bayesian estimation of the mean holding time in average semi-Markov control processes

J. Adolfo Minjárez-Sosa; José A. Montoya

Applicationes Mathematicae (2015)

Volume: 42, Issue: 2-3, page 205-218
ISSN: 1233-7234

Access Full Article

top

Access to full text

Full (PDF)

Abstract

top

We consider semi-Markov control models with Borel state and action spaces, possibly unbounded costs, and holding times with a generalized exponential distribution with unknown mean θ. Assuming that such a distribution does not depend on the state-action pairs, we introduce a Bayesian estimation procedure for θ, which combined with a variant of the vanishing discount factor approach yields average cost optimal policies.

How to cite

top

MLA
BibTeX
RIS

J. Adolfo Minjárez-Sosa, and José A. Montoya. "Bayesian estimation of the mean holding time in average semi-Markov control processes." Applicationes Mathematicae 42.2-3 (2015): 205-218. <http://eudml.org/doc/279949>.

@article{J2015,
abstract = {We consider semi-Markov control models with Borel state and action spaces, possibly unbounded costs, and holding times with a generalized exponential distribution with unknown mean θ. Assuming that such a distribution does not depend on the state-action pairs, we introduce a Bayesian estimation procedure for θ, which combined with a variant of the vanishing discount factor approach yields average cost optimal policies.},
author = {J. Adolfo Minjárez-Sosa, José A. Montoya},
journal = {Applicationes Mathematicae},
keywords = {semi-Markov control processes; Bayes estimator; average cost criterion; optimal policy},
language = {eng},
number = {2-3},
pages = {205-218},
title = {Bayesian estimation of the mean holding time in average semi-Markov control processes},
url = {http://eudml.org/doc/279949},
volume = {42},
year = {2015},
}

TY - JOUR
AU - J. Adolfo Minjárez-Sosa
AU - José A. Montoya
TI - Bayesian estimation of the mean holding time in average semi-Markov control processes
JO - Applicationes Mathematicae
PY - 2015
VL - 42
IS - 2-3
SP - 205
EP - 218
AB - We consider semi-Markov control models with Borel state and action spaces, possibly unbounded costs, and holding times with a generalized exponential distribution with unknown mean θ. Assuming that such a distribution does not depend on the state-action pairs, we introduce a Bayesian estimation procedure for θ, which combined with a variant of the vanishing discount factor approach yields average cost optimal policies.
LA - eng
KW - semi-Markov control processes; Bayes estimator; average cost criterion; optimal policy
UR - http://eudml.org/doc/279949
ER -

NotesEmbed ?

top

You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Language to use for this widget.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Number of notes per page

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.

Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.