An extended version of average Markov decision processes on discrete spaces under fuzzy environment

Hugo Cruz-Suárez; Raúl Montes-de-Oca; R. Israel Ortega-Gutiérrez

Kybernetika (2023)

Volume: 59, Issue: 1, page 160-178
ISSN: 0023-5954

Abstract

The article extends the theory of standard Markov decision processes on discrete spaces, with the average cost as the objective function, to allow for a fuzzy average cost of trapezoidal type. In this context, the fuzzy optimal control problem is considered with respect to two orders: the max-order of fuzzy numbers and the average ranking order of trapezoidal fuzzy numbers. Each case extends the standard optimal control problem, and in each the optimal solution is related to a suitable standard optimal control problem. It is shown that (i) the optimal policy coincides with the optimal policy of this standard control problem, and (ii) the fuzzy optimal value function has a trapezoidal shape. Two models, a queueing system and a machine replacement problem, are provided to exemplify the theory.
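
As a rough illustration of the two orders named in the abstract, the Python sketch below represents a trapezoidal fuzzy number by its four defining points and compares such numbers in two ways. It is not taken from the article: the names `Trapezoid`, `max_order_leq`, `average_rank` and `mean_cost` are hypothetical, and the ranking formula (the mean of the four points) is an assumed, commonly used choice that may differ from the definition in the paper. The max-order check relies on the fact that the endpoints of the alpha-cuts of a trapezoidal number are linear in alpha, so comparing the four points suffices.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Trapezoid:
    """Trapezoidal fuzzy number with support [a, d] and core [b, c]."""
    a: float
    b: float
    c: float
    d: float

    def __post_init__(self) -> None:
        if not (self.a <= self.b <= self.c <= self.d):
            raise ValueError("need a <= b <= c <= d")

    def alpha_cut(self, alpha: float) -> Tuple[float, float]:
        """Interval of points with membership >= alpha, for 0 < alpha <= 1.

        With alpha = 0 the formula returns the support [a, d].
        """
        lower = self.a + alpha * (self.b - self.a)
        upper = self.d - alpha * (self.d - self.c)
        return (lower, upper)


def max_order_leq(x: Trapezoid, y: Trapezoid) -> bool:
    """Max-order: every alpha-cut endpoint of x is <= that of y.

    This is only a partial order, so two numbers may be incomparable.
    """
    return x.a <= y.a and x.b <= y.b and x.c <= y.c and x.d <= y.d


def average_rank(x: Trapezoid) -> float:
    """ASSUMPTION: ranking value taken as the mean of the four points."""
    return (x.a + x.b + x.c + x.d) / 4.0


def mean_cost(costs: Sequence[Trapezoid]) -> Trapezoid:
    """Average of trapezoidal costs, computed point by point.

    Sums and positive multiples of trapezoidal numbers are again
    trapezoidal, so the average keeps the trapezoidal shape.
    """
    n = len(costs)
    return Trapezoid(
        sum(t.a for t in costs) / n,
        sum(t.b for t in costs) / n,
        sum(t.c for t in costs) / n,
        sum(t.d for t in costs) / n,
    )


if __name__ == "__main__":
    x = Trapezoid(1, 2, 3, 4)
    z = Trapezoid(0, 3, 3, 5)
    # x and z are incomparable in the max-order ...
    print(max_order_leq(x, z), max_order_leq(z, x))
    # ... but the (assumed) ranking value always orders them.
    print(average_rank(x), average_rank(z))
```

Under these assumptions the max-order can leave two costs incomparable, while a real-valued ranking always yields a comparison; this is one way to read why the abstract treats the two cases separately.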

How to cite

Cruz-Suárez, Hugo, Montes-de-Oca, Raúl, and Ortega-Gutiérrez, R. Israel. "An extended version of average Markov decision processes on discrete spaces under fuzzy environment." Kybernetika 59.1 (2023): 160-178. <http://eudml.org/doc/299093>.

@article{Cruz2023,
abstract = {The article extends the theory of standard Markov decision processes on discrete spaces, with the average cost as the objective function, to allow for a fuzzy average cost of trapezoidal type. In this context, the fuzzy optimal control problem is considered with respect to two orders: the max-order of fuzzy numbers and the average ranking order of trapezoidal fuzzy numbers. Each case extends the standard optimal control problem, and in each the optimal solution is related to a suitable standard optimal control problem. It is shown that (i) the optimal policy coincides with the optimal policy of this standard control problem, and (ii) the fuzzy optimal value function has a trapezoidal shape. Two models, a queueing system and a machine replacement problem, are provided to exemplify the theory.},
author = {Cruz-Suárez, Hugo and Montes-de-Oca, Raúl and Ortega-Gutiérrez, R. Israel},
journal = {Kybernetika},
keywords = {Markov decision process; average criterion; trapezoidal fuzzy cost; max-order; average ranking},
language = {eng},
number = {1},
pages = {160-178},
publisher = {Institute of Information Theory and Automation AS CR},
title = {An extended version of average Markov decision processes on discrete spaces under fuzzy environment},
url = {http://eudml.org/doc/299093},
volume = {59},
year = {2023},
}

TY - JOUR
AU - Cruz-Suárez, Hugo
AU - Montes-de-Oca, Raúl
AU - Ortega-Gutiérrez, R. Israel
TI - An extended version of average Markov decision processes on discrete spaces under fuzzy environment
JO - Kybernetika
PY - 2023
PB - Institute of Information Theory and Automation AS CR
VL - 59
IS - 1
SP - 160
EP - 178
AB - The article extends the theory of standard Markov decision processes on discrete spaces, with the average cost as the objective function, to allow for a fuzzy average cost of trapezoidal type. In this context, the fuzzy optimal control problem is considered with respect to two orders: the max-order of fuzzy numbers and the average ranking order of trapezoidal fuzzy numbers. Each case extends the standard optimal control problem, and in each the optimal solution is related to a suitable standard optimal control problem. It is shown that (i) the optimal policy coincides with the optimal policy of this standard control problem, and (ii) the fuzzy optimal value function has a trapezoidal shape. Two models, a queueing system and a machine replacement problem, are provided to exemplify the theory.
LA - eng
KW - Markov decision process; average criterion; trapezoidal fuzzy cost; max-order; average ranking
UR - http://eudml.org/doc/299093
ER -
