Approximation and estimation in Markov control processes under a discounted criterion
Kybernetika (2004)
- Volume: 40, Issue: 6, page [681]-690
- ISSN: 0023-5954
Access Full Article
topAbstract
topHow to cite
topReferences
top- Cavazos-Cadena R., 10.1007/BF01102341, J. Optim. Theory Appl. 65 (1990), 191–207 (1990) Zbl0699.93053MR1051545DOI10.1007/BF01102341
- Devroye L., Gyorfi L., Nonparametric Density Estimation the View, Wiley, New York 1985 MR0780746
- Dynkin E. B., Yushkevich A. A., Controlled Markov Processes, Springer–Verlag, New York 1979 MR0554083
- Gordienko E. I., Adaptive strategies for certain classes of controlled Markov processes, Theory Probab. Appl. 29 (1985), 504–518 (1985) Zbl0577.93067
- Gordienko E. I., Minjárez-Sosa J. A., Adaptive control for discrete-time Markov processes with unbounded costs: discounted criterion, Kybernetika 34 (1998), 217–234 (1998) MR1621512
- Hasminskii R., Ibragimov I., 10.1214/aos/1176347736, Ann. Statist. 18 (1990), 999–1010 (1990) Zbl0705.62039MR1062695DOI10.1214/aos/1176347736
- Hernández-Lerma O., Adaptive Markov Control Processes, Springer–Verlag, New York 1989 MR0995463
- Hernández-Lerma O., Cavazos-Cadena R., 10.1007/BF00049572, Acta Appl. Math. 20 (1990), 285–307 (1990) MR1081591DOI10.1007/BF00049572
- Hernández-Lerma O., Lasserre J. B., Discrete-Time Markov Control Processes: Basic Optimality Criteria, Springer–Verlag, New York 1996 Zbl0840.93001MR1363487
- Hernández-Lerma O., Lasserre J. B., Further Topics on Discrete-Time Markov Control Processes, Springer–Verlag, New York 1999 Zbl0928.93002MR1697198
- Hernández-Lerma O., Marcus S. I., 10.1016/0167-6911(87)90055-7, Systems Control Lett. 9 (1987), 307–315 (1987) Zbl0637.93075MR0912683DOI10.1016/0167-6911(87)90055-7
- Hilgert N., Minjárez-Sosa J. A., 10.1007/s001860100170, Math. Methods Oper. Res. 54 (2001), 491–505 Zbl1042.93065MR1890916DOI10.1007/s001860100170
- Schäl M., 10.1007/BF00532612, Z. Wahrs. Verw. Gerb. 32 (1975), 179–196 (1975) MR0378841DOI10.1007/BF00532612
Citations in EuDML Documents
top- Beatris A. Escobedo-Trujillo, Carmen G. Higuera-Chan, Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs
- Yofre H. García, Saul Diaz-Infante, J. Adolfo Minjárez-Sosa, Partially observable queueing systems with controlled service rates under a discounted optimality criterion
- E. Everardo Martinez-Garcia, J. Adolfo Minjárez-Sosa, Oscar Vega-Amaya, Partially observable Markov decision processes with partially observable random discount factors