Optimal threshold probability and expectation in semi-Markov decision processes |
| |
Authors: | Masahiko Sakaguchi |
| |
Institution: | Department of Mathematics, Faculty of Science, Kochi University, Kochi 780-8520, Japan |
| |
Abstract: | We consider undiscounted semi-Markov decision process with a target set and our main concern is a problem minimizing threshold probability. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists a stationary optimal policy. Also several value iteration methods and a policy improvement method are given in our model. Furthermore, we investigate a relationship between threshold probabilities and expectations for total rewards. |
| |
Keywords: | Semi-Markov decision process Optimal threshold probability Existence of optimal policy Value iteration Policy improvement method Stochastic order |
本文献已被 ScienceDirect 等数据库收录! |