Optimal threshold probability and expectation in semi-Markov decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Optimal threshold probability and expectation in semi-Markov decision processes

Authors:	Masahiko Sakaguchi

Institution:	Department of Mathematics, Faculty of Science, Kochi University, Kochi 780-8520, Japan

Abstract:	We consider undiscounted semi-Markov decision process with a target set and our main concern is a problem minimizing threshold probability. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists a stationary optimal policy. Also several value iteration methods and a policy improvement method are given in our model. Furthermore, we investigate a relationship between threshold probabilities and expectations for total rewards.

Keywords:	Semi-Markov decision process Optimal threshold probability Existence of optimal policy Value iteration Policy improvement method Stochastic order
本文献已被 ScienceDirect 等数据库收录！