首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Optimal threshold probability and expectation in semi-Markov decision processes
Authors:Masahiko Sakaguchi
Institution:Department of Mathematics, Faculty of Science, Kochi University, Kochi 780-8520, Japan
Abstract:We consider undiscounted semi-Markov decision process with a target set and our main concern is a problem minimizing threshold probability. We formulate the problem as an infinite horizon case with a recurrent class. We show that an optimal value function is a unique solution to an optimality equation and there exists a stationary optimal policy. Also several value iteration methods and a policy improvement method are given in our model. Furthermore, we investigate a relationship between threshold probabilities and expectations for total rewards.
Keywords:Semi-Markov decision process  Optimal threshold probability  Existence of optimal policy  Value iteration  Policy improvement method  Stochastic order
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号