First passage Markov decision processes with constraints and varying discount factors
Authors: Xiao WU, Xiaolong ZOU, Xianping GUO
Institution: 1. School of Mathematics and Statistics, Zhaoqing University, Zhaoqing 526061, China; 2. School of Mathematics and Computational Science, Sun Yat-sen University, Guangzhou 510275, China
Abstract: This paper studies the constrained optimality problem (COP) for first passage discrete-time Markov decision processes (DTMDPs) with denumerable state spaces and compact Borel action spaces, multiple constraints, state-dependent discount factors, and possibly unbounded costs. Using the properties of the so-called occupation measure of a policy, we show that the COP is equivalent to an (infinite-dimensional) linear program over the set of occupation measures subject to some constraints, and thereby prove the existence of an optimal policy under suitable conditions. Furthermore, using this equivalence between the COP and the linear program, we obtain an exact form of an optimal policy for the case of finite states and actions. Finally, a controlled queueing system is given as an example to illustrate our results.
Keywords: Discrete-time Markov decision process (DTMDP); constrained optimality; varying discount factor; unbounded cost
Source: Frontiers of Mathematics in China
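
The linear-programming reformulation mentioned in the abstract can be sketched for the finite case as follows. This is only an illustration under assumed notation, not the paper's own statement: S is a finite state space, B ⊂ S the target set, A(x) the finite action set at x, p(y|x,a) the transition law, α(x) ∈ [0,1) the state-dependent discount factor, γ the initial distribution, c_0 the cost to be minimized, and c_n with bounds d_n (n = 1, ..., N) the constrained costs. The cost incurred at time t is assumed to be discounted by the product α(x_0)···α(x_{t-1}) and accumulated until the first passage time τ_B into B.

```latex
% Occupation measure of a policy \pi on B^c \times A (assumed definition):
\eta^{\pi}(x,a) \;=\; \mathbb{E}^{\pi}_{\gamma}\!\left[\,\sum_{t=0}^{\tau_B-1}
   \Bigl(\prod_{k=0}^{t-1}\alpha(x_k)\Bigr)\,
   \mathbf{1}\{x_t = x,\; a_t = a\}\right],
\qquad x \in B^{c},\; a \in A(x).

% Linear program over occupation measures (finite states and actions):
\begin{aligned}
\min_{\eta \ge 0}\quad
  & \sum_{x \in B^{c}} \sum_{a \in A(x)} c_0(x,a)\,\eta(x,a) \\
\text{s.t.}\quad
  & \sum_{x \in B^{c}} \sum_{a \in A(x)} c_n(x,a)\,\eta(x,a) \;\le\; d_n,
    \qquad n = 1,\dots,N, \\
  & \sum_{a \in A(y)} \eta(y,a)
    \;-\; \sum_{x \in B^{c}} \sum_{a \in A(x)} \alpha(x)\,p(y \mid x,a)\,\eta(x,a)
    \;=\; \gamma(y),
    \qquad y \in B^{c}.
\end{aligned}

% One standard way to recover a randomized stationary policy from a solution \eta^{*}:
\varphi^{*}(a \mid x) \;=\;
  \frac{\eta^{*}(x,a)}{\sum_{a' \in A(x)} \eta^{*}(x,a')}
\quad \text{whenever } \sum_{a' \in A(x)} \eta^{*}(x,a') > 0 .
```

The equality constraint is the balance equation obtained by conditioning on the state and action one step earlier. At states where the denominator vanishes, the policy may be chosen arbitrarily. The exact form of the optimal policy derived in the paper may differ from this sketch.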