首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Analysis for some properties of discrete time Markov decision processes
Abstract:This paper investigates properties of the optimality equation and optimal policies in discrete time Markov decision processes with expected discounted total rewards under weak conditions that the model is well defined and the optimality equation is true. The optimal value function is characterized as a solution of the optimality equation and the structure of optimal policies is also given.
Keywords:Discrete time  Markov decision processes  Optimality equation  Optimal policies  Expected discounted total rewards  Mathematics Subject Classifications 2000: 90C40  60J05
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号