首页 | 本学科首页   官方微博 | 高级检索  
     


Weighted discounted Markov decision processes with perturbation
Authors:Liu Ke
Affiliation:(1) Institute of Applied Mathematics, the Chinese Academy of Sciences, 100080 Beijing, China
Abstract:In this paper we consider the weighted reward discounted Markov Decision Processes orMDP's, for short, with perturbation. We give the proof of existence of an optimal simple ulti-mately deterministic policy for process We also prove that there exists a -optimalsimple ultimately deterministic policy in the perturbed weighted MDP, for all Finally weprove the following result: If is an optimal policy of then for any >0 there existsan -neighborhood B(D) such that when D1 B(D), is a -optimal policy of
Keywords:Markov decision processes  perturbation  optimal policy
本文献已被 CNKI SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号