STRUCTURE OF OPTIMAL POLICIES FOR DISCOUNTED SEMI-MARKOV DECISION PROGRAMMING WITH UNBOUNDED REWARDS
Authors: 董泽清 (Dong Zeqing), 刘克 (Liu Ke)
Affiliation: Institute of Applied Mathematics, Academia Sinica, Beijing (both authors)
Abstract: This paper discusses the structure of optimal policies for discounted semi-Markov decision programming with unbounded rewards, {S, (A(i), i∈S), q, t, r, V_α}. The state space S is a countable set. In each state i∈S the available action set A(i) is an arbitrary set, and (A(i), 𝒜(i)) is a measurable space. q is a time-homogeneous family of state-jump laws; t is a family of distributions of the state-jump times, which likewise depends only on the current state and the current action; V_α is the α-discounted total expected reward.
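To make the roles of q, t, r and V_α concrete, the following is a minimal sketch of evaluating V_α for a fixed stationary policy in a toy model. It is not the paper's setting or method: it assumes a finite state space, bounded lump-sum rewards collected at the start of each sojourn, and exponentially distributed jump times, so that the expected discount over one sojourn is rate / (rate + α). All names (`evaluate_policy`, `laplace_exp`, the dictionaries `q`, `rate`, `r`) are hypothetical.

```python
from typing import Dict, Tuple

State = int
Action = str
Key = Tuple[State, Action]


def laplace_exp(rate: float, alpha: float) -> float:
    """E[exp(-alpha * tau)] for tau ~ Exponential(rate) (assumed jump-time law)."""
    return rate / (rate + alpha)


def evaluate_policy(
    policy: Dict[State, Action],
    q: Dict[Key, Dict[State, float]],  # q[(i, a)][j]: probability of jumping from i to j
    rate: Dict[Key, float],            # rate of the exponential jump time in (i, a)
    r: Dict[Key, float],               # lump-sum reward collected on entering (i, a)
    alpha: float,
    sweeps: int = 200,
) -> Dict[State, float]:
    """Approximate V_alpha of a stationary policy by successive approximation."""
    v = {i: 0.0 for i in policy}
    for _ in range(sweeps):
        # V(i) = r(i, a) + E[e^{-alpha * tau}] * sum_j q(j | i, a) * V(j), with a = policy[i]
        v = {
            i: r[i, a]
            + laplace_exp(rate[i, a], alpha)
            * sum(p * v[j] for j, p in q[i, a].items())
            for i, a in policy.items()
        }
    return v


# Toy two-state model: the process alternates between states 0 and 1.
policy = {0: "a", 1: "a"}
q = {(0, "a"): {1: 1.0}, (1, "a"): {0: 1.0}}
rate = {(0, "a"): 1.0, (1, "a"): 1.0}
r = {(0, "a"): 1.0, (1, "a"): 0.0}

v = evaluate_policy(policy, q, rate, r, alpha=1.0)
print(v)
```

With rate 1 and α = 1 the per-sojourn discount is 1/2, so the fixed-point equations are V(0) = 1 + V(1)/2 and V(1) = V(0)/2. The iteration converges here because the rewards are bounded and the discount is strictly below 1; the unbounded-reward, countable-state case treated in the paper needs additional conditions that this sketch does not model.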

This paper is indexed in CNKI and other databases.