STRUCTURE OF OPTIMAL POLICIES FOR DISCOUNTED SEMI-MARKOV DECISION PROGRAMMING WITH UNBOUNDED REWARDS

Authors:	Dong Zeqing (董泽清), Liu Ke (刘克)

Affiliation:	Institute of Applied Mathematics, Academia Sinica, Beijing (both authors)

Abstract:	In this paper, we discuss the structure of optimal policies for discounted semi-Markov decision programming with unbounded rewards: {S, (A(i), i∈S), q, t, r, V_α}. Here the state space S is a countable set; in each state i∈S, the available action set A(i) is an arbitrary set endowed with a σ-algebra, so that it forms a measurable space; q is a time-homogeneous family of state jump laws; t is a family of distributions of the state jump times, which likewise depends only on the current state and the current action; and V_α is the α-discounted total expected reward.
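
To make the model tuple concrete, here is a minimal Python sketch of a toy instance and of evaluating V_α for one stationary policy. Everything specific in it is an assumption for illustration, not taken from the paper: the state space is finite (two states) rather than countable, rewards are bounded, r(i, a) is treated as a lump sum earned at each decision epoch, the jump-time family t is taken to be exponential with a hypothetical rate per state-action pair, and discounting is continuous at rate α, so one sojourn contributes the factor E[exp(-α·τ)] = rate / (rate + α).

```python
# Hypothetical toy semi-Markov decision model (all numbers are made up).
S = [0, 1]
A = {0: ["stay", "move"], 1: ["stay"]}

# q[(i, a)][j]: probability that the next state is j, given state i and action a.
q = {(0, "stay"): [1.0, 0.0], (0, "move"): [0.0, 1.0], (1, "stay"): [0.5, 0.5]}

# rate[(i, a)]: assumed exponential sojourn-time rate, standing in for the family t.
rate = {(0, "stay"): 1.0, (0, "move"): 2.0, (1, "stay"): 1.0}

# r[(i, a)]: assumed lump-sum reward received at the decision epoch.
r = {(0, "stay"): 1.0, (0, "move"): 0.0, (1, "stay"): 3.0}


def discount(i, a, alpha):
    """E[exp(-alpha * tau)] for an exponential sojourn time tau with rate[(i, a)]."""
    lam = rate[(i, a)]
    return lam / (lam + alpha)


def evaluate(policy, alpha, sweeps=2000):
    """Approximate V_alpha of a stationary deterministic policy by fixed-point iteration."""
    v = [0.0 for _ in S]
    for _ in range(sweeps):
        v = [
            r[(i, policy[i])]
            + discount(i, policy[i], alpha)
            * sum(q[(i, policy[i])][j] * v[j] for j in S)
            for i in S
        ]
    return v


if __name__ == "__main__":
    print(evaluate({0: "stay", 1: "stay"}, alpha=1.0))
```

Under these assumptions the iteration is a contraction, because every one-step discount factor is strictly below 1. The unbounded-reward, countable-state setting treated in the paper needs additional conditions that this sketch does not model.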
This article is indexed in CNKI and other databases.