
A Discounted Vector-Valued Markovian Decision Programming with Unbounded Rewards
Citation: Zhang Sheng (张升), Zhang Jihong (张继红). A Discounted Vector-Valued Markovian Decision Programming with Unbounded Rewards [J]. Journal of Yunnan University (Natural Sciences) (云南大学学报(自然科学版)), 1993, 15(3): 200-207.
Authors: Zhang Sheng (张升)  Zhang Jihong (张继红)
Institution: Yunnan University (云南大学)
Abstract: This paper investigates a discounted vector-valued Markov decision model with unbounded rewards. Optimization is carried out under the partial order determined by the convex cone generated by a set of linearly independent vectors. The existence of an optimal policy is proved, and the intrinsic structure of optimal policies is discussed. Necessary and sufficient conditions for the existence of a strongly optimal policy are given. It is also shown that self-combinations and convex combinations of optimal policies are again optimal, and that stationary policies are dominant within the class of general policies.
Keywords: Discounted Markovian Decision Programming  optimal policies  unbounded vector-valued reward
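
The cone-determined partial order mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the common convention that u precedes v when v - u lies in the cone generated by the given vectors, and it is restricted to two dimensions with two generators. The names `cone_coefficients`, `cone_leq`, `d1`, and `d2` are hypothetical.

```python
def cone_coefficients(w, d1, d2):
    """Solve w = a*d1 + b*d2 for (a, b) by Cramer's rule (2-D case only)."""
    det = d1[0] * d2[1] - d1[1] * d2[0]
    if det == 0:
        raise ValueError("d1 and d2 must be linearly independent")
    a = (w[0] * d2[1] - w[1] * d2[0]) / det
    b = (d1[0] * w[1] - d1[1] * w[0]) / det
    return a, b


def cone_leq(u, v, d1, d2, tol=1e-12):
    """Assumed convention: u <= v iff v - u is in {a*d1 + b*d2 : a, b >= 0}."""
    w = (v[0] - u[0], v[1] - u[1])
    a, b = cone_coefficients(w, d1, d2)
    return a >= -tol and b >= -tol


# Hypothetical generators of the ordering cone.
d1, d2 = (1.0, 0.0), (1.0, 1.0)

print(cone_leq((0.0, 0.0), (2.0, 1.0), d1, d2))  # (2, 1) = 1*d1 + 1*d2
print(cone_leq((0.0, 0.0), (1.0, 2.0), d1, d2))  # needs a negative coefficient
print(cone_leq((1.0, 2.0), (0.0, 0.0), d1, d2))  # reverse direction also fails
```

Under this convention the last two calls show that (0, 0) and (1, 2) are incomparable, which is why optimality with respect to a cone order is a partial-order notion rather than a total one.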
This article is indexed in CNKI, VIP (维普), and other databases.