首页 | 本学科首页   官方微博 | 高级检索  
     检索      

连续时间Markov决策过程的均值-方差优化问题
引用本文:叶柳儿,;黄香香.连续时间Markov决策过程的均值-方差优化问题[J].中国科学:数学,2014,44(8):883-898.
作者姓名:叶柳儿  ;黄香香
作者单位:暨南大学经济学院统计学系, 广州510632;
中山大学数学与计算科学学院, 广州510275
基金项目:国家自然科学基金(批准号:11201182); 中央高校基本科研业务费专项资金(批准号:21612314); 中山大学广东省计算科学重点实验室开放基金(批准号:201206010)资助项目
摘    要:本文考虑连续时间Markov决策过程折扣模型的均值-方差优化问题.假设状态空间和行动空间均为Polish空间,转移率和报酬率函数均无界.本文的优化目标是在折扣最优平稳策略类里,选取相应方差最小的策略.本文致力于寻找Polish空间下Markov决策过程均值-方差最优策略存在的条件.利用首次进入分解方法,本文证明均值-方差优化问题可以转化为"等价"的期望折扣优化问题,进而得到关于均值-方差优化问题的"最优方程"和均值-方差最优策略的存在性以及它相应的特征.最后,本文给出若干例子说明折扣最优策略的不唯一性和均值-方差最优策略的存在性.

关 键 词:连续时间  Markov决策过程  折扣最优化  方差最小  均值-方差最优策略

A mean-variance optimization problem for continuous-time Markov decision processes
YE LiuEr,HUANG XiangXiang.A mean-variance optimization problem for continuous-time Markov decision processes[J].Scientia Sinica Mathemation,2014,44(8):883-898.
Authors:YE LiuEr  HUANG XiangXiang
Institution:YE LiuEr , HUANG XiangXiang
Abstract:This paper deals with the mean-variance optimization problem for continuous-time Markov decision processes in Polish spaces. The transition and reward rates are allowed to be unbounded, and the paper focuseson an optimality criterion that improves the usual we aim to find the conditions for the existence of a excepted (or mean) discounted reward criterion. Especially mean-variance optimal policy under the Polish spaces. First under suitable conditions, we prove that the variance minimization problem can be transformed into an equivalent "discounted-cost" optimization problem by using the so-called "first passage decomposition method". Then, we obtain the so-called mean-variance optimality equation and the existence of a mean-variance optimal policy that minimizes the variance over the set of policies with optimal reward. Finally, we present some examples to illustrate our results.
Keywords:continuous-time Markov decision processes  discounted optimality  variance minimization~ mean-variance optimal policy
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《中国科学:数学》浏览原始摘要信息
点击此处可从《中国科学:数学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号