首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Nonatomic total rewards Markov decision processes with multiple criteria
Authors:Eugene A Feinberg  Aleksey B Piunovskiy
Institution:a Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY 11794-3600, USA
b Department of Mathematical Sciences, The University of Liverpool, M & O Building, Liverpool, L69 7ZL, UK
Abstract:We consider a Markov decision process with an uncountable state space for which the vector performance functional has the form of expected total rewards. Under the single condition that initial distribution and transition probabilities are nonatomic, we prove that the performance space coincides with that generated by nonrandomized Markov policies. We also provide conditions for the existence of optimal policies when the goal is to maximize one component of the performance vector subject to inequality constraints on other components. We illustrate our results with examples of production and financial problems.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号