Markov ratio decision processes
Authors: V. Aggarwal, R. Chandrasekaran, K. P. K. Nair
Institution: (1) School of Administration, University of New Brunswick, Fredericton, New Brunswick, Canada; (2) School of Management, University of Texas at Dallas, Dallas, Texas
Abstract: A finite-state Markov decision process is considered in which each action in each state has two associated rewards. The objective is to optimize the ratio of the two rewards over an infinite horizon. For the discounted version of this decision problem, it is shown that the optimal value is unique and the optimal strategy is pure and stationary; both, however, depend on the starting state. A finite algorithm for computing the solution is also given.
Keywords: Markov decision processes; ratio rewards; discounting; optimal solutions; algorithms
This article is indexed in SpringerLink and other databases.
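
Illustrative sketch (not the authors' algorithm, which the abstract does not describe): since the abstract states that an optimal strategy is pure and stationary, a small instance can be solved by enumerating every pure stationary policy and, for a given starting state, comparing the ratio of the two discounted values. The toy data, the function names, the choice to maximize, and the assumption that the second reward is strictly positive are all hypothetical and chosen only for illustration.

```python
from fractions import Fraction
from itertools import product

def solve_linear(A, b):
    """Solve A x = b by Gauss-Jordan elimination (exact with Fractions)."""
    n = len(b)
    M = [[Fraction(x) for x in row] + [Fraction(b[i])] for i, row in enumerate(A)]
    for col in range(n):
        pivot = next(k for k in range(col, n) if M[k][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [x / pv for x in M[col]]
        for k in range(n):
            if k != col and M[k][col] != 0:
                f = M[k][col]
                M[k] = [x - f * y for x, y in zip(M[k], M[col])]
    return [M[i][n] for i in range(n)]

def discounted_value(P, r, policy, beta):
    """Discounted value of a pure stationary policy: solve (I - beta * P_pi) v = r_pi."""
    n = len(policy)
    A = [[(1 if i == j else 0) - beta * P[i][policy[i]][j] for j in range(n)]
         for i in range(n)]
    b = [r[i][policy[i]] for i in range(n)]
    return solve_linear(A, b)

def best_pure_stationary(P, r1, r2, beta, start):
    """Enumerate pure stationary policies; return (best ratio, policy) for `start`.

    Assumes r2 > 0 everywhere, so the denominator value is positive.
    """
    best = None
    for policy in product(*(range(len(actions)) for actions in P)):
        v1 = discounted_value(P, r1, policy, beta)[start]
        v2 = discounted_value(P, r2, policy, beta)[start]
        ratio = v1 / v2
        if best is None or ratio > best[0]:
            best = (ratio, policy)
    return best

# Hypothetical toy instance: P[s][a][s2] is a transition probability,
# r1[s][a] and r2[s][a] are the numerator and denominator rewards.
beta = Fraction(1, 2)
P = [
    [[0, 1]],          # state 0: one action, moves to state 1
    [[0, 1], [0, 1]],  # state 1: two actions, both stay in state 1
]
r1 = [[0], [1, 3]]
r2 = [[10], [1, 4]]

for s in range(len(P)):
    ratio, policy = best_pure_stationary(P, r1, r2, beta, s)
    print(f"start={s}: best ratio={ratio}, policy={policy}")
```

In this toy instance the best action in state 1 differs depending on whether the process starts in state 0 or state 1, which illustrates the dependence on the starting state mentioned in the abstract. Enumeration grows exponentially with the number of states, so it is practical only for very small problems.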