Markov ratio decision processes |
| |
Authors: | V Aggarwal R Chandrasekaran K P K Nair |
| |
Institution: | (1) School of Administration, University of New Brunswick, Fredericton, New Brunswick, Canada;(2) School of Management, University of Texas at Dallas, Dallas, Texas |
| |
Abstract: | A finite-state Markov decision process, in which, associated with each action in each state, there are two rewards, is considered. The objective is to optimize the ratio of the two rewards over an infinite horizon. In the discounted version of this decision problem, it is shown that the optimal value is unique and the optimal strategy is pure and stationary; however, they are dependent on the starting state. Also, a finite algorithm for computing the solution is given. |
| |
Keywords: | Markov decision processes ratio rewards discounting optimal solutions algorithms |
本文献已被 SpringerLink 等数据库收录! |
|