Continuous time Markovian decision processes average return criterion |
| |
Authors: | Prasadarao Kakumanu |
| |
Affiliation: | Polytechnic Institute of New York, 333 Jay Street, Brooklyn, New York, USA |
| |
Abstract: | Continuous time Markovian decision models with countable state space are investigated. The existence of an optimal stationary policy is established for the expected average return criterion function. It is shown that the expected average return can be expressed as an expected discounted return of a related Markovian decision process. A policy iteration method is given which converges to an optimal deterministic policy, the policy so obtained is shown optimal over all Markov policies. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|