Continuous time Markovian decision processes average return criterion期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Continuous time Markovian decision processes average return criterion

Authors:	Prasadarao Kakumanu

Affiliation:	Polytechnic Institute of New York, 333 Jay Street, Brooklyn, New York, USA

Abstract:	Continuous time Markovian decision models with countable state space are investigated. The existence of an optimal stationary policy is established for the expected average return criterion function. It is shown that the expected average return can be expressed as an expected discounted return of a related Markovian decision process. A policy iteration method is given which converges to an optimal deterministic policy, the policy so obtained is shown optimal over all Markov policies.

Keywords:
本文献已被 ScienceDirect 等数据库收录！