EXISTENCE OF OPTIMAL POLICY FOR TIME NON-HOMOGENEOUS DISCOUNTED MARKOVIAN DECISION PR0GRAMMING 期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

EXISTENCE OF OPTIMAL POLICY FOR TIME NON-HOMOGENEOUS DISCOUNTED MARKOVIAN DECISION PR0GRAMMING

作者姓名：	郭世贞董泽清

作者单位：	Kunming Institute of Technology，Institute of Applied Mathematics，Academia Sinica

摘要：	In this paper we discuss the discrete, time non--homogeneous discounted Markovian decisionprogramming, where the state space and all action sets are countable. Suppose that the optimumvalue function is finite. We give the necessary and sufficient conditions for the existence of anoptimal policy. Suppose that the absolute mean of rewards is relatively bounded. We also give thenecessary and sufficient conditions for the existence of an optimal policy.
本文献已被 CNKI 等数据库收录！