Weighted discounted Markov decision processes with perturbation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Weighted discounted Markov decision processes with perturbation

Authors:	Liu Ke

Affiliation:	(1) Institute of Applied Mathematics, the Chinese Academy of Sciences, 100080 Beijing, China

Abstract:	In this paper we consider the weighted reward discounted Markov Decision Processes orMDP's, for short, with perturbation. We give the proof of existence of an optimal simple ulti-mately deterministic policy for process We also prove that there exists a -optimalsimple ultimately deterministic policy in the perturbed weighted MDP, for all Finally weprove the following result: If is an optimal policy of then for any >0 there existsan -neighborhood B(D) such that when D1 B(D), is a -optimal policy of

Keywords:	Markov decision processes perturbation optimal policy
本文献已被 CNKI SpringerLink 等数据库收录！