A new strong optimality criterion for nonstationary Markov decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

A new strong optimality criterion for nonstationary Markov decision processes

Authors:	Xianping Guo Peng Shi Weiping Zhu

Institution:	(1) Department of Mathematics, Zhongshan University, Guangzhou 510275, P. R. China (e-mail: mcsgxp@zsu.edu.cn), CN;(2) Land Operations Division, Defence Science and Technology Organisation, PO Box 1500, Salisbury 5108 SA, Australia (e-mail: peng.shi@dsto.defence.gov.au), AU;(3) Department of Computer Science and Electrical Engineering, The University of Queensland, St. Lucia 4072, QLD, Australia, AU

Abstract:	This paper deals with a new optimality criterion consisting of the usual three average criteria and the canonical triplet (totally so-called strong average-canonical optimality criterion) and introduces the concept of a strong average-canonical policy for nonstationary Markov decision processes, which is an extension of the canonical policies of Herna′ndez-Lerma and Lasserre 16] (pages: 77) for the stationary Markov controlled processes. For the case of possibly non-uniformly bounded rewards and denumerable state space, we first construct, under some conditions, a solution to the optimality equations (OEs), and then prove that the Markov policies obtained from the OEs are not only optimal for the three average criteria but also optimal for all finite horizon criteria with a sequence of additional functions as their terminal rewards (i.e. strong average-canonical optimal). Also, some properties of optimal policies and optimal average value convergence are discussed. Moreover, the error bound in average reward between a rolling horizon policy and a strong average-canonical optimal policy is provided, and then a rolling horizon algorithm for computing strong average ε(>0)-optimal Markov policies is given.

Keywords:	: Nonstationary Markov decision processes optimality equations strong average-canonical optimal policies
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏