Successive approximations for average reward Markov games

Authors:	Ir. J. van der Wal

Affiliation:	Department of Mathematics, University of Technology, Den Dolech 2, P.O. Box 513, 5600 MB Eindhoven, The Netherlands

Abstract:	This paper considers two-person zero-sum Markov games with finitely many states and actions with the criterion of average reward per unit time. Two special situations are treated and it is shown that in both cases the method of successive approximations yields an ε-band for the value of the game as well as stationary ε-optimal strategies. In the first case all underlying Markov chains of pure stationary optimal strategies are assumed to be unichained. In the second case it is assumed that the functional equation Uv = v + ge has a solution.
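
The idea of an ε-band can be illustrated with a minimal sketch. It is not the paper's exact scheme. It assumes that U maps v to the vector whose s-th entry is the value of the matrix game with entries r(s,a,b) + Σ_t p(s,a,b,t) v(t), that each player has exactly two actions per state (so each matrix game has a closed-form value), and that the iteration starts from v = 0. The names `value_2x2`, `apply_U`, `gain_band`, `r`, `p` and the toy data are all hypothetical. The band uses the fact that, when Uv = v + ge has a solution, min(Uv − v) ≤ g ≤ max(Uv − v) holds for every v.

```python
def value_2x2(m):
    """Value of a 2x2 zero-sum matrix game; the row player maximizes."""
    (a, b), (c, d) = m
    maximin = max(min(a, b), min(c, d))
    minimax = min(max(a, c), max(b, d))
    if maximin >= minimax:  # a pure saddle point exists
        return maximin
    # No saddle point: both players mix, and the value has a closed form.
    return (a * d - b * c) / (a + d - b - c)


def apply_U(v, r, p):
    """One successive-approximation step: (Uv)(s) is the value of the
    matrix game with entries r[s][a][b] + sum_t p[s][a][b][t] * v[t]."""
    n = len(v)
    out = []
    for s in range(n):
        m = [[r[s][a][b] + sum(p[s][a][b][t] * v[t] for t in range(n))
              for b in range(2)] for a in range(2)]
        out.append(value_2x2(m))
    return out


def gain_band(r, p, eps=1e-6, max_iter=10_000):
    """Iterate v <- Uv from v = 0 until max(Uv - v) - min(Uv - v) < eps.
    Returns (lo, hi), an interval of width < eps containing the gain g."""
    v = [0.0] * len(r)
    for _ in range(max_iter):
        w = apply_U(v, r, p)
        diff = [wi - vi for wi, vi in zip(w, v)]
        lo, hi = min(diff), max(diff)
        if hi - lo < eps:
            return lo, hi
        v = w
    raise RuntimeError("band did not shrink below eps within max_iter steps")


# Toy data (hypothetical): two states, and the next state is uniform
# regardless of the current state and the actions chosen.
r = [[[1.0, -1.0], [-1.0, 1.0]],   # state 0: matching-pennies rewards
     [[3.0, 1.0], [2.0, 0.0]]]     # state 1: game with a pure saddle point
q = [0.5, 0.5]
p = [[[q, q], [q, q]] for _ in range(2)]  # p[s][a][b] is a distribution over t

lo, hi = gain_band(r, p)
print(lo, hi)
```

In this toy example the one-shot game values are 0 and 1 and the next state is uniform, so the gain is 0.5 and the band closes around it. The sketch does not extract the stationary ε-optimal strategies mentioned in the abstract, and with more than two actions per player each matrix game would need a linear program instead of the closed form.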
