Successive approximations for average reward Markov games |
| |
Authors: | Ir. J. van der Wal |
| |
Affiliation: | 1. Department of Mathematics, University of Technology, Den Dolech 2, P.O. Box 513, 5600, MB Eindhoven, The Netherlands
|
| |
Abstract: | This paper considers two-person zero-sum Markov games with finitely many states and actions with the criterion of average reward per unit time. Two special situations are treated and it is shown that in both cases the method of successive approximations yields anε-band for the value of the game as well as stationaryε-optimal strategies. In the first case all underlying Markov chains of pure stationary optimal strategies are assumed to be unichained. In the second case it is assumed that the functional equation Uv=v+ge has a solution. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|