Discounted Markov games; successive approximation and stopping times
Authors:J. van der Wal
Affiliation:1. Dept. of Mathematics, Eindhoven University of Technology, P.O. Box 513, NL-Eindhoven
Abstract:This paper presents a number of successive approximation algorithms for the repeated two-person zero-sum game known as a Markov game, using the criterion of total expected discounted rewards. As Wessels [1977] did for Markov decision processes, stopping times are introduced in order to simplify the proofs. It is shown that each algorithm provides upper and lower bounds for the value of the game, as well as nearly optimal stationary strategies for both players.
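To make the idea of successive approximation with bounds concrete, the following is a minimal sketch and not the paper's own algorithms: it uses plain Shapley-style value iteration, with no stopping times, and standard MacQueen-type bounds that follow from the monotonicity and contraction of the one-step operator. Every name in it (`matrix_game_value_2x2`, `successive_approximation`, `rewards`, `trans`, `beta`) is hypothetical, and it assumes each player has exactly two actions in every state so that each stage game can be solved in closed form.

```python
def matrix_game_value_2x2(m):
    """Value of a 2x2 zero-sum matrix game; the row player maximizes."""
    (a, b), (c, d) = m
    lower = max(min(a, b), min(c, d))  # best pure-strategy guarantee for the row player
    upper = min(max(a, c), max(b, d))  # best pure-strategy guarantee for the column player
    if lower == upper:                 # saddle point in pure strategies
        return lower
    # No saddle point: both players mix, and the value has a closed form.
    return (a * d - b * c) / (a + d - b - c)


def successive_approximation(rewards, trans, beta, tol=1e-8, max_iter=10_000):
    """Iterate v <- T v and return (lower, upper) bounds on the game value.

    rewards[s][i][j]  : payoff to player 1 in state s under actions (i, j)
    trans[s][i][j][t] : probability of moving from state s to state t
    beta              : discount factor, assumed to satisfy 0 <= beta < 1
    """
    n = len(rewards)
    v = [0.0] * n
    lo, hi = v, v
    for _ in range(max_iter):
        new_v = []
        for s in range(n):
            # Stage game at state s: immediate reward plus discounted continuation.
            game = [[rewards[s][i][j]
                     + beta * sum(trans[s][i][j][t] * v[t] for t in range(n))
                     for j in range(2)] for i in range(2)]
            new_v.append(matrix_game_value_2x2(game))
        diffs = [x - y for x, y in zip(new_v, v)]
        scale = beta / (1.0 - beta)
        lo = [x + scale * min(diffs) for x in new_v]  # lower bound on the value
        hi = [x + scale * max(diffs) for x in new_v]  # upper bound on the value
        v = new_v
        if scale * (max(diffs) - min(diffs)) < tol:   # bounds are tight enough
            break
    return lo, hi


# Illustrative data (made up): state 0 is matching pennies, state 1 pays 1 forever.
rewards = [[[1.0, -1.0], [-1.0, 1.0]],
           [[1.0, 1.0], [1.0, 1.0]]]
trans = [[[[1.0, 0.0], [1.0, 0.0]], [[1.0, 0.0], [1.0, 0.0]]],
         [[[0.0, 1.0], [0.0, 1.0]], [[0.0, 1.0], [0.0, 1.0]]]]
lo, hi = successive_approximation(rewards, trans, beta=0.5)
```

In this toy instance both states are absorbing, so the values can be checked by hand: 0 for state 0 and 1/(1 - 0.5) = 2 for state 1, and the returned bounds bracket them. Games with more than two actions per player would need a linear program for each stage game, and nearly optimal stationary strategies could be read off from the optimal mixed strategies of the final stage games; neither is shown here.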
Keywords:
This article is indexed in SpringerLink and other databases.