On strategy improvement algorithms for simple stochastic games期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

On strategy improvement algorithms for simple stochastic games

Authors:	Rahul Tripathi Elena Valkanova V.S. Anil Kumar

Affiliation:	^aDepartment of Computer Science and Engineering, University of South Florida, Tampa, FL 33620, USA;^bDepartment of Computer Science and Virginia Bioinformatics Institute, Virginia Tech., Blacksburg, VT 24061, USA

Abstract:	The study of simple stochastic games (SSGs) was initiated by Condon for analyzing the computational power of randomized space-bounded alternating Turing machines. The game is played by two players, MAX and MIN, on a directed multigraph, and when the play terminates at a sink vertex s, MAX wins from MIN a payoff p(s)∈[0,1]. Condon proved that the problem SSG-VALUE—given a SSG, determine whether the expected payoff won by MAX is greater than 1/2 when both players use their optimal strategies—is in NP∩coNP. However, the exact complexity of this problem remains open, as it is not known whether the problem is in P or is hard for some natural complexity class. In this paper, we study the computational complexity of a strategy improvement algorithm by Hoffman and Karp for this problem. The Hoffman–Karp algorithm converges to optimal strategies of a given SSG, but no non-trivial bounds were previously known on its running time. We prove a bound of O(n²/n) on the convergence time of the Hoffman–Karp algorithm, and a bound of O(²0.78n) on a randomized variant. These are the first non-trivial upper bounds on the convergence time of these strategy improvement algorithms.

Keywords:	Algorithms Computational complexity Stochastic games Strategy improvement algorithms Optimal strategies
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏