A superharmonic approach to solving infinite horizon partially observable Markov decision problems |
| |
Authors: | D. J. White |
| |
Affiliation: | (1) Faculty of Economic and Social Studies, Department of Decision Theory, University of Manchester, M 13 9PL Manchester, Great Britain |
| |
Abstract: | In this paper we use an approach which uses a superharmonic property of a sequence of functions generated by an algorithm to show that these functions converge in a non-increasing manner to the optimal value function for our problem, and bounds are given for the loss of optimality if the computational process is terminated at any iteration. The basic procedure is to add an additional linear term at each iteration, selected by solving a particular optimisation problem, for which primal and dual linear programming formulations are given. |
| |
Keywords: | Markov decision processes partially observable linear programming |
本文献已被 SpringerLink 等数据库收录! |
|