Controlled jump processes |
| |
Authors: | Stanley R Pliska |
| |
Institution: | Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, Ill. 60201, U.S.A. |
| |
Abstract: | Finite and infinite planning horizon Markov decision problems are formulated for a class of jump processes with general state and action spaces and controls which are measurable functions on the time axis taking values in an appropriate metrizable vector space. For the finite horizon problem, the maximum expected reward is the unique solution, which exists, of a certain differential equation and is a strongly continuous function in the space of upper semi-continuous functions. A necessary and sufficient condition is provided for an admissible control to be optimal, and a sufficient condition is provided for the existence of a measurable optimal policy. For the infinite horizon problem, the maximum expected total reward is the fixed point of a certain operator on the space of upper semi-continuous functions. A stationary policy is optimal over all measurable policies in the transient and discounted cases as well as, with certain added conditions, in the positive and negative cases. |
| |
Keywords: | dynamic programming Markov processes stochastic control |
本文献已被 ScienceDirect 等数据库收录! |
|