aSchool of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou, 510275, PR China
Abstract:
This paper deals with discrete-time Markov decision processes with state-dependent discount factors and unbounded rewards/costs. Under general conditions, we develop an iteration algorithm for computing the optimal value function, and also prove the existence of optimal stationary policies. Furthermore, we illustrate our results with a cash-balance model.