首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted MDPs
Authors:Eugene A Feinberg  Jefferson Huang
Institution:1. Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794-3600, USA;2. School of Operations Research and Information Engineering, Cornell University, Ithaca, NY 14853-3801, USA
Abstract:This note describes sufficient conditions under which total-cost and average-cost Markov decision processes (MDPs) with general state and action spaces, and with weakly continuous transition probabilities, can be reduced to discounted MDPs. For undiscounted problems, these reductions imply the validity of optimality equations and the existence of stationary optimal policies. The reductions also provide methods for computing optimal policies. The results are applied to a capacitated inventory control problem with fixed costs and lost sales.
Keywords:Markov decision process  Reduction  Total cost  Average cost  Discounted  Inventory
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号