首页 | 本学科首页   官方微博 | 高级检索  
     


Constrained markov decision processes with compact state and action spaces: the average case
Abstract:
Constrained Markov decision processes with compact state and action spaces are studied under long-run average reward or cost criteria. By introducing a corresponding Lagrange function, a saddle-point theorem is given, by which the existence of a constrained optimal pair of initial state distribution and policy is shown. Also, under the hypothesis of Doeblin, the functional characterization of a constrained optimal policy is obtained
Keywords:Constrained Markov Decision Processes  Average Criteria  Compact State And Action Spaces  Lagrange Technique  Saddle-Point  Constrained Optimal Pair
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号