首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Efficient approximate linear programming for factored MDPs
Institution:1. Center for Brain-Inspired Computing Research, Department of Automation, Tsinghua University, Beijing, China;2. Beijing Key Laboratory of Security in Big Data Processing and Application, Beijing, China;3. Nanjing Research Institute of Electronic Technology, Nanjing, China;4. School of Management and Engineering, Nanjing University, Nanjing, China
Abstract:Factored Markov Decision Processes (MDPs) provide a compact representation for modeling sequential decision making problems with many variables. Approximate linear programming (LP) is a prominent method for solving factored MDPs. However, it cannot be applied to models with large treewidth due to the exponential number of constraints. This paper proposes a novel and efficient approximate method to represent the exponentially many constraints. We construct an augmented junction graph from the factored MDP, and represent the constraints using a set of cluster constraints and separator constraints, where the cluster constraints play the role of reducing the number of constraints, and the separator constraints enforce the consistency of neighboring clusters so as to improve the accuracy. In the case where the junction graph is tree-structured, our method provides an equivalent representation to the original constraints. In other cases, our method provides a good trade-off between computation and accuracy. Experimental results on different models show that our algorithm performs better than other approximate linear programming algorithms on computational cost or expected reward.
Keywords:Factored MDPs  Approximate linear programming  Junction graph  Cluster constraints
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号