首页 | 本学科首页   官方微博 | 高级检索  
     


Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices
Authors:Marc Ricordeau,Michel Liqui  re
Affiliation:

aLaboratoire d’Informatique, de Robotique et de Micro-électronique de Montpellier, 161 rue Ada, 34392 Montpellier Cedex 5, France

Abstract:The generalization of policies in reinforcement learning is a main issue, both from the theoretical model point of view and for their applicability. However, generalizing from a set of examples or searching for regularities is a problem which has already been intensively studied in machine learning. Thus, existing domains such as Inductive Logic Programming have already been linked with reinforcement learning. Our work uses techniques in which generalizations are constrained by a language bias, in order to regroup similar states. Such generalizations are principally based on the properties of concept lattices. To guide the possible groupings of similar states of the environment, we propose a general algebraic framework, considering the generalization of policies through a partition of the set of states and using a language bias as an a priori knowledge. We give a practical application as an example of our theoretical approach by proposing and experimenting a bottom-up algorithm.
Keywords:Reinforcement learning   Galois lattices
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号