Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices

Authors:	Marc Ricordeau, Michel Liquière

Affiliation:	Laboratoire d’Informatique, de Robotique et de Micro-électronique de Montpellier, 161 rue Ada, 34392 Montpellier Cedex 5, France

Abstract:	Generalizing policies is a central issue in reinforcement learning, both for the theoretical model and for practical applicability. Generalizing from a set of examples, or searching for regularities, is a problem that has already been studied intensively in machine learning, and existing fields such as Inductive Logic Programming have already been linked with reinforcement learning. Our work uses techniques in which generalizations are constrained by a language bias in order to group similar states. These generalizations rely mainly on the properties of concept lattices. To guide the possible groupings of similar states of the environment, we propose a general algebraic framework that treats the generalization of policies as a partition of the set of states and uses a language bias as a priori knowledge. As a practical application of this theoretical approach, we propose a bottom-up algorithm and evaluate it experimentally.
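The idea of grouping states under a language bias can be made concrete with a small sketch. The code below is an illustration only and is not the authors' algorithm: the state names, attributes, policy, and every function name are hypothetical. It uses the two standard Galois-connection operators of a formal context (shared attributes of a set of states, and states matching a set of attributes) and merges groups bottom-up whenever the closed set they generate prescribes a single action and keeps the grouping a partition.

```python
from itertools import combinations

# Hypothetical toy context: each state is described by a set of attributes.
# The attribute vocabulary plays the role of the language bias.
DESCRIPTIONS = {
    "s1": frozenset({"wall_north", "goal_east"}),
    "s2": frozenset({"wall_north", "goal_east", "wall_south"}),
    "s3": frozenset({"goal_west"}),
    "s4": frozenset({"goal_west", "wall_south"}),
}
# Hypothetical greedy action for each state.
POLICY = {"s1": "east", "s2": "east", "s3": "west", "s4": "west"}


def intent(states):
    """Attributes shared by every state in a non-empty set of states."""
    return frozenset.intersection(*(DESCRIPTIONS[s] for s in states))


def extent(attrs):
    """States whose description contains all the given attributes."""
    return frozenset(s for s, d in DESCRIPTIONS.items() if attrs <= d)


def closure(states):
    """Smallest concept extent containing the given states."""
    return extent(intent(states))


def same_action(states):
    """True if the policy prescribes one action for all the states."""
    return len({POLICY[s] for s in states}) == 1


def bottom_up_groups():
    """Merge groups of states while the merged concept stays consistent."""
    groups = {frozenset({s}) for s in DESCRIPTIONS}
    merged = True
    while merged:
        merged = False
        for a, b in combinations(sorted(groups, key=sorted), 2):
            candidate = closure(a | b)
            # Accept only candidates that are unions of whole groups,
            # so the result remains a partition of the states.
            whole = all(g <= candidate or not (g & candidate) for g in groups)
            if whole and same_action(candidate):
                groups = {g for g in groups if not g <= candidate}
                groups.add(candidate)
                merged = True
                break
    return groups


if __name__ == "__main__":
    for group in sorted(bottom_up_groups(), key=sorted):
        print(sorted(group), sorted(intent(group)))
```

In this toy setting the shared attributes of each final group give a description, in the bias language, of the states that the policy treats alike.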

Keywords:	Reinforcement learning; Galois lattices
This article is indexed in ScienceDirect and other databases.