首页 | 本学科首页   官方微博 | 高级检索  
     


A reinforcement learning process in extensive form games
Authors:Jean-François Laslier  Bernard Walliser
Affiliation:(1) CNRS and Laboratoire d’Econométrie de l’Ecole Polytechnique, Paris, France;(2) CERAS, Ecole Nationale des Ponts et Chaussées, 28 rue des Saints Péres, 75007 Paris, France
Abstract:
The CPR (“cumulative proportional reinforcement”) learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.Received: October 2004
Keywords:learning  Polya process  reinforcement  subgame perfect equilibrium
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号