
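Research and Application of Multi-Agent Reinforcement Learning in Robot Soccer
Cite this article: LIU Chun-yang, TAN Ying-qing, LIU Chang-an, MA Ying-wei. Research and Application of Multi-Agent Reinforcement Learning in Robot Soccer [J]. Acta Electronica Sinica, 2010, 38(8): 1958-1962.
Authors: LIU Chun-yang, TAN Ying-qing, LIU Chang-an, MA Ying-wei
Affiliations: 1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206; 2. School of Information Engineering, University of Science and Technology Beijing, Beijing 100083
Funding: National Natural Science Foundation of China; North China Electric Power University Young Teachers Research Fund
Abstract: This paper proposes a voting-based multi-agent reinforcement learning method that enables a team to learn to cooperate during a match, adapt automatically to the environment, and improve both real-time performance and the number of goals scored. First, joint actions, called strategies, are defined so that the cooperation problem becomes one of learning strategies, which simplifies its handling. Next, the field is partitioned and positions are represented by regions, which effectively reduces the dimensionality of the state space and speeds up learning. The environment states are then classified and only cooperative states are considered, which shrinks the state space and further speeds up learning. Voting is used to combine the decisions of the individual players and thereby achieve cooperation. Finally, experimental results demonstrate the correctness and effectiveness of the method.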

Keywords: reinforcement learning; robot soccer; multi-agent system; voting
Received: 2008-12-18

Application of Multi-Agent Reinforcement Learning in Robot Soccer
LIU Chun-yang, TAN Ying-qing, LIU Chang-an, MA Ying-wei. Application of Multi-Agent Reinforcement Learning in Robot Soccer [J]. Acta Electronica Sinica, 2010, 38(8): 1958-1962.
Authors: LIU Chun-yang, TAN Ying-qing, LIU Chang-an, MA Ying-wei
Institution: 1. School of Computer Science and Technology, North China Electric Power University, Beijing 102206, China; 2. University of Science and Technology Beijing, Beijing 100083, China
Abstract: A voting-based multi-agent reinforcement learning method is presented to solve the collaboration problem among team members. The method defines joint actions, called strategies, and thereby turns the collaboration problem into one of learning strategies, which simplifies it. The playing field is divided into numbered regions that represent location, which effectively reduces the dimensionality of the state space and speeds up learning. By distinguishing among environment states and considering only the collaboration states, the state-action space is reduced and learning is accelerated further. A voting process that combines the decisions of the agents realizes the collaboration. Finally, experimental results show the effectiveness and correctness of the method.
Keywords: reinforcement learning; robot soccer; multi-agent system; voting
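
The abstract names two mechanisms that are easy to picture in code: representing positions by numbered regions, and combining the agents' decisions by voting. The following is only a minimal sketch under stated assumptions, not the authors' implementation. The field dimensions, the 4 x 3 grid, the strategy names, the dictionary-based Q-tables, and the tie-breaking rule are all hypothetical choices made for illustration; the abstract does not specify any of them.

```python
from collections import Counter

# Hypothetical field size and grid resolution (not taken from the paper).
FIELD_LENGTH = 220.0
FIELD_WIDTH = 180.0
COLS, ROWS = 4, 3


def region_of(x, y):
    """Map a continuous position to a numbered region in 0 .. COLS*ROWS-1."""
    col = min(max(int(x / FIELD_LENGTH * COLS), 0), COLS - 1)
    row = min(max(int(y / FIELD_WIDTH * ROWS), 0), ROWS - 1)
    return row * COLS + col


def preferred_strategy(q_table, state, strategies):
    """Return the strategy with the highest learned value for this state.

    Missing entries default to 0.0; ties go to the earlier strategy in the list.
    """
    return max(strategies, key=lambda s: q_table.get((state, s), 0.0))


def vote(q_tables, state, strategies):
    """Each agent casts one ballot; the most-voted strategy is selected.

    Ties are broken by the order of `strategies` (an assumption of this sketch).
    """
    ballots = Counter(preferred_strategy(q, state, strategies) for q in q_tables)
    top = max(ballots.values())
    for s in strategies:
        if ballots.get(s, 0) == top:
            return s


if __name__ == "__main__":
    strategies = ["attack", "defend"]          # hypothetical joint actions
    state = region_of(110.0, 90.0)             # ball position mapped to a region
    q_tables = [
        {(state, "attack"): 1.0},
        {(state, "defend"): 2.0},
        {(state, "attack"): 0.5, (state, "defend"): 0.1},
    ]
    print(state, vote(q_tables, state, strategies))
```

With a grid like this, a state is a small integer rather than a pair of real coordinates, which is the kind of state-space reduction the abstract describes. How the Q-values are learned and which states count as collaboration states are left out here because the abstract gives no details.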
This article is indexed in Wanfang Data and other databases.