Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks

发明申请

US20100094786A1 Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks 有权

标题翻译：平滑Sarsa：加强学习机器人传送任务

请登陆查看更多内容

专利标题： Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks
专利标题（中）： 平滑Sarsa：加强学习机器人传送任务
申请号： US12578574

申请日： 2009-10-13
公开(公告)号： US20100094786A1

公开(公告)日： 2010-04-15
发明人: Rakesh Gupta , Deepak Ramachandran
申请人： Rakesh Gupta , Deepak Ramachandran
申请人地址： JP Tokyo
专利权人： HONDA MOTOR CO., LTD.
当前专利权人： HONDA MOTOR CO., LTD.
当前专利权人地址： JP Tokyo
主分类号： G06F15/18
IPC分类号： G06F15/18

Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks

摘要：

The present invention provides a method for learning a policy used by a computing system to perform a task, such delivery of one or more objects by the computing system. During a first time interval, the computing system determines a first state, a first action and a first reward value. As the computing system determines different states, actions and reward values during subsequent time intervals, a state description identifying the current sate, the current action, the current reward and a predicted action is stored. Responsive to a variance of a stored state description falling below a threshold value, the stored state description is used to modify one or more weights in the policy associated with the first state.

摘要（中）：

本发明提供了一种用于学习由计算系统用于执行任务的策略的方法，所述任务由计算系统传送一个或多个对象。在第一时间间隔期间，计算系统确定第一状态，第一动作和第一回报值。随着计算系统在随后的时间间隔期间确定不同的状态，动作和奖励值，存储识别当前状态，当前动作，当前奖励和预测动作的状态描述。响应于低于阈值的存储状态描述的方差，存储的状态描述用于修改与第一状态相关联的策略中的一个或多个权重。

公开/授权文献

US08326780B2 Smoothed sarsa: reinforcement learning for robot delivery tasks 公开/授权日：2012-12-04

信息查询

Global Dossier Espacenet