DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

    公开(公告)号:US20210237266A1

    公开(公告)日:2021-08-05

    申请号:US17052679

    申请日:2019-06-14

    Applicant: Google LLC

    Abstract: Using large-scale reinforcement learning to train a policy model that can be utilized by a robot in performing a robotic task in which the robot interacts with one or more environmental objects. In various implementations, off-policy deep reinforcement learning is used to train the policy model, and the off-policy deep reinforcement learning is based on self-supervised data collection. The policy model can be a neural network model. Implementations of the reinforcement learning utilized in training the neural network model utilize a continuous-action variant of Q-learning. Through techniques disclosed herein, implementations can learn policies that generalize effectively to previously unseen objects, previously unseen environments, etc.

    DEEP REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATION

    公开(公告)号:US20220388159A1

    公开(公告)日:2022-12-08

    申请号:US17878186

    申请日:2022-08-01

    Applicant: Google LLC

    Abstract: Implementations utilize deep reinforcement learning to train a policy neural network that parameterizes a policy for determining a robotic action based on a current state. Some of those implementations collect experience data from multiple robots that operate simultaneously. Each robot generates instances of experience data during iterative performance of episodes that are each explorations of performing a task, and that are each guided based on the policy network and the current policy parameters for the policy network during the episode. The collected experience data is generated during the episodes and is used to train the policy network by iteratively updating policy parameters of the policy network based on a batch of collected experience data. Further, prior to performance of each of a plurality of episodes performed by the robots, the current updated policy parameters can be provided (or retrieved) for utilization in performance of the episode.

    SYSTEM AND METHODS FOR TRAINING ROBOT POLICIES IN THE REAL WORLD

    公开(公告)号:US20220143819A1

    公开(公告)日:2022-05-12

    申请号:US17094521

    申请日:2020-11-10

    Applicant: Google LLC

    Abstract: Techniques are disclosed that enable training a plurality of policy networks, each policy network corresponding to a disparate robotic training task, using a mobile robot in a real world workspace. Various implementations include selecting a training task based on comparing a pose of the mobile robot to at least one parameter of a real world training workspace. For example, the training task can be selected based on the position of a landmark, within the workspace, relative to the pose. For instance, the training task can be selected such that the selected training task moves the mobile robot towards the landmark.

    DEEP MACHINE LEARNING METHODS AND APPARATUS FOR ROBOTIC GRASPING

    公开(公告)号:US20210162590A1

    公开(公告)日:2021-06-03

    申请号:US17172666

    申请日:2021-02-10

    Applicant: Google LLC

    Abstract: Deep machine learning methods and apparatus related to manipulation of an object by an end effector of a robot. Some implementations relate to training a deep neural network to predict a measure that candidate motion data for an end effector of a robot will result in a successful grasp of one or more objects by the end effector. Some implementations are directed to utilization of the trained deep neural network to servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector. For example, the trained deep neural network may be utilized in the iterative updating of motion control commands for one or more actuators of a robot that control the pose of a grasping end effector of the robot, and to determine when to generate grasping control commands to effectuate an attempted grasp by the grasping end effector.

Patent Agency Ranking