LEARNING ROBOTIC SKILLS WITH IMITATION AND REINFORCEMENT AT SCALE

    公开(公告)号:US20220410380A1

    公开(公告)日:2022-12-29

    申请号:US17843288

    申请日:2022-06-17

    Abstract: Utilizing an initial set of offline positive-only robotic demonstration data for pre-training an actor network and a critic network for robotic control, followed by further training of the networks based on online robotic episodes that utilize the network(s). Implementations enable the actor network to be effectively pre-trained, while mitigating occurrences of and/or the extent of forgetting when further trained based on episode data. Implementations additionally or alternatively enable the actor network to be trained to a given degree of effectiveness in fewer training steps. In various implementations, one or more adaptation techniques are utilized in performing the robotic episodes and/or in performing the robotic training. The adaptation techniques can each, individually, result in one or more corresponding advantages and, when used in any combination, the corresponding advantages can accumulate. The adaptation techniques include Positive Sample Filtering, Adaptive Exploration, Using Max Q Values, and Using the Actor in CEM.

    Action prediction networks for robotic grasping

    公开(公告)号:US11325252B2

    公开(公告)日:2022-05-10

    申请号:US16570522

    申请日:2019-09-13

    Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict a probability density which can include candidate actions of successful grasps by the end effector given an input image. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.

    ACTION PREDICTION NETWORKS FOR ROBOTIC GRASPING

    公开(公告)号:US20200086483A1

    公开(公告)日:2020-03-19

    申请号:US16570522

    申请日:2019-09-13

    Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict a probability density which can include candidate actions of successful grasps by the end effector given an input image. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.

Patent Agency Ranking