-
公开(公告)号:US11610153B1
公开(公告)日:2023-03-21
申请号:US16729712
申请日:2019-12-30
Applicant: X Development LLC
Inventor: Alexander Herzog , Adrian Li , Mrinal Kalakrishnan , Benjamin Holson
Abstract: Utilizing at least one existing policy (e.g. a manually engineered policy) for a robotic task, in generating reinforcement learning (RL) data that can be used in training an RL policy for an instance of RL of the robotic task. The existing policy can be one that, standing alone, will not generate data that is compatible with the instance of RL for the robotic task. In contrast, the generated RL data is compatible with RL for the robotic task at least by virtue of it including state data that is in a state space of the RL for the robotic task, and including actions that are in the action space of the RL for the robotic task. The generated RL data can be used in at least some of the initial training for the RL policy using reinforcement learning.
-
公开(公告)号:US20220245503A1
公开(公告)日:2022-08-04
申请号:US17161845
申请日:2021-01-29
Applicant: X Development LLC
Inventor: Adrian Li , Benjamin Holson , Alexander Herzog , Mrinal Kalakrishnan
Abstract: Implementations disclosed herein relate to utilizing at least one existing manually engineered policy, for a robotic task, in training an RL policy model that can be used to at least selectively replace a portion of the engineered policy. The RL policy model can be trained for replacing a portion of a robotic task and can be trained based on data from episodes of attempting performance of the robotic task, including episodes in which the portion is performed based on the engineered policy and/or other portion(s) are performed based on the engineered policy. Once trained, the RL policy model can be used, at least selectively and in lieu of utilization of the engineered policy, to perform the portion of robotic task, while other portion(s) of the robotic task are performed utilizing the engineered policy and/or other similarly trained (but distinct) RL policy model(s).
-
公开(公告)号:US11325252B2
公开(公告)日:2022-05-10
申请号:US16570522
申请日:2019-09-13
Applicant: X Development LLC
Inventor: Adrian Li , Peter Pastor Sampedro , Mengyuan Yan , Mrinal Kalakrishnan
Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict a probability density which can include candidate actions of successful grasps by the end effector given an input image. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.
-
公开(公告)号:US20200086483A1
公开(公告)日:2020-03-19
申请号:US16570522
申请日:2019-09-13
Applicant: X Development LLC
Inventor: Adrian Li , Peter Pastor Sampedro , Mengyuan Yan , Mrinal Kalakrishnan
IPC: B25J9/16
Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict a probability density which can include candidate actions of successful grasps by the end effector given an input image. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.
-
公开(公告)号:US20240078683A1
公开(公告)日:2024-03-07
申请号:US18295998
申请日:2023-04-05
Applicant: X Development LLC
Inventor: Mrinal Kalakrishnan , Adrian Ling Hin Li , Nicolas Hudson
IPC: G06T7/246 , G06T7/11 , G06T7/60 , G06T7/73 , G06V10/42 , G06V10/75 , G06V10/82 , G06V20/10 , G06V30/186 , G06V30/19 , G06V30/194
CPC classification number: G06T7/246 , G06T7/11 , G06T7/60 , G06T7/73 , G06V10/42 , G06V10/758 , G06V10/82 , G06V20/10 , G06V30/186 , G06V30/19173 , G06V30/194 , G06T2207/10004 , G06T2207/20084 , G06T2207/30244
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium for predicting object pose. In one aspect, a method includes receiving an image of an object having one or more feature points; providing the image as an input to a neural network subsystem trained to receive images of objects and to generate an output including a heat map for each feature point; applying a differentiable transformation on each heat map to generate respective one or more feature coordinates for each feature point; providing the feature coordinates for each feature point as input to an object pose solver configured to compute a predicted object pose for the object, wherein the predicted object pose for the object specifies a position and an orientation of an object; and receiving, at the output of the object pose solver, a predicted object pose for the object in the image.
-
公开(公告)号:US11188821B1
公开(公告)日:2021-11-30
申请号:US15705601
申请日:2017-09-15
Applicant: X Development LLC
Inventor: Mrinal Kalakrishnan , Ali Hamid Yahya Valdovinos , Adrian Ling Hin Li , Yevgen Chebotar , Sergey Vladimir Levine
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, of training a global policy neural network. One of the methods includes initializing an instance of the robotic task for multiple local workers, generating a trajectory of state-action pairs by selecting actions to be performed by the robotic agent while performing the instance of the robotic task, optimizing a local policy controller on the trajectory, generating an optimized trajectory using the optimized local controller, and storing the optimized trajectory in a replay memory associated with the local worker. The method includes sampling, for multiple global workers, an optimized trajectory from one of one or more replay memories associated with the global worker, and training the replica of the global policy neural network maintained by the global worker on the sampled optimized trajectory to determine delta values for the parameters of the global policy neural network.
-
公开(公告)号:US20210229276A1
公开(公告)日:2021-07-29
申请号:US17230628
申请日:2021-04-14
Applicant: X Development LLC
Inventor: Seyed Mohammad Khansari Zadeh , Mrinal Kalakrishnan , Paul Wohlhart
Abstract: Training and/or use of a machine learning model for placement of an object secured by an end effector of a robot. A trained machine learning model can be used to process: (1) a current image, captured by a vision component of a robot, that captures an end effector securing an object; (2) a candidate end effector action that defines a candidate motion of the end effector; and (3) a target placement input that indicates a target placement location for the object. Based on the processing, a prediction can be generated that indicates likelihood of successful placement of the object in the target placement location with application of the motion defined by the candidate end effector action. At many iterations, the candidate end effector action with the highest probability is selected and control commands provided to cause the end effector to move in conformance with the corresponding end effector action. When at least one release criteria is satisfied, control commands can be provided to cause the end effector to release the object, thereby leading to the object being placed in the target placement location.
-
公开(公告)号:US10981270B1
公开(公告)日:2021-04-20
申请号:US16530711
申请日:2019-08-02
Applicant: X Development LLC
Inventor: Peter Pastor Sampedro , Mrinal Kalakrishnan , Ali Yahya Valdovinos , Adrian Li , Kurt Konolige , Vincent Dureau
Abstract: Methods and apparatus related to receiving a request that includes robot instructions and/or environmental parameters, operating each of a plurality of robots based on the robot instructions and/or in an environment configured based on the environmental parameters, and storing data generated by the robots during the operating. In some implementations, at least part of the stored data that is generated by the robots is provided in response to the request and/or additional data that is generated based on the stored data is provided in response to the request.
-
公开(公告)号:US20210078167A1
公开(公告)日:2021-03-18
申请号:US16886545
申请日:2020-05-28
Applicant: X Development LLC
Inventor: Seyed Mohammad Khansari Zadeh , Daniel Kappler , Jianlan Luo , Jeffrey Bingham , Mrinal Kalakrishnan
Abstract: Generating and utilizing action image(s) that represent a candidate pose (e.g., a candidate end effector pose), in determining whether to utilize the candidate pose in performance of a robotic task. The action image(s) and corresponding current image(s) can be processed, using a trained critic network, to generate a value that indicates a probability of success of the robotic task if component(s) of the robot are traversed to the particular pose. When the value satisfies one or more conditions (e.g., satisfies a threshold), the robot can be controlled to cause the component(s) to traverse to the particular pose in performing the robotic task.
-
公开(公告)号:US20200167606A1
公开(公告)日:2020-05-28
申请号:US16692509
申请日:2019-11-22
Applicant: X Development LLC
Inventor: Paul Wohlhart , Stephen James , Mrinal Kalakrishnan , Konstantinos Bousmalis
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a generator neural network to adapt input images.
-
-
-
-
-
-
-
-
-