Generating reinforcement learning data that is compatible with reinforcement learning for a robotic task

    Publication Number: US11610153B1

    Publication Date: 2023-03-21

    Application Number: US16729712

    Application Date: 2019-12-30

    Abstract: Utilizing at least one existing policy (e.g., a manually engineered policy) for a robotic task in generating reinforcement learning (RL) data that can be used in training an RL policy for an instance of RL of the robotic task. The existing policy can be one that, standing alone, would not generate data compatible with the instance of RL for the robotic task. In contrast, the generated RL data is compatible with RL for the robotic task at least by virtue of including state data that is in the state space of the RL for the robotic task and actions that are in the action space of the RL for the robotic task. The generated RL data can be used in at least some of the initial training of the RL policy.
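A minimal sketch of the idea, with all names here (the toy environment, `collect_rl_data`, and the mapping functions) assumed for illustration rather than taken from the patent: roll out the existing policy, and record each transition only after mapping observations and actions into the RL task's state and action spaces, so the resulting tuples can seed RL training.

```python
class ToyEnv:
    """Stand-in environment: a 1-D position that must reach 3 (assumption)."""
    def reset(self):
        self.pos = 0
        return self.pos
    def step(self, action):
        self.pos += action
        done = self.pos >= 3
        reward = 1.0 if done else 0.0
        return self.pos, reward, done

def collect_rl_data(env, existing_policy, to_rl_state, to_rl_action, episodes=1):
    """Return (state, action, reward, next_state, done) tuples expressed in
    the RL policy's state/action spaces."""
    transitions = []
    for _ in range(episodes):
        obs = env.reset()
        done = False
        while not done:
            action = existing_policy(obs)        # native action of the old policy
            next_obs, reward, done = env.step(action)
            transitions.append((
                to_rl_state(obs),                # mapped into the RL state space
                to_rl_action(action),            # mapped into the RL action space
                reward,
                to_rl_state(next_obs),
                done,
            ))
            obs = next_obs
    return transitions

# Engineered policy: always step forward; the mappings lift scalars into
# (hypothetical) tuple-valued state/action spaces of the RL task.
data = collect_rl_data(ToyEnv(),
                       existing_policy=lambda obs: 1,
                       to_rl_state=lambda s: (float(s),),
                       to_rl_action=lambda a: (float(a),))
```

The transitions in `data` are exactly the shape a replay-based RL trainer would consume, even though the engineered policy itself knows nothing about that format.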

    Action prediction networks for robotic grasping

    Publication Number: US11325252B2

    Publication Date: 2022-05-10

    Application Number: US16570522

    Application Date: 2019-09-13

    Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict, given an input image, a probability density over candidate actions likely to result in successful grasps by the end effector. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.
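One way to read the servoing loop, sketched with assumed names and a stand-in scoring function in place of the trained network: repeatedly sample candidate end-effector motions, score each against the current image, and command the best-scoring one.

```python
import random

def servo_step(score_fn, image, n_candidates=64):
    """Sample candidate grasp motions, score each with the (assumed) action
    prediction network, and return the best-scoring one for visual servoing."""
    candidates = [(random.uniform(-1, 1),        # dx
                   random.uniform(-1, 1),        # dy
                   random.uniform(-3.14, 3.14))  # wrist rotation
                  for _ in range(n_candidates)]
    scored = [(score_fn(image, a), a) for a in candidates]
    return max(scored, key=lambda sa: sa[0])

# Toy score standing in for the learned density: prefers small motions.
score, action = servo_step(lambda img, a: -sum(x * x for x in a), image=None)
```

In a real servoing loop this step would run once per camera frame, with the chosen motion executed before the next image is captured.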

    ACTION PREDICTION NETWORKS FOR ROBOTIC GRASPING

    Publication Number: US20200086483A1

    Publication Date: 2020-03-19

    Application Number: US16570522

    Application Date: 2019-09-13

    Abstract: Deep machine learning methods and apparatus related to the manipulation of an object by an end effector of a robot are described herein. Some implementations relate to training an action prediction network to predict, given an input image, a probability density over candidate actions likely to result in successful grasps by the end effector. Some implementations are directed to utilization of an action prediction network to visually servo a grasping end effector of a robot to achieve a successful grasp of an object by the grasping end effector.

    Control policies for collective robot learning

    Publication Number: US11188821B1

    Publication Date: 2021-11-30

    Application Number: US15705601

    Application Date: 2017-09-15

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a global policy neural network. One of the methods includes initializing an instance of the robotic task for each of multiple local workers, generating a trajectory of state-action pairs by selecting actions to be performed by the robotic agent while performing the instance of the robotic task, optimizing a local policy controller on the trajectory, generating an optimized trajectory using the optimized local controller, and storing the optimized trajectory in a replay memory associated with the local worker. The method also includes sampling, for each of multiple global workers, an optimized trajectory from one of the one or more replay memories associated with the global worker, and training the replica of the global policy neural network maintained by the global worker on the sampled optimized trajectory to determine delta values for the parameters of the global policy neural network.
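A toy, single-process sketch of the local/global worker scheme; every detail here (the linear policy, the delta-update rule, and all names) is an assumption for illustration, not the patented method. Local workers fill replay memories with optimized state-action trajectories; global workers then sample trajectories and accumulate parameter deltas for a shared global policy.

```python
import random

def train_global_policy(n_local=4, n_global=2, rounds=20, lr=0.5):
    """Sketch: local workers write optimized trajectories into replay
    memories; global workers sample them and apply delta values to the
    single parameter w of a linear global policy a = w * s."""
    target_w = 2.0                               # behavior the local controllers reach
    # Local phase: each worker stores one optimized trajectory in its replay memory.
    replays = []
    for _ in range(n_local):
        traj = [(s, target_w * s) for s in (random.uniform(0, 1) for _ in range(10))]
        replays.append([traj])                   # one replay memory per local worker
    # Global phase: workers repeatedly sample a trajectory and compute a delta.
    w = 0.0
    for _ in range(rounds):
        for _ in range(n_global):
            traj = random.choice(random.choice(replays))
            delta = sum((a - w * s) * s for s, a in traj) / len(traj)
            w += lr * delta                      # apply delta to global parameters
    return w

w = train_global_policy()
```

Because every global worker's delta pulls `w` toward the behavior stored by the local workers, the shared parameter converges to their common solution.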

    MACHINE LEARNING METHODS AND APPARATUS FOR AUTOMATED ROBOTIC PLACEMENT OF SECURED OBJECT IN APPROPRIATE LOCATION

    Publication Number: US20210229276A1

    Publication Date: 2021-07-29

    Application Number: US17230628

    Application Date: 2021-04-14

    Abstract: Training and/or use of a machine learning model for placement of an object secured by an end effector of a robot. A trained machine learning model can be used to process: (1) a current image, captured by a vision component of a robot, that captures an end effector securing an object; (2) a candidate end effector action that defines a candidate motion of the end effector; and (3) a target placement input that indicates a target placement location for the object. Based on the processing, a prediction can be generated that indicates the likelihood of successful placement of the object in the target placement location with application of the motion defined by the candidate end effector action. At many iterations, the candidate end effector action with the highest probability is selected and control commands are provided to cause the end effector to move in conformance with the corresponding end effector action. When at least one release criterion is satisfied, control commands can be provided to cause the end effector to release the object, thereby placing it in the target placement location.
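A sketch of one control iteration, with the trained model replaced by a toy distance-based success predictor and all names assumed: score each candidate motion against the image and target, then either command the best motion or release once a (hypothetical) release criterion is met.

```python
def placement_step(predict, image, target, candidates, release_prob=0.9):
    """Score candidate end-effector motions and either command the best one
    or release the object. The release criterion used here -- the no-op
    action already predicts success -- is one assumed possibility."""
    scored = [(predict(image, a, target), a) for a in candidates]
    best_prob, best_action = max(scored, key=lambda pa: pa[0])
    if predict(image, (0.0, 0.0, 0.0), target) >= release_prob:
        return ("release", None)
    return ("move", best_action)

# Toy stand-in for the trained model: "image" is just the object's 1-D
# position; predicted success falls off with distance to the target after
# applying the candidate motion.
predict = lambda pos, act, tgt: max(0.0, 1.0 - abs(pos + act[0] - tgt))

cmd_far = placement_step(predict, image=0.0, target=1.0,
                         candidates=[(0.5, 0, 0), (1.0, 0, 0), (-0.5, 0, 0)])
cmd_at = placement_step(predict, image=1.0, target=1.0,
                        candidates=[(0.5, 0, 0), (1.0, 0, 0), (-0.5, 0, 0)])
```

Far from the target the step commands the motion that moves the object onto it; once the object is already at the target, the release criterion fires instead.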
