Patent search ap:("Google LLC") AND inv:"Mrinal Kalakrishnan" Page 1

1.

发明申请
MACHINE LEARNING METHODS AND APPARATUS FOR ROBOTIC MANIPULATION AND THAT UTILIZE MULTI-TASK DOMAIN ADAPTATION 有权

公开(公告)号：US20250033201A1

公开(公告)日：2025-01-30

申请号：US18918590

申请日：2024-10-17

Applicant: GOOGLE LLC

Inventor： Yunfei Bai , Kuan Fang , Stefan Hinterstoisser , Mrinal Kalakrishnan

IPC: B25J9/16

Abstract: Implementations are directed to training a machine learning model that, once trained, is used in performance of robotic grasping and/or other manipulation task(s) by a robot. The model can be trained using simulated training examples that are based on simulated data that is based on simulated robot(s) attempting simulated manipulations of various simulated objects. Portion(s) of the model can also be trained based on real training examples that are based on data from real-world physical robots attempting manipulations of various objects. The simulated training examples can be utilized to train the model to predict an output that can be utilized in a particular task—and the real training examples used to adapt at least a portion of the model to the real-world domain can be tailored to a distinct task. In some implementations, domain-adversarial similarity losses are determined during training, and utilized to regularize at least portion(s) of the model.

2.

发明授权
Efficient adaption of robot control policy for new task using meta-learning based on meta-imitation learning and meta-reinforcement learning 有权

公开(公告)号：US12083678B2

公开(公告)日：2024-09-10

申请号：US17422260

申请日：2020-01-23

Applicant: Google LLC

Inventor： Mrinal Kalakrishnan , Yunfei Bai , Paul Wohlhart , Eric Jang , Chelsea Finn , Seyed Mohammad Khansari Zadeh , Sergey Levine , Allan Zhou , Alexander Herzog , Daniel Kappler

IPC: B25J9/16

CPC classification number: B25J9/163 , G05B2219/40116 , G05B2219/40499

Abstract: Techniques are disclosed that enable training a meta-learning model, for use in causing a robot to perform a task, using imitation learning as well as reinforcement learning. Some implementations relate to training the meta-learning model using imitation learning based on one or more human guided demonstrations of the task. Additional or alternative implementations relate to training the meta-learning model using reinforcement learning based on trials of the robot attempting to perform the task. Further implementations relate to using the trained meta-learning model to few shot (or one shot) learn a new task based on a human guided demonstration of the new task.

3.

发明授权
Training a policy model for a robotic task, using reinforcement learning and utilizing data that is based on episodes, of the robotic task, guided by an engineered policy 有权

公开(公告)号：US12210943B2

公开(公告)日：2025-01-28

申请号：US17161845

申请日：2021-01-29

Applicant: GOOGLE LLC

Inventor： Adrian Li , Benjamin Holson , Alexander Herzog , Mrinal Kalakrishnan

IPC: G06N20/00 , G06N3/008 , G06N5/04

Abstract: Implementations disclosed herein relate to utilizing at least one existing manually engineered policy, for a robotic task, in training an RL policy model that can be used to at least selectively replace a portion of the engineered policy. The RL policy model can be trained for replacing a portion of a robotic task and can be trained based on data from episodes of attempting performance of the robotic task, including episodes in which the portion is performed based on the engineered policy and/or other portion(s) are performed based on the engineered policy. Once trained, the RL policy model can be used, at least selectively and in lieu of utilization of the engineered policy, to perform the portion of robotic task, while other portion(s) of the robotic task are performed utilizing the engineered policy and/or other similarly trained (but distinct) RL policy model(s).

4.

发明公开
Domain Adaptation Using Simulation to Simulation Transfer 审中-公开

公开(公告)号：US20240238967A1

公开(公告)日：2024-07-18

申请号：US18605452

申请日：2024-03-14

Applicant: Google LLC

Inventor： Paul Wohlhart , Stephen James , Mrinal Kalakrishnan , Konstantinos Bousmalis

IPC: B25J9/16 , G05B13/02 , G06F18/21 , G06F18/214 , G06F18/2431 , G06N3/045 , G06N3/08 , G06T7/50 , G06V10/764 , G06V10/776 , G06V10/82 , G06V20/10

CPC classification number: B25J9/161 , B25J9/163 , B25J9/1671 , B25J9/1697 , G05B13/027 , G06F18/2148 , G06F18/217 , G06F18/2431 , G06N3/045 , G06N3/08 , G06T7/50 , G06V10/764 , G06V10/776 , G06V10/82 , G06V20/10 , G06T2207/20081 , G06T2207/20084

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a generator neural network to adapt input images.

5.

发明授权
Domain adaptation using simulation to simulation transfer 有权

公开(公告)号：US11951622B2

公开(公告)日：2024-04-09

申请号：US17656137

申请日：2022-03-23

Applicant: Google LLC

Inventor： Paul Wohlhart , Stephen James , Mrinal Kalakrishnan , Konstantinos Bousmalis

IPC: G06T7/00 , B25J9/16 , G05B13/02 , G06F18/21 , G06F18/214 , G06F18/2431 , G06N3/045 , G06N3/08 , G06T7/50 , G06V10/764 , G06V10/776 , G06V10/82 , G06V20/10

CPC classification number: B25J9/161 , B25J9/163 , B25J9/1671 , B25J9/1697 , G05B13/027 , G06F18/2148 , G06F18/217 , G06F18/2431 , G06N3/045 , G06N3/08 , G06T7/50 , G06V10/764 , G06V10/776 , G06V10/82 , G06V20/10 , G06T2207/20081 , G06T2207/20084

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a generator neural network to adapt input images.

6.

发明授权
Using simulation and domain adaptation for robotic control 有权

公开(公告)号：US11341364B2

公开(公告)日：2022-05-24

申请号：US16649599

申请日：2018-09-20

Applicant: GOOGLE LLC

Inventor： Konstantinos Bousmalis , Alexander Irpan , Paul Wohlhart , Yunfei Bai , Mrinal Kalakrishnan , Julian Ibarz , Sergey Vladimir Levine , Kurt Konolige , Vincent O. Vanhoucke , Matthew Laurance Kelcey

IPC: G06K9/62 , B25J9/16 , G06N3/04 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training an action selection neural network that is used to control a robotic agent interacting with a real-world environment.

7.

发明授权
Machine learning methods and apparatus for robotic manipulation and that utilize multi-task domain adaptation 有权

公开(公告)号：US12138793B2

公开(公告)日：2024-11-12

申请号：US16987771

申请日：2020-08-07

Applicant: GOOGLE LLC

Inventor： Yunfei Bai , Kuan Fang , Stefan Hinterstoisser , Mrinal Kalakrishnan

IPC: B25J9/16

Abstract: Implementations are directed to training a machine learning model that, once trained, is used in performance of robotic grasping and/or other manipulation task(s) by a robot. The model can be trained using simulated training examples that are based on simulated data that is based on simulated robot(s) attempting simulated manipulations of various simulated objects. Portion(s) of the model can also be trained based on real training examples that are based on data from real-world physical robots attempting manipulations of various objects. The simulated training examples can be utilized to train the model to predict an output that can be utilized in a particular task—and the real training examples used to adapt at least a portion of the model to the real-world domain can be tailored to a distinct task. In some implementations, domain-adversarial similarity losses are determined during training, and utilized to regularize at least portion(s) of the model.

8.

发明申请
EFFICIENT ADAPTION OF ROBOT CONTROL POLICY FOR NEW TASK USING META-LEARNING BASED ON META-IMITATION LEARNING AND META-REINFORCEMENT LEARNING 有权

公开(公告)号：US20220105624A1

公开(公告)日：2022-04-07

申请号：US17422260

申请日：2020-01-23

Applicant: Google LLC

Inventor： Mrinal Kalakrishnan , Yunfei Bai , Paul Wohlhart , Eric Jang , Chelsea Finn , Seyed Mohammad Khansari Zadeh , Sergey Levine , Allan Zhou , Alexander Herzog , Daniel Kappler

IPC: B25J9/16

Abstract: Techniques are disclosed that enable training a meta-learning model, for use in causing a robot to perform a task, using imitation learning as well as reinforcement learning. Some implementations relate to training the meta-learning model using imitation learning based on one or more human guided demonstrations of the task. Additional or alternative implementations relate to training the meta-learning model using reinforcement learning based on trials of the robot attempting to perform the task. Further implementations relate to using the trained meta-learning model to few shot (or one shot) learn a new task based on a human guided demonstration of the new task.

9.

发明授权
Operating multiple testing robots based on robot instructions and/or environmental parameters received in a request 有权

公开(公告)号：US12049004B1

公开(公告)日：2024-07-30

申请号：US18103312

申请日：2023-01-30

Applicant: GOOGLE LLC

Inventor： Peter Pastor Sampedro , Mrinal Kalakrishnan , Ali Yahya Valdovinos , Adrian Li , Kurt Konolige , Vincent Dureau

IPC: B25J9/00 , B25J9/16

CPC classification number: B25J9/0084 , B25J9/163 , G05B2219/39271

Abstract: Methods and apparatus related to receiving a request that includes robot instructions and/or environmental parameters, operating each of a plurality of robots based on the robot instructions and/or in an environment configured based on the environmental parameters, and storing data generated by the robots during the operating. In some implementations, at least part of the stored data that is generated by the robots is provided in response to the request and/or additional data that is generated based on the stored data is provided in response to the request.

10.

发明授权
Selecting actions to be performed by a robotic agent 有权

公开(公告)号：US11179847B2

公开(公告)日：2021-11-23

申请号：US16341184

申请日：2017-10-12

Applicant: Google LLC

Inventor： Mrinal Kalakrishnan , Vikas Sindhwani

IPC: B25J9/16

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a system configured to plan actions to be performed by a robotic agent interacting with an environment to accomplish an objective by determining an optimized trajectory of state—action pairs for accomplishing the objective. The system maintains a current optimized trajectory and a current trust region radius, and optimizes a localized objective within the current trust region radius of the current optimized trajectory to determine a candidate updated optimized trajectory. The system determines whether the candidate updated optimized trajectory improves over the current optimized trajectory. In response to determining that the candidate updated optimized trajectory improves over the current optimized trajectory, the system updates the current optimized trajectory to the candidate updated optimized trajectory and updates the current trust region radius.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification