Patent search ap:("GOOGLE LLC") AND inv:"Pierre Sermanet" Page 1

1.

发明授权
Optimizing policy controllers for robotic agents using image embeddings 有权

公开(公告)号：US12240117B2

公开(公告)日：2025-03-04

申请号：US18157919

申请日：2023-01-23

Applicant: Google LLC

Inventor： Yevgen Chebotar , Pierre Sermanet , Harrison Lynch

IPC: B25J9/16 , G05B13/02 , G06N3/084 , G06N20/00

Abstract: There are provided systems, methods, and apparatus, for optimizing a policy controller to control a robotic agent that interacts with an environment to perform a robotic task. One of the methods includes optimizing the policy controller using a neural network that generates numeric embeddings of images of the environment and a demonstration sequence of demonstration images of another agent performing a version of the robotic task.

2.

发明授权
Unsupervised detection of intermediate reinforcement learning goals 有权

公开(公告)号：US12106200B2

公开(公告)日：2024-10-01

申请号：US18168000

申请日：2023-02-13

Applicant: Google LLC

Inventor： Pierre Sermanet

IPC: G06N3/006 , G06F18/21 , G06F18/2111 , G06F18/2132 , G06N3/045 , G06N3/08

CPC classification number: G06N3/006 , G06F18/2111 , G06F18/2132 , G06F18/217 , G06N3/045 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting intermediate reinforcement learning goals. One of the methods includes obtaining a plurality of demonstration sequences, each of the demonstration sequences being a sequence of images of an environment while a respective instance of a reinforcement learning task is being performed; for each demonstration sequence, processing each image in the demonstration sequence through an image processing neural network to determine feature values for a respective set of features for the image; determining, from the demonstration sequences, a partitioning of the reinforcement learning task into a plurality of subtasks, wherein each image in each demonstration sequence is assigned to a respective subtask of the plurality of subtasks; and determining, from the feature values for the images in the demonstration sequences, a respective set of discriminative features for each of the plurality of subtasks.

3.

发明授权
Optimizing policy controllers for robotic agents using image embeddings 有权

公开(公告)号：US11559887B2

公开(公告)日：2023-01-24

申请号：US16649596

申请日：2018-09-20

Applicant: GOOGLE LLC

Inventor： Yevgen Chebotar , Pierre Sermanet , Harrison Lynch

IPC: G06N20/00 , G05B13/02 , G06N3/08 , B25J9/16

Abstract: There are provided systems, methods, and apparatus, for optimizing a policy controller to control a robotic agent that interacts with an environment to perform a robotic task. One of the methods includes optimizing the policy controller using a neural network that generates numeric embeddings of images of the environment and a demonstration sequence of demonstration images of another agent performing a version of the robotic task.

4.

发明申请
MIRROR LOSS NEURAL NETWORKS 有权

公开(公告)号：US20230020615A1

公开(公告)日：2023-01-19

申请号：US17893454

申请日：2022-08-23

Applicant: Google LLC

Inventor： Pierre Sermanet

IPC: B25J9/16 , G05B13/02 , G06N3/08

Abstract: This description relates to a neural network that has multiple network parameters and is configured to receive an input observation characterizing a state of an environment and to process the input observation to generate a numeric embedding of the state of the environment. The neural network can be used to control a robotic agent. The network can be trained using a method comprising: obtaining a first observation captured by a first modality; obtaining a second observation that is co-occurring with the first observation and that is captured by a second, different modality; obtaining a third observation captured by the first modality that is not co-occurring with the first observation; determining a gradient of a triplet loss that uses the first observation, the second observation, and the third observation; and updating current values of the network parameters using the gradient of the triplet loss.

5.

发明申请
OPTIMIZING POLICY CONTROLLERS FOR ROBOTIC AGENTS USING IMAGE EMBEDDINGS 审中-公开

公开(公告)号：US20200276703A1

公开(公告)日：2020-09-03

申请号：US16649596

申请日：2018-09-20

Applicant: GOOGLE LLC

Inventor： Yevgen Chebotar , Pierre Sermanet , Harrison Lynch

IPC: B25J9/16 , G05B13/02 , G06N3/08 , G06N20/00

Abstract: There are provided systems, methods, and apparatus, for optimizing a policy controller to control a robotic agent that interacts with an environment to perform a robotic task. One of the methods includes optimizing the policy controller using a neural network that generates numeric embeddings of images of the environment and a demonstration sequence of demonstration images of another agent performing a version of the robotic task.

6.

发明公开
OPTIMIZING POLICY CONTROLLERS FOR ROBOTIC AGENTS USING IMAGE EMBEDDINGS 审中-公开

公开(公告)号：US20230150127A1

公开(公告)日：2023-05-18

申请号：US18157919

申请日：2023-01-23

Applicant: Google LLC

Inventor： YEVGEN CHEBOTAR , Pierre Sermanet , Harrison Lynch

IPC: B25J9/16 , G06N20/00 , G05B13/02 , G06N3/084

CPC classification number: B25J9/163 , G06N20/00 , B25J9/1664 , B25J9/1697 , G05B13/0205 , G05B13/027 , G06N3/084

Abstract: There are provided systems, methods, and apparatus, for optimizing a policy controller to control a robotic agent that interacts with an environment to perform a robotic task. One of the methods includes optimizing the policy controller using a neural network that generates numeric embeddings of images of the environment and a demonstration sequence of demonstration images of another agent performing a version of the robotic task.

7.

发明授权
Unsupervised detection of intermediate reinforcement learning goals 有权

公开(公告)号：US11580360B2

公开(公告)日：2023-02-14

申请号：US16347651

申请日：2017-11-06

Applicant: GOOGLE LLC

Inventor： Pierre Sermanet

IPC: G06N3/045 , G06N3/08 , G06F18/21 , G06F18/2111 , G06F18/2132

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting intermediate reinforcement learning goals. One of the methods includes obtaining a plurality of demonstration sequences, each of the demonstration sequences being a sequence of images of an environment while a respective instance of a reinforcement learning task is being performed; for each demonstration sequence, processing each image in the demonstration sequence through an image processing neural network to determine feature values for a respective set of features for the image; determining, from the demonstration sequences, a partitioning of the reinforcement learning task into a plurality of subtasks, wherein each image in each demonstration sequence is assigned to a respective subtask of the plurality of subtasks; and determining, from the feature values for the images in the demonstration sequences, a respective set of discriminative features for each of the plurality of subtasks.

8.

发明授权
Training a deep neural network model to generate rich object-centric embeddings of robotic vision data 有权

公开(公告)号：US11887363B2

公开(公告)日：2024-01-30

申请号：US17279924

申请日：2019-09-27

Applicant: Google LLC

Inventor： Soeren Pirk , Yunfei Bai , Pierre Sermanet , Seyed Mohammad Khansari Zadeh , Harrison Lynch

IPC: G06V20/10 , B25J9/16 , B25J13/00 , G05B13/02 , G06N3/08 , G10L15/22 , G06F18/21 , G06F18/2413 , G06V10/764 , G06V10/70 , G06V10/776 , G06V10/82

CPC classification number: G06V20/10 , B25J9/1697 , B25J13/003 , G05B13/027 , G06F18/217 , G06F18/2413 , G06N3/08 , G06V10/764 , G06V10/768 , G06V10/776 , G06V10/82 , G10L15/22

Abstract: Training a machine learning model (e.g., a neural network model such as a convolutional neural network (CNN) model) so that, when trained, the model can be utilized in processing vision data (e.g., from a vision component of a robot), that captures an object, to generate a rich object-centric embedding for the vision data. The generated embedding can enable differentiation of even subtle variations of attributes of the object captured by the vision data.

9.

发明授权
Mirror loss neural networks 有权

公开(公告)号：US11453121B2

公开(公告)日：2022-09-27

申请号：US16468987

申请日：2018-03-19

Applicant: Google LLC

Inventor： Pierre Sermanet

IPC: G05B19/04 , B25J9/16 , G05B13/02 , G06N3/08 , H04N7/18

Abstract: This description relates to a neural network that has multiple network parameters and is configured to receive an input observation characterizing a state of an environment and to process the input observation to generate a numeric embedding of the state of the environment. The neural network can be used to control a robotic agent. The network can be trained using a method comprising: obtaining a first observation captured by a first modality; obtaining a second observation that is co-occurring with the first observation and that is captured by a second, different modality; obtaining a third observation captured by the first modality that is not co-occurring with the first observation; determining a gradient of a triplet loss that uses the first observation, the second observation, and the third observation; and updating current values of the network parameters using the gradient of the triplet loss.

10.

发明申请
CONTROLLING AGENTS USING LATENT PLANS 有权

公开(公告)号：US20220076099A1

公开(公告)日：2022-03-10

申请号：US17432366

申请日：2020-02-19

Applicant: Google LLC

Inventor： Pierre Sermanet , Seyed Mohammad Khansari Zadeh , Harrison Corey Lynch

IPC: G06N3/04 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for controlling an agent. One of the methods includes controlling the agent using a policy neural network that processes a policy input that includes (i) a current observation, (ii) a goal observation, and (iii) a selected latent plan to generate a current action output that defines an action to be performed in response to the current observation.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification