Patent search ap:("Waymo LLC") AND inv:"Congcong Li" Page 1

1.

发明授权
Spatio-temporal-interactive networks 有权

公开(公告)号：US11610423B2

公开(公告)日：2023-03-21

申请号：US17099642

申请日：2020-11-16

Applicant: Waymo LLC

Inventor： Junhua Mao , Jiyang Gao , Yukai Liu , Congcong Li , Zhishuai Zhang , Dragomir Anguelov

IPC: G06V40/10 , G06K9/62 , B60W30/095 , G06V10/25

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing point cloud data using spatio-temporal-interactive networks.

2.

发明授权
Phrase recognition model for autonomous vehicles 有权

公开(公告)号：US11562573B2

公开(公告)日：2023-01-24

申请号：US17123185

申请日：2020-12-16

Applicant: WAYMO LLC

Inventor： Victoria Dean , Abhijit S Ogale , Henrik Kretzschmar , David Harrison Silver , Carl Kershaw , Pankaj Chaudhari , Chen Wu , Congcong Li

IPC: G06V20/58 , G05D1/00 , G06N3/08 , G06T11/20 , G06F40/30 , G06V30/148 , G06V30/10

Abstract: Aspects of the disclosure relate to training and using a phrase recognition model to identify phrases in images. As an example, a selected phrase list may include a plurality of phrases is received. Each phrase of the plurality of phrases includes text. An initial plurality of images may be received. A training image set may be selected from the initial plurality of images by identifying the phrase-containing images that include one or more phrases from the selected phrase list. Each given phrase-containing image of the training image set may be labeled with information identifying the one or more phrases from the selected phrase list included in the given phrase-containing images. The model may be trained based on the training image set such that the model is configured to, in response to receiving an input image, output data indicating whether a phrase of the plurality of phrases is included in the input image.

3.

发明申请
MULTI-MODAL 3-D POSE ESTIMATION 有权

公开(公告)号：US20220156965A1

公开(公告)日：2022-05-19

申请号：US17505900

申请日：2021-10-20

Applicant: Waymo LLC

Inventor： Jingxiao Zheng , Xinwei Shi , Alexander Gorban , Junhua Mao , Andre Liang Cornman , Yang Song , Ting Liu , Ruizhongtai Qi , Yin Zhou , Congcong Li , Dragomir Anguelov

IPC: G06T7/73 , G06K9/62 , G06K9/00

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for estimating a 3-D pose of an object of interest from image and point cloud data. In one aspect, a method includes obtaining an image of an environment; obtaining a point cloud of a three-dimensional region of the environment; generating a fused representation of the image and the point cloud; and processing the fused representation using a pose estimation neural network and in accordance with current values of a plurality of pose estimation network parameters to generate a pose estimation network output that specifies, for each of multiple keypoints, a respective estimated position in the three-dimensional region of the environment.

4.

发明申请
Object Action Classification For Autonomous Vehicles 有权

公开(公告)号：US20210294346A1

公开(公告)日：2021-09-23

申请号：US17343187

申请日：2021-06-09

Applicant: Waymo LLC

Inventor： Junhua Mao , Congcong Li , Alper Ayvaci , Chen Sun , Kevin Murphy , Ruichi Yu

IPC: G05D1/02 , G05D1/00 , G06K9/00 , G06K9/62 , B60W30/095 , G01S17/931

Abstract: Aspects of the disclosure relate to training and using a model for identifying actions of objects. For instance, LIDAR sensor data frames including an object bounding box corresponding to an object as well as an action label for the bounding box may be received. Each sensor frame is associated with a timestamp and is sequenced with respect to other sensor frames. Each given sensor data frame may be projected into a camera image of the object based on the timestamp associated with the given sensor data frame in order to provide fused data. The model may be trained using the fused data such that the model is configured to, in response to receiving fused data, the model outputs an action label for each object bounding box of the fused data. This output may then be used to control a vehicle in an autonomous driving mode.

5.

发明申请
SPATIO-TEMPORAL EMBEDDINGS 有权

公开(公告)号：US20210103744A1

公开(公告)日：2021-04-08

申请号：US17063553

申请日：2020-10-05

Applicant: Waymo LLC

Inventor： Jiyang Gao , Zijian Guo , Congcong Li

IPC: G06K9/00 , G06N3/04 , G06K9/62 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a spatio-temporal embedding of a sequence of point clouds. One of the methods includes obtaining a temporal sequence comprising a respective point cloud input corresponding to each of a plurality of time points, each point cloud input comprising point cloud data generated from sensor data captured by one or more sensors of a vehicle at the respective time point; processing each point cloud input using a first neural network to generate a respective spatial embedding that characterizes the point cloud input; and processing the spatial embeddings of the point cloud inputs using a second neural network to generate a spatio-temporal embedding that characterizes the point cloud inputs in the temporal sequence.

6.

发明授权
Phrase recognition model for autonomous vehicles 有权

公开(公告)号：US10699141B2

公开(公告)日：2020-06-30

申请号：US16018490

申请日：2018-06-26

Applicant: Waymo LLC

Inventor： Victoria Dean , Abhijit S. Ogale , Henrik Kretzschmar , David Harrison Silver , Carl Kershaw , Pankaj Chaudhari , Chen Wu , Congcong Li

IPC: G06K9/00 , G05D1/00 , G06N3/08 , G06T11/20 , G06K9/34 , G06F40/30

Abstract: Aspects of the disclosure relate to training and using a phrase recognition model to identify phrases in images. As an example, a selected phrase list may include a plurality of phrases is received. Each phrase of the plurality of phrases includes text. An initial plurality of images may be received. A training image set may be selected from the initial plurality of images by identifying the phrase-containing images that include one or more phrases from the selected phrase list. Each given phrase-containing image of the training image set may be labeled with information identifying the one or more phrases from the selected phrase list included in the given phrase-containing images. The model may be trained based on the training image set such that the model is configured to, in response to receiving an input image, output data indicating whether a phrase of the plurality of phrases is included in the input image.

7.

发明授权
Agent trajectory prediction using target locations 有权

公开(公告)号：US11987265B1

公开(公告)日：2024-05-21

申请号：US17387852

申请日：2021-07-28

Applicant: Waymo LLC

Inventor： Hang Zhao , Jiyang Gao , Chen Sun , Yi Shen , Yuning Chai , Cordelia Luise Schmid , Congcong Li , Benjamin Sapp , Dragomir Anguelov , Tian Lan , Yue Shen

IPC: B60W60/00 , G06N3/02

CPC classification number: B60W60/001 , G06N3/02 , B60W2420/42 , B60W2554/4049

Abstract: A system obtains scene context data characterizing the environment. The scene context data includes data that characterizes a trajectory of an agent in a vicinity of a vehicle up to a current time point. The system identifies a plurality of initial target locations, and generates, for each of a plurality of target locations that each corresponds to one of the initial target locations, a respective predicted likelihood score that represents a likelihood that the target location will be an intended final location for a future trajectory of the agent. For each target location in a first subset of the target locations, the system generates a predicted future trajectory for the agent given that the target location is the intended final location for the future trajectory. The system further selects, as likely future trajectories of the agent, one or more of the predicted future trajectories.

8.

发明授权
Recurrent neural network classifier 有权

公开(公告)号：US11880758B1

公开(公告)日：2024-01-23

申请号：US17391627

申请日：2021-08-02

Applicant: Waymo LLC

Inventor： Congcong Li , Ury Zhilinsky , Yun Jiang , Zhaoyin Jia

IPC: G06N3/04 , G06N3/08 , G06N3/044 , G06N3/045

CPC classification number: G06N3/044 , G06N3/045 , G06N3/08

Abstract: Disclosed herein are neural networks for generating target classifications for an object from a set of input sequences. Each input sequence includes a respective input at each of multiple time steps, and each input sequence corresponds to a different sensing subsystem of multiple sensing subsystems. For each time step in the multiple time steps and for each input sequence in the set of input sequences, a respective feature representation is generated for the input sequence by processing the respective input from the input sequence at the time step using a respective encoder recurrent neural network (RNN) subsystem for the sensing subsystem that corresponds to the input sequence. For each time step in at least a subset of the multiple time steps, the respective feature representations are processed using a classification neural network subsystem to select a respective target classification for the object at the time step.

9.

发明授权
Geo-motion and appearance aware data association 有权

公开(公告)号：US11830203B2

公开(公告)日：2023-11-28

申请号：US17728682

申请日：2022-04-25

Applicant: Waymo LLC

Inventor： Ruichi Yu , Sachithra Madhawa Hemachandra , Ian James Mahon , Congcong Li

IPC: G06T7/246 , G06N3/084 , G06N3/045 , G05D1/00

CPC classification number: G06T7/248 , G06N3/045 , G06N3/084 , G05D1/0088 , G06T2207/20084 , G06T2207/30252

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating a new measurement of an object surrounding a vehicle with a maintained track. One of the methods includes receiving an object track for a particular object, receiving a new measurement characterizing a new object at a new time step, and determining whether the new object is the same as the particular object, comprising: generating a representation of the new object at the new and preceding time steps; generating a representation of the particular object at the new and preceding time steps; processing a first network input comprising the representations using a first neural network to generate an embedding of the first network input; and processing the embedding of the first network input using a second neural network to generate a predicted likelihood that the new object and the particular object are the same.

10.

发明申请
TRAINING DISTILLED MACHINE LEARNING MODELS USING A PRE-TRAINED FEATURE EXTRACTOR 有权

公开(公告)号：US20220366263A1

公开(公告)日：2022-11-17

申请号：US17313655

申请日：2021-05-06

Applicant: Waymo LLC

Inventor： Ming Ji , Edward Stephen Walker, JR. , Yang Song , Zijian Guo , Congcong Li

IPC: G06N3/08 , G06N3/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a student machine learning model using a teacher machine learning model that has a pre-trained feature extractor. In one aspect, a method includes obtaining data specifying the teacher machine learning model that is configured to perform a machine learning task; obtaining first training data; training the teacher machine learning model on the first training data to obtain a trained teacher machine learning model; generating second, automatically labeled training data by using the trained teacher machine learning model to process unlabeled training data; and training a student machine learning model to perform the machine learning task using at least the second, automatically labeled training data, wherein the student machine learning model does not include the pre-trained feature extractor and instead includes a different feature extractor having fewer parameters than the pre-trained feature extractor.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification