Phrase recognition model for autonomous vehicles

    公开(公告)号:US11562573B2

    公开(公告)日:2023-01-24

    申请号:US17123185

    申请日:2020-12-16

    Applicant: WAYMO LLC

    Abstract: Aspects of the disclosure relate to training and using a phrase recognition model to identify phrases in images. As an example, a selected phrase list may include a plurality of phrases is received. Each phrase of the plurality of phrases includes text. An initial plurality of images may be received. A training image set may be selected from the initial plurality of images by identifying the phrase-containing images that include one or more phrases from the selected phrase list. Each given phrase-containing image of the training image set may be labeled with information identifying the one or more phrases from the selected phrase list included in the given phrase-containing images. The model may be trained based on the training image set such that the model is configured to, in response to receiving an input image, output data indicating whether a phrase of the plurality of phrases is included in the input image.

    MULTI-MODAL 3-D POSE ESTIMATION
    3.
    发明申请

    公开(公告)号:US20220156965A1

    公开(公告)日:2022-05-19

    申请号:US17505900

    申请日:2021-10-20

    Applicant: Waymo LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for estimating a 3-D pose of an object of interest from image and point cloud data. In one aspect, a method includes obtaining an image of an environment; obtaining a point cloud of a three-dimensional region of the environment; generating a fused representation of the image and the point cloud; and processing the fused representation using a pose estimation neural network and in accordance with current values of a plurality of pose estimation network parameters to generate a pose estimation network output that specifies, for each of multiple keypoints, a respective estimated position in the three-dimensional region of the environment.

    Object Action Classification For Autonomous Vehicles

    公开(公告)号:US20210294346A1

    公开(公告)日:2021-09-23

    申请号:US17343187

    申请日:2021-06-09

    Applicant: Waymo LLC

    Abstract: Aspects of the disclosure relate to training and using a model for identifying actions of objects. For instance, LIDAR sensor data frames including an object bounding box corresponding to an object as well as an action label for the bounding box may be received. Each sensor frame is associated with a timestamp and is sequenced with respect to other sensor frames. Each given sensor data frame may be projected into a camera image of the object based on the timestamp associated with the given sensor data frame in order to provide fused data. The model may be trained using the fused data such that the model is configured to, in response to receiving fused data, the model outputs an action label for each object bounding box of the fused data. This output may then be used to control a vehicle in an autonomous driving mode.

    SPATIO-TEMPORAL EMBEDDINGS
    5.
    发明申请

    公开(公告)号:US20210103744A1

    公开(公告)日:2021-04-08

    申请号:US17063553

    申请日:2020-10-05

    Applicant: Waymo LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a spatio-temporal embedding of a sequence of point clouds. One of the methods includes obtaining a temporal sequence comprising a respective point cloud input corresponding to each of a plurality of time points, each point cloud input comprising point cloud data generated from sensor data captured by one or more sensors of a vehicle at the respective time point; processing each point cloud input using a first neural network to generate a respective spatial embedding that characterizes the point cloud input; and processing the spatial embeddings of the point cloud inputs using a second neural network to generate a spatio-temporal embedding that characterizes the point cloud inputs in the temporal sequence.

    Phrase recognition model for autonomous vehicles

    公开(公告)号:US10699141B2

    公开(公告)日:2020-06-30

    申请号:US16018490

    申请日:2018-06-26

    Applicant: Waymo LLC

    Abstract: Aspects of the disclosure relate to training and using a phrase recognition model to identify phrases in images. As an example, a selected phrase list may include a plurality of phrases is received. Each phrase of the plurality of phrases includes text. An initial plurality of images may be received. A training image set may be selected from the initial plurality of images by identifying the phrase-containing images that include one or more phrases from the selected phrase list. Each given phrase-containing image of the training image set may be labeled with information identifying the one or more phrases from the selected phrase list included in the given phrase-containing images. The model may be trained based on the training image set such that the model is configured to, in response to receiving an input image, output data indicating whether a phrase of the plurality of phrases is included in the input image.

    Agent trajectory prediction using target locations

    公开(公告)号:US11987265B1

    公开(公告)日:2024-05-21

    申请号:US17387852

    申请日:2021-07-28

    Applicant: Waymo LLC

    CPC classification number: B60W60/001 G06N3/02 B60W2420/42 B60W2554/4049

    Abstract: A system obtains scene context data characterizing the environment. The scene context data includes data that characterizes a trajectory of an agent in a vicinity of a vehicle up to a current time point. The system identifies a plurality of initial target locations, and generates, for each of a plurality of target locations that each corresponds to one of the initial target locations, a respective predicted likelihood score that represents a likelihood that the target location will be an intended final location for a future trajectory of the agent. For each target location in a first subset of the target locations, the system generates a predicted future trajectory for the agent given that the target location is the intended final location for the future trajectory. The system further selects, as likely future trajectories of the agent, one or more of the predicted future trajectories.

    Recurrent neural network classifier

    公开(公告)号:US11880758B1

    公开(公告)日:2024-01-23

    申请号:US17391627

    申请日:2021-08-02

    Applicant: Waymo LLC

    CPC classification number: G06N3/044 G06N3/045 G06N3/08

    Abstract: Disclosed herein are neural networks for generating target classifications for an object from a set of input sequences. Each input sequence includes a respective input at each of multiple time steps, and each input sequence corresponds to a different sensing subsystem of multiple sensing subsystems. For each time step in the multiple time steps and for each input sequence in the set of input sequences, a respective feature representation is generated for the input sequence by processing the respective input from the input sequence at the time step using a respective encoder recurrent neural network (RNN) subsystem for the sensing subsystem that corresponds to the input sequence. For each time step in at least a subset of the multiple time steps, the respective feature representations are processed using a classification neural network subsystem to select a respective target classification for the object at the time step.

    Geo-motion and appearance aware data association

    公开(公告)号:US11830203B2

    公开(公告)日:2023-11-28

    申请号:US17728682

    申请日:2022-04-25

    Applicant: Waymo LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for associating a new measurement of an object surrounding a vehicle with a maintained track. One of the methods includes receiving an object track for a particular object, receiving a new measurement characterizing a new object at a new time step, and determining whether the new object is the same as the particular object, comprising: generating a representation of the new object at the new and preceding time steps; generating a representation of the particular object at the new and preceding time steps; processing a first network input comprising the representations using a first neural network to generate an embedding of the first network input; and processing the embedding of the first network input using a second neural network to generate a predicted likelihood that the new object and the particular object are the same.

    TRAINING DISTILLED MACHINE LEARNING MODELS USING A PRE-TRAINED FEATURE EXTRACTOR

    公开(公告)号:US20220366263A1

    公开(公告)日:2022-11-17

    申请号:US17313655

    申请日:2021-05-06

    Applicant: Waymo LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a student machine learning model using a teacher machine learning model that has a pre-trained feature extractor. In one aspect, a method includes obtaining data specifying the teacher machine learning model that is configured to perform a machine learning task; obtaining first training data; training the teacher machine learning model on the first training data to obtain a trained teacher machine learning model; generating second, automatically labeled training data by using the trained teacher machine learning model to process unlabeled training data; and training a student machine learning model to perform the machine learning task using at least the second, automatically labeled training data, wherein the student machine learning model does not include the pre-trained feature extractor and instead includes a different feature extractor having fewer parameters than the pre-trained feature extractor.

Patent Agency Ranking