UNSUPERVISED DEPTH PREDICTION NEURAL NETWORKS

    公开(公告)号:US20210319578A1

    公开(公告)日:2021-10-14

    申请号:US17272419

    申请日:2019-09-05

    Applicant: Google LLC

    Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.

    Training neural networks using consistency measures

    公开(公告)号:US11544498B2

    公开(公告)日:2023-01-03

    申请号:US17194090

    申请日:2021-03-05

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using consistency measures. One of the methods includes processing a particular training example from a mediator training data set using a first neural network to generate a first output for a first machine learning task; processing the particular training example in the mediator training data set using each of one or more second neural networks, wherein each second neural network is configured to generate a second output for a respective second machine learning task; determining, for each second machine learning task, a consistency target output for the first machine learning task; determining, for each second machine learning task, an error between the first output and the consistency target output corresponding to the second machine learning task; and generating a parameter update for the first neural network from the determined errors.

    Unsupervised depth prediction neural networks

    公开(公告)号:US12260576B2

    公开(公告)日:2025-03-25

    申请号:US18367888

    申请日:2023-09-13

    Applicant: Google LLC

    Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.

    ROBOT NAVIGATION IN DEPENDENCE ON GESTURE(S) OF HUMAN(S) IN ENVIRONMENT WITH ROBOT

    公开(公告)号:US20240094736A1

    公开(公告)日:2024-03-21

    申请号:US18240124

    申请日:2023-08-30

    Applicant: GOOGLE LLC

    Abstract: Training and/or utilizing a high-level neural network (NN) model, such as a sequential NN model. The high-level NN model, when trained, can be used to process a sequence of consecutive state data instances (e.g., N most recent, including a current state date instance) to generate a sequence of outputs that indicate a sequence of position deltas. The sequence of position deltas can be used to generate an intermediate target position for navigation and, optionally, an intermediate target orientation that corresponds to the intermediate target position. The intermediate target position and, optionally, the intermediate target orientation, can be provided to a low-level navigation policy, such as an MPC policy, and used by the low-level navigation policy as its goal position (and optionally goal orientation) for a plurality of iterations (e.g., until a new intermediate target position (and optionally new target orientation) is generated using the high-level NN model.

    Unsupervised depth prediction neural networks

    公开(公告)号:US11783500B2

    公开(公告)日:2023-10-10

    申请号:US17272419

    申请日:2019-09-05

    Applicant: Google LLC

    Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.

    Future semantic segmentation prediction using 3D structure

    公开(公告)号:US11100646B2

    公开(公告)日:2021-08-24

    申请号:US16562819

    申请日:2019-09-06

    Applicant: Google LLC

    Abstract: A method for generating a predicted segmentation map for potential objects in a future scene depicted in a future image is described. The method includes receiving input images that depict a same scene; processing a current input image to generate a segmentation map for potential objects in the current input image and a respective depth map; generating a point cloud for the current input image; processing the input images to generate, for each pair of two input images in the sequence, a respective ego-motion output that characterizes motion of the camera between the two input images; processing the ego-motion outputs to generate a future ego-motion output; processing the point cloud of the current input image and the future ego-motion output to generate a future point cloud; and processing the future point cloud to generate the predicted segmentation map for potential objects in the future scene depicted in the future image.

Patent Agency Ranking