Model disentanglement for domain adaptation

    公开(公告)号:US12019726B2

    公开(公告)日:2024-06-25

    申请号:US17655506

    申请日:2022-03-18

    CPC分类号: G06F21/32 G06N5/022

    摘要: Certain aspects of the present disclosure provide techniques for improved domain adaptation in machine learning. A feature tensor is generated by processing input data using a feature extractor. A first set of logits is generated by processing the feature tensor using a domain-agnostic classifier, and a second set of logits is generated by processing the feature tensor using a domain-specific classifier. A loss is computed based at least in part on the first set of logits and the second set of logits, where the loss includes a divergence loss component. The feature extractor, the domain-agnostic classifier, and the domain-specific classifier are refined using the loss.

    Sparse optical flow estimation
    3.
    发明授权

    公开(公告)号:US12100169B2

    公开(公告)日:2024-09-24

    申请号:US17481047

    申请日:2021-09-21

    IPC分类号: G06T7/269 G01P13/00 G06T7/246

    摘要: Systems and techniques are described herein for performing optical flow estimation between one or more frames. For example, a process can include determining a subset of pixels of at least one of a first frame and a second frame, and generating a mask indicating the subset of pixels. The process can include determining, based on the mask, one or more features associated with the subset of pixels of at least the first frame and the second frame. The process can include determining optical flow vectors between the subset of pixels of the first frame and corresponding pixels of a second frame. The process can include generating an optical flow map for the second frame using the optical flow vectors.

    Volumetric sampling with correlative characterization for dense estimation

    公开(公告)号:US11640668B2

    公开(公告)日:2023-05-02

    申请号:US17344283

    申请日:2021-06-10

    IPC分类号: G06T7/246 G06T7/73 G06F18/24

    摘要: Systems and techniques are described herein for performing optical flow estimation for one or more frames. For example, a process can include determining an optical flow prediction associated with a plurality of frames. The process can include determining a position of at least one feature associated with a first frame and determining, based on the position of the at least one feature in the first frame and the optical flow prediction, a position estimate of a search area for searching for the at least one feature in a second frame. The process can include determining, from within the search area, a position of the at least one feature in the second frame.

    Efficient video processing via dynamic knowledge propagation

    公开(公告)号:US12067777B2

    公开(公告)日:2024-08-20

    申请号:US17654986

    申请日:2022-03-15

    IPC分类号: G06V20/40 G06V10/82 H04L67/04

    CPC分类号: G06V20/46 G06V10/82 H04L67/04

    摘要: Certain aspects of the present disclosure provide a method of processing video data. In one example, the method includes receiving input video data; sampling a first subset of clips from the input video data; providing the first subset of clips to a first component of a machine learning model to generate first output; sampling a second subset of clips from the input video data, wherein the second subset of clips comprises fewer clips than the first subset of clips; providing the second subset of clips to a second component of the machine learning model to generate a second output; aggregating the first output from the first component of the machine learning model with the second output from the second component of the machine learning model to generate aggregated output; and determining a characteristic of the input video data based on the aggregated output.

    Supervised learning and occlusion masking for optical flow estimation

    公开(公告)号:US12039742B2

    公开(公告)日:2024-07-16

    申请号:US17510763

    申请日:2021-10-26

    摘要: Systems and techniques are described for performing supervised learning (e.g., semi-supervised learning, self-supervised learning, and/or mixed supervision learning) for optical flow estimation. For example, a method can include obtaining an image associated with a sequence of images and generating an occluded image. The occluded image can include at least one of the image with an occlusion applied to the image and a different image of the sequence of images with the occlusion applied. The method can include determining a matching map based at least on matching areas of the image and the occluded image and, based on the matching map, determining a loss term associated with an optical flow loss prediction associated with the image and the occluded image. The loss term may include a matched loss and/or other loss. Based on the loss term, the method can include training a network configured to determine an optical flow between images.

    EFFICIENT VIDEO PROCESSING VIA DYNAMIC KNOWLEDGE PROPAGATION

    公开(公告)号:US20220301310A1

    公开(公告)日:2022-09-22

    申请号:US17654986

    申请日:2022-03-15

    IPC分类号: G06V20/40 G06V10/82 H04L67/04

    摘要: Certain aspects of the present disclosure provide a method of processing video data. In one example, the method includes receiving input video data; sampling a first subset of clips from the input video data; providing the first subset of clips to a first component of a machine learning model to generate first output; sampling a second subset of clips from the input video data, wherein the second subset of clips comprises fewer clips than the first subset of clips; providing the second subset of clips to a second component of the machine learning model to generate a second output; aggregating the first output from the first component of the machine learning model with the second output from the second component of the machine learning model to generate aggregated output; and determining a characteristic of the input video data based on the aggregated output.