Pose Empowered RGB-Flow Net
    2.
    发明公开

    公开(公告)号:US20230419538A1

    公开(公告)日:2023-12-28

    申请号:US18464912

    申请日:2023-09-11

    Applicant: Google LLC

    Abstract: A method includes receiving video data that includes a series of frames of image data. Here, the video data is representative of an actor performing an activity. The method also includes processing the video data to generate a spatial input stream including a series of spatial images representative of spatial features of the actor performing the activity, a temporal input stream representative of motion of the actor performing the activity, and a pose input stream including a series of images representative of a pose of the actor performing the activity. Using at least one neural network, the method also includes processing the temporal input stream, the spatial input stream, and the pose input stream. The method also includes classifying, by the at least one neural network, the activity based on the temporal input stream, the spatial input stream, and the pose input stream.

    Pose empowered RGB-flow net
    3.
    发明授权

    公开(公告)号:US11776156B2

    公开(公告)日:2023-10-03

    申请号:US17303969

    申请日:2021-06-11

    Applicant: Google LLC

    Abstract: A method includes receiving video data that includes a series of frames of image data. Here, the video data is representative of an actor performing an activity. The method also includes processing the video data to generate a spatial input stream including a series of spatial images representative of spatial features of the actor performing the activity, a temporal input stream representative of motion of the actor performing the activity, and a pose input stream including a series of images representative of a pose of the actor performing the activity. Using at least one neural network, the method also includes processing the temporal input stream, the spatial input stream, and the pose input stream. The method also includes classifying, by the at least one neural network, the activity based on the temporal input stream, the spatial input stream, and the pose input stream.

    Systems and Methods for Training Multi-Class Object Classification Models with Partially Labeled Training Data

    公开(公告)号:US20230274527A1

    公开(公告)日:2023-08-31

    申请号:US18015301

    申请日:2020-10-06

    Applicant: Google LLC

    CPC classification number: G06V10/764 G06V10/776

    Abstract: Systems and methods of the present disclosure are directed to a computer-implemented method for training a machine-learned multi-class object classification model with partially labeled training data. The method can include obtaining image data depicting objects and ground truth data comprising a subset of object class annotations respectively associated with a subset of object classes of a plurality of object classes. The method can include processing the image data with the machine-learned multi-class object classification model to obtain object classification data. The method can include evaluating a loss function that evaluates a multi-class classification loss and adjusting one or more parameters of the multi-class object classification model based on the loss function.

    Pose Empowered RGB-Flow Net
    5.
    发明申请

    公开(公告)号:US20210390733A1

    公开(公告)日:2021-12-16

    申请号:US17303969

    申请日:2021-06-11

    Applicant: Google LLC

    Abstract: A method includes receiving video data that includes a series of frames of image data. Here, the video data is representative of an actor performing an activity. The method also includes processing the video data to generate a spatial input stream including a series of spatial images representative of spatial features of the actor performing the activity, a temporal input stream representative of motion of the actor performing the activity, and a pose input stream including a series of images representative of a pose of the actor performing the activity. Using at least one neural network, the method also includes processing the temporal input stream, the spatial input stream, and the pose input stream. The method also includes classifying, by the at least one neural network, the activity based on the temporal input stream, the spatial input stream, and the pose input stream.

Patent Agency Ranking