Co-Training of Action Recognition Machine Learning Models

    Publication No.: US20250037426A1

    Publication Date: 2025-01-30

    Application No.: US18716912

    Filing Date: 2022-12-09

    Applicant: Google LLC

Abstract: A method includes obtaining video datasets, each including pairs of a training video and a ground-truth action classification of the training video. The method also includes generating an action recognition model that includes a shared encoder model and action classification heads. The number of action classification heads may be equal to the number of video datasets, and each action classification head may be configured to classify, based on an output of the shared encoder model, training videos sampled from a corresponding video dataset. The method also includes determining, by the action recognition model and for each training video sampled from the video datasets, an inferred action classification. The method further includes determining a loss value based on the inferred action classifications and the ground-truth action classifications, and adjusting parameters of the action recognition model based on the loss value.
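
The structure described in the abstract (one shared encoder, one classification head per dataset, a single loss that drives updates to both) can be illustrated with a minimal sketch. This is not the implementation claimed in the application. It assumes each training video has already been reduced to a short feature vector, uses a plain linear encoder and linear heads, and averages a softmax cross-entropy loss over the batch. The names `CoTrainedModel`, `train_step`, and the toy data are hypothetical and chosen only for illustration.

```python
import math
import random


def matvec(W, x):
    # W is a list of rows; returns the matrix-vector product W @ x.
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


class CoTrainedModel:
    """Hypothetical toy model: a shared linear encoder plus one head per dataset."""

    def __init__(self, in_dim, hid_dim, classes_per_dataset, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.encoder = init(hid_dim, in_dim)  # shared across all datasets
        # The number of heads equals the number of datasets.
        self.heads = [init(n, hid_dim) for n in classes_per_dataset]

    def forward(self, x, dataset_idx):
        h = matvec(self.encoder, x)  # shared representation
        probs = softmax(matvec(self.heads[dataset_idx], h))
        return h, probs

    def train_step(self, batch, lr=0.1):
        # batch: list of (features, label, dataset_idx) drawn from all datasets.
        g_enc = [[0.0] * len(r) for r in self.encoder]
        g_heads = [[[0.0] * len(r) for r in head] for head in self.heads]
        total_loss = 0.0
        for x, y, k in batch:
            h, p = self.forward(x, k)
            total_loss += -math.log(p[y])  # cross-entropy (an assumed loss choice)
            dz = [pi - (1.0 if i == y else 0.0) for i, pi in enumerate(p)]
            # Gradient for the head that owns this example's dataset.
            for i, d in enumerate(dz):
                for j, hj in enumerate(h):
                    g_heads[k][i][j] += d * hj
            # Back-propagate through the head into the shared encoder.
            dh = [sum(self.heads[k][i][j] * dz[i] for i in range(len(dz)))
                  for j in range(len(h))]
            for j, d in enumerate(dh):
                for m, xm in enumerate(x):
                    g_enc[j][m] += d * xm
        n = len(batch)
        # One gradient-descent update of encoder and heads from the joint loss.
        for j, row in enumerate(self.encoder):
            for m in range(len(row)):
                row[m] -= lr * g_enc[j][m] / n
        for k, head in enumerate(self.heads):
            for i, row in enumerate(head):
                for j in range(len(row)):
                    row[j] -= lr * g_heads[k][i][j] / n
        return total_loss / n


# Toy data: dataset 0 has 2 action classes, dataset 1 has 3.
batch = [
    ([1.0, 0.0, 0.0, 0.0], 0, 0),
    ([0.0, 1.0, 0.0, 0.0], 1, 0),
    ([0.0, 0.0, 1.0, 0.0], 0, 1),
    ([0.0, 0.0, 0.0, 1.0], 1, 1),
    ([1.0, 1.0, 0.0, 0.0], 2, 1),
]
model = CoTrainedModel(in_dim=4, hid_dim=3, classes_per_dataset=[2, 3])
losses = [model.train_step(batch) for _ in range(200)]
```

Because every example passes through the same encoder, gradients from all datasets accumulate in the encoder weights, while each head only receives gradients from examples of its own dataset. In this sketch the label spaces of the two datasets stay separate, which is why the heads can have different output sizes.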
