Patent search ap:("Google LLC") AND inv:"Fei Sha" Page 1

1.

发明公开
Hierarchical Video Encoders 审中-公开

公开(公告)号：US20240114158A1

公开(公告)日：2024-04-04

申请号：US18529173

申请日：2023-12-05

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , G06N20/00 , H04N19/172

CPC classification number: H04N19/30 , G06N20/00 , H04N19/172

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

2.

发明申请
Hierarchical Video Encoders 有权

公开(公告)号：US20230103148A1

公开(公告)日：2023-03-30

申请号：US18070556

申请日：2022-11-29

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , G06N20/00 , H04N19/172

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

3.

发明申请
Hierarchical Video Encoders 有权

公开(公告)号：US20220256175A1

公开(公告)日：2022-08-11

申请号：US17162150

申请日：2021-01-29

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , H04N19/172 , G06N20/00

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

4.

发明授权
Hierarchical video encoders 有权

公开(公告)号：US12301847B2

公开(公告)日：2025-05-13

申请号：US18529173

申请日：2023-12-05

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , G06N20/00 , H04N19/172 , H04N19/20 , H04N19/40

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

5.

发明申请
Co-Training of Action Recognition Machine Learning Models 有权

公开(公告)号：US20250037426A1

公开(公告)日：2025-01-30

申请号：US18716912

申请日：2022-12-09

Applicant: Google LLC

Inventor： Bowen Zhang , Jiahui Yu , Christopher Fifty , Wei Han , Andrew M. Dai , Ruoming Pang , Fei Sha

IPC: G06V10/764 , G06V10/774

Abstract: A method includes obtaining video datasets each including pairs of a training video and a ground-truth action classification of the training video. The method also includes generating an action recognition model that includes a shared encoder model and action classification heads. A number of the action classifications heads may be equal to a number of the video datasets, and each action classification head may be configured to, based on an output of the shared encoder model, classify training videos sampled from a corresponding video dataset. The method also includes determining, by the action recognition model and for each training video sampled from the video datasets, an inferred action classification. The method further includes determining a loss value based on the inferred action classifications and the ground-truth action classifications, and adjusting parameters of the action recognition model based on the loss value.

6.

发明授权
Hierarchical video encoders 有权

公开(公告)号：US11876986B2

公开(公告)日：2024-01-16

申请号：US18070556

申请日：2022-11-29

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , H04N19/00 , H04N19/172 , G06N20/00

CPC classification number: H04N19/30 , G06N20/00 , H04N19/172

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

7.

发明授权
Hierarchical video encoders 有权

公开(公告)号：US11533495B2

公开(公告)日：2022-12-20

申请号：US17162150

申请日：2021-01-29

Applicant: Google LLC

Inventor： Vihan Jain , Joonseok Lee , Ming Zhao , Sheide Chammas , Hexiang Hu , Bowen Zhang , Fei Sha , Tze Way Eugene Ie

IPC: H04N19/30 , H04N19/00 , H04N19/172 , G06N20/00

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representation as an output.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification