Invention Grant
- Patent Title: Hierarchical video encoders
- Application No.: US18070556
- Application Date: 2022-11-29
- Publication No.: US11876986B2
- Publication Date: 2024-01-16
- Inventor: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: GOOGLE LLC
- Current Assignee: GOOGLE LLC
- Current Assignee Address: US CA Mountain View
- Agency: Dority & Manning, P.A.
- Original application number of the division: US17162150 (2021-01-29)
- Main IPC: H04N19/30
- IPC: H04N19/30; H04N19/00; H04N19/172; G06N20/00

Abstract:
A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes: obtaining a video, wherein the video includes a plurality of frames; processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames; determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations; processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations; determining a video representation based at least in part on the plurality of contextualized segment representations; and providing the video representation as an output.
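The data flow recited in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`toy_frame_encoder`, `toy_segment_encoder`, `encode_video`, `frames_per_segment`) is hypothetical, and the fixed `tanh` mapping, mean pooling, and dot-product attention are stand-ins assumed here in place of the machine-learned frame-level and segment-level encoder models, which the abstract does not specify.

```python
import math
from typing import Callable, List

Vector = List[float]


def mean_pool(vectors: List[Vector]) -> Vector:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def toy_frame_encoder(frame: Vector) -> Vector:
    """Stand-in for the machine-learned frame-level encoder (assumption)."""
    return [math.tanh(value) for value in frame]


def toy_segment_encoder(segments: List[Vector]) -> List[Vector]:
    """Stand-in for the machine-learned segment-level encoder (assumption).

    Each segment attends to all segments with softmax-normalized dot-product
    weights, so every output depends on the whole segment sequence.
    """
    contextualized = []
    for query in segments:
        scores = [sum(q * k for q, k in zip(query, key)) for key in segments]
        peak = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(score - peak) for score in scores]
        total = sum(weights)
        contextualized.append([
            sum(w * seg[i] for w, seg in zip(weights, segments)) / total
            for i in range(len(query))
        ])
    return contextualized


def encode_video(
    frames: List[Vector],
    frames_per_segment: int,
    frame_encoder: Callable[[Vector], Vector] = toy_frame_encoder,
    segment_encoder: Callable[[List[Vector]], List[Vector]] = toy_segment_encoder,
) -> Vector:
    """Frames -> frame reps -> segment reps -> contextualized reps -> video rep."""
    if not frames or frames_per_segment < 1:
        raise ValueError("need at least one frame and a positive segment size")
    # 1. One representation per frame.
    frame_reps = [frame_encoder(frame) for frame in frames]
    # 2. Segment representations: mean-pool consecutive frame representations
    #    (the pooling rule is an assumption; the last segment may be shorter).
    segment_reps = [
        mean_pool(frame_reps[start:start + frames_per_segment])
        for start in range(0, len(frame_reps), frames_per_segment)
    ]
    # 3. Contextualize the segments against each other.
    contextualized = segment_encoder(segment_reps)
    # 4. Video representation: pool the contextualized segments and return it.
    return mean_pool(contextualized)


# Usage: six 3-dimensional "frames" grouped into segments of two frames each.
example_frames = [[0.1 * i, 0.2 * i, -0.1 * i] for i in range(6)]
video_representation = encode_video(example_frames, frames_per_segment=2)
```

Passing the two encoders as callables keeps the hierarchy (frames, then segments, then video) separate from the choice of model at each level, so trained models could be substituted for the toy functions without changing `encode_video`.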
Public/Granted literature
- US20230103148A1 Hierarchical Video Encoders (Public/Granted day: 2023-03-30)