Invention Grant
- Patent Title: Systems and methods for improved video understanding
- Application No.: US17370522
- Application Date: 2021-07-08
- Publication No.: US12112538B2
- Publication Date: 2024-10-08
- Inventor: Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Luise Schmid
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: GOOGLE LLC
- Current Assignee: GOOGLE LLC
- Current Assignee Address: US CA Mountain View
- Agency: DORITY & MANNING P.A.
- Main IPC: G06V20/40
- IPC: G06V20/40; G06N20/00

Abstract:
A computer-implemented method for classifying video data with improved accuracy includes obtaining, by a computing system comprising one or more computing devices, video data comprising a plurality of video frames; extracting, by the computing system, a plurality of video tokens from the video data, the plurality of video tokens comprising a representation of spatiotemporal information in the video data; providing, by the computing system, the plurality of video tokens as input to a video understanding model, the video understanding model comprising a video transformer encoder model; and receiving, by the computing system, a classification output from the video understanding model.
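The token-extraction step in the abstract can be illustrated with a minimal sketch. The abstract says only that the tokens represent spatiotemporal information; cutting the video into non-overlapping blocks that span several frames is one possible way to obtain such tokens and is an assumption here. The helper name `extract_video_tokens`, the block shape, and the single-channel toy input are all hypothetical.

```python
from typing import List

# frames x height x width (single channel, for brevity)
Video = List[List[List[float]]]


def extract_video_tokens(video: Video, t: int, h: int, w: int) -> List[List[float]]:
    """Cut a video into non-overlapping t x h x w blocks and flatten each into a token.

    Hypothetical helper: the block shape and the flattening order are
    illustrative assumptions, not details taken from the abstract.
    """
    n_frames, height, width = len(video), len(video[0]), len(video[0][0])
    if n_frames % t or height % h or width % w:
        raise ValueError("video dimensions must be divisible by the block shape")

    tokens = []
    for t0 in range(0, n_frames, t):
        for y0 in range(0, height, h):
            for x0 in range(0, width, w):
                # Each token spans several frames, so it carries both
                # spatial and temporal information.
                token = [
                    video[f][y][x]
                    for f in range(t0, t0 + t)
                    for y in range(y0, y0 + h)
                    for x in range(x0, x0 + w)
                ]
                tokens.append(token)
    return tokens


# Toy input: 4 frames of 4 x 4 pixels, where each pixel value is its frame index.
video = [[[float(f)] * 4 for _ in range(4)] for f in range(4)]
tokens = extract_video_tokens(video, t=2, h=2, w=2)
print(len(tokens), len(tokens[0]))
```

In a full pipeline, vectors like these would typically be projected to the model dimension before being passed to a transformer encoder that produces the classification output; that stage is omitted from this sketch.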
Public/Granted literature
- US20230017072A1 Systems And Methods For Improved Video Understanding (Public/Granted day: 2023-01-19)