Invention Application
- Patent Title: Systems And Methods For Improved Video Understanding
-
Application No.: US17370522Application Date: 2021-07-08
-
Publication No.: US20230017072A1Publication Date: 2023-01-19
- Inventor: Anurag Arnab , Mostafa Dehghani , Georg Heigold , Chen Sun , Mario Lucic , Cordelia Luise Schmid
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06N20/00

Abstract:
A computer-implemented method for classifying video data with improved accuracy includes obtaining, by a computing system comprising one or more computing devices, video data comprising a plurality of video frames; extracting, by the computing system, a plurality of video tokens from the video data, the plurality of video tokens comprising a representation of spatiotemporal information in the video data; providing, by the computing system, the plurality of video tokens as input to a video understanding model, the video understanding model comprising a video transformer encoder model; and receiving, by the computing system, a classification output from the video understanding model.
Public/Granted literature
- US12112538B2 Systems and methods for improved video understanding Public/Granted day:2024-10-08
Information query