Systems And Methods For Improved Video Understanding

Invention Application

US20230017072A1 Systems And Methods For Improved Video Understanding (In Force)


Patent Title: Systems And Methods For Improved Video Understanding
Application No.: US17370522

Application Date: 2021-07-08
Publication No.: US20230017072A1

Publication Date: 2023-01-19
Inventor: Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lucic, Cordelia Luise Schmid
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Main IPC: G06K9/00
IPC: G06K9/00; G06N20/00


Abstract:

A computer-implemented method for classifying video data with improved accuracy includes obtaining, by a computing system comprising one or more computing devices, video data comprising a plurality of video frames; extracting, by the computing system, a plurality of video tokens from the video data, the plurality of video tokens comprising a representation of spatiotemporal information in the video data; providing, by the computing system, the plurality of video tokens as input to a video understanding model, the video understanding model comprising a video transformer encoder model; and receiving, by the computing system, a classification output from the video understanding model.
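The four steps in the abstract (obtain frames, extract spatiotemporal tokens, feed a transformer encoder, read a classification output) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the application text: the non-overlapping "tubelet" tokenization, the tubelet size, the prepended classification token, the single-head attention, the layer sizes, and the random untrained weights. The function names (`extract_tubelet_tokens`, `classify_video`, `init_params`) are hypothetical.

```python
import numpy as np

def extract_tubelet_tokens(video, t, h, w):
    """Split a (T, H, W, C) video into non-overlapping t x h x w tubelets.

    Each tubelet spans several frames, so every flattened token carries
    both spatial and temporal information. (Assumed tokenization scheme.)
    """
    T, H, W, C = video.shape
    assert T % t == 0 and H % h == 0 and W % w == 0, "sizes must divide evenly"
    x = video.reshape(T // t, t, H // h, h, W // w, w, C)
    x = x.transpose(0, 2, 4, 1, 3, 5, 6)      # (nt, nh, nw, t, h, w, C)
    return x.reshape(-1, t * h * w * C)       # (num_tokens, token_dim)

def layer_norm(x, eps=1e-6):
    mu = x.mean(-1, keepdims=True)
    var = x.var(-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def softmax(x):
    e = np.exp(x - x.max(-1, keepdims=True))
    return e / e.sum(-1, keepdims=True)

def encoder_layer(z, p):
    """One pre-norm transformer encoder layer with single-head self-attention."""
    y = layer_norm(z)
    q, k, v = y @ p["wq"], y @ p["wk"], y @ p["wv"]
    attn = softmax(q @ k.T / np.sqrt(q.shape[-1]))
    z = z + (attn @ v) @ p["wo"]                       # attention + residual
    y = layer_norm(z)
    return z + np.maximum(y @ p["w1"], 0.0) @ p["w2"]  # ReLU MLP + residual

def init_params(rng, token_dim, num_tokens, d=32, num_layers=2, num_classes=5):
    """Random, untrained weights; sizes are arbitrary illustrative choices."""
    def w(*shape):
        return rng.normal(0.0, 0.02, shape)
    layers = [dict(wq=w(d, d), wk=w(d, d), wv=w(d, d), wo=w(d, d),
                   w1=w(d, 4 * d), w2=w(4 * d, d)) for _ in range(num_layers)]
    return dict(embed=w(token_dim, d), cls=w(1, d),
                pos=w(num_tokens + 1, d), layers=layers, head=w(d, num_classes))

def classify_video(video, params, tubelet=(2, 16, 16)):
    tokens = extract_tubelet_tokens(video, *tubelet)   # extract video tokens
    z = tokens @ params["embed"]                       # linear token embedding
    z = np.concatenate([params["cls"], z], axis=0) + params["pos"]
    for p in params["layers"]:                         # transformer encoder
        z = encoder_layer(z, p)
    logits = layer_norm(z[0]) @ params["head"]         # read the class token
    return softmax(logits)                             # classification output

rng = np.random.default_rng(0)
video = rng.random((8, 32, 32, 3))                     # 8 synthetic RGB frames
params = init_params(rng, token_dim=2 * 16 * 16 * 3, num_tokens=16)
probs = classify_video(video, params)
print(probs.shape)
```

With an 8-frame 32x32 clip and 2x16x16 tubelets, the sketch produces 16 tokens of dimension 1536 and a probability vector over five placeholder classes. Because the weights are untrained, the probabilities carry no meaning; only the data flow is illustrated.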

Public/Granted literature

US12112538B2 Systems and methods for improved video understanding, Public/Granted date: 2024-10-08


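IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)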