Systems and approaches for learning efficient representations for video understanding

Invention Grant

US11348336B2 Systems and approaches for learning efficient representations for video understanding 有权

Please log in to see more content

Patent Title: Systems and approaches for learning efficient representations for video understanding
Application No.: US15931075

Application Date: 2020-05-13
Publication No.: US11348336B2

Publication Date: 2022-05-31
Inventor: Quanfu Fan , Richard Chen , Sijia Liu , Hildegard Kuehne
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Fleit Intellectual Property Law
Agent Jeffrey N. Giunta
Main IPC: G06V20/00
IPC: G06V20/00 ; G06V20/40 ; G06F17/16 ; G06N3/04

Systems and approaches for learning efficient representations for video understanding

Abstract:

Systems and methods for performing video understanding and analysis. Sets of feature maps for high resolution images and low resolution images in a time sequence of images are combined into combined sets of feature maps each having N feature maps. A time sequence of temporally aggregated sets of feature maps is created for each combined set of feature maps by: selecting a selected combined set of feature maps corresponding to an image at time “t” in the time sequence of images; applying, by channel-wise multiplication, a feature map weighting vector to a number of combined sets of feature maps that are temporally adjacent to the selected combined set of feature maps; and summing elements of the number of combined set of feature maps into a temporally aggregated set of feature maps. The time sequence of temporally aggregated sets of feature maps is processed to perform video understanding processing.

Public/Granted literature

US20210357651A1 SYSTEMS AND APPROACHES FOR LEARNING EFFICIENT REPRESENTATIONS FOR VIDEO UNDERSTANDING Public/Granted day:2021-11-18

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V20/00	场景；特定场景元素（控制数码相机 H04N5/232）