Invention Grant
- Patent Title: Systems and approaches for learning efficient representations for video understanding
-
Application No.: US15931075Application Date: 2020-05-13
-
Publication No.: US11348336B2Publication Date: 2022-05-31
- Inventor: Quanfu Fan , Richard Chen , Sijia Liu , Hildegard Kuehne
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Fleit Intellectual Property Law
- Agent Jeffrey N. Giunta
- Main IPC: G06V20/00
- IPC: G06V20/00 ; G06V20/40 ; G06F17/16 ; G06N3/04

Abstract:
Systems and methods for performing video understanding and analysis. Sets of feature maps for high resolution images and low resolution images in a time sequence of images are combined into combined sets of feature maps each having N feature maps. A time sequence of temporally aggregated sets of feature maps is created for each combined set of feature maps by: selecting a selected combined set of feature maps corresponding to an image at time “t” in the time sequence of images; applying, by channel-wise multiplication, a feature map weighting vector to a number of combined sets of feature maps that are temporally adjacent to the selected combined set of feature maps; and summing elements of the number of combined set of feature maps into a temporally aggregated set of feature maps. The time sequence of temporally aggregated sets of feature maps is processed to perform video understanding processing.
Public/Granted literature
- US20210357651A1 SYSTEMS AND APPROACHES FOR LEARNING EFFICIENT REPRESENTATIONS FOR VIDEO UNDERSTANDING Public/Granted day:2021-11-18
Information query