Fusing multilayer and multimodal deep neural networks for video classification

Invention Grant

US10402697B2 Fusing multilayer and multimodal deep neural networks for video classification (Active)

Patent Title: Fusing multilayer and multimodal deep neural networks for video classification
Application No.: US15660719

Application Date: 2017-07-26
Publication No.: US10402697B2

Publication Date: 2019-09-03
Inventor: Xiaodong Yang, Pavlo Molchanov, Jan Kautz
Applicant: NVIDIA Corporation
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corporation
Current Assignee: NVIDIA Corporation
Current Assignee Address: US CA Santa Clara
Agency: Leydig, Voit & Mayer, Ltd.
Main IPC: G06K9/62
IPC: G06K9/62; G06K9/00; G06K9/66; G06K9/46; G06N3/04; G06N3/08; G06N20/10

Abstract:

A method, computer readable medium, and system are disclosed for classifying video image data. The method includes processing training video image data by at least a first layer of a convolutional neural network (CNN) to extract a first set of feature maps and to generate classification output data for the training video image data. Spatial classification accuracy data is computed based on the classification output data and target classification output data. Spatial discrimination factors for the first layer are then computed based on the spatial classification accuracy data and the first set of feature maps.
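
The data flow in the abstract (feature maps and classification output, then accuracy data, then discrimination factors) can be illustrated with a minimal sketch. The abstract does not state how the discrimination factors are calculated, so the formula below is an assumption chosen only for illustration: each feature-map channel is scored by its accuracy-weighted mean activation, and the scores are normalized to sum to 1. The function names `classification_accuracy` and `discrimination_factors` are hypothetical, and activations are assumed non-negative (for example, post-ReLU).

```python
from typing import List, Sequence

# A 2D feature map (H x W) stored as nested lists of floats.
FeatureMap = List[List[float]]


def classification_accuracy(predicted: Sequence[int],
                            target: Sequence[int]) -> List[float]:
    """Per-sample accuracy: 1.0 where the predicted class equals the target."""
    if len(predicted) != len(target):
        raise ValueError("predicted and target must have the same length")
    return [1.0 if p == t else 0.0 for p, t in zip(predicted, target)]


def discrimination_factors(feature_maps: Sequence[Sequence[FeatureMap]],
                           accuracies: Sequence[float]) -> List[float]:
    """Illustrative stand-in, NOT the patented formula.

    feature_maps[n][c] is the map of channel c for training sample n.
    Each channel is scored by the accuracy-weighted sum of its mean
    activations; scores are then normalized so they sum to 1.
    """
    num_channels = len(feature_maps[0])
    scores = []
    for c in range(num_channels):
        total = 0.0
        for sample_maps, acc in zip(feature_maps, accuracies):
            cells = [v for row in sample_maps[c] for v in row]
            total += acc * (sum(cells) / len(cells))
        scores.append(total)
    norm = sum(scores)
    if norm == 0:
        # No correctly classified sample contributed: fall back to uniform.
        return [1.0 / num_channels] * num_channels
    return [s / norm for s in scores]


# Two training samples, two channels, 2x2 feature maps (made-up values).
maps = [
    [[[1.0, 1.0], [1.0, 1.0]], [[3.0, 3.0], [3.0, 3.0]]],
    [[[5.0, 5.0], [5.0, 5.0]], [[1.0, 1.0], [1.0, 1.0]]],
]
acc = classification_accuracy(predicted=[0, 1], target=[0, 2])
factors = discrimination_factors(maps, acc)
```

In this toy input only the first sample is classified correctly, so only its activations contribute to the channel scores. A real system would take the feature maps and predictions from the CNN rather than from hand-written lists.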

Public/Granted literature

US20180032846A1 FUSING MULTILAYER AND MULTIMODAL DEEP NEURAL NETWORKS FOR VIDEO CLASSIFICATION Public/Granted day: 2018-02-01

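IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)
G06K9/62	. Methods or arrangements for recognition using electronic means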