Invention Grant
- Patent Title: Fusing multilayer and multimodal deep neural networks for video classification
- Application No.: US15660719
- Application Date: 2017-07-26
- Publication No.: US10402697B2
- Publication Date: 2019-09-03
- Inventor: Xiaodong Yang, Pavlo Molchanov, Jan Kautz
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Leydig, Voit & Mayer, Ltd.
- Main IPC: G06K9/62
- IPC: G06K9/62; G06K9/00; G06K9/66; G06K9/46; G06N3/04; G06N3/08; G06N20/10

Abstract:
A method, computer-readable medium, and system are disclosed for classifying video image data. The method includes processing training video image data by at least a first layer of a convolutional neural network (CNN) to extract a first set of feature maps and to generate classification output data for the training video image data. Spatial classification accuracy data is computed based on the classification output data and target classification output data, and spatial discrimination factors for the first layer are computed based on the spatial classification accuracy data and the first set of feature maps.
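The abstract gives no formulas, so the following is only a minimal sketch of one possible reading of the steps it names, not the claimed method. It assumes, hypothetically, that a class prediction is available at every spatial location of the layer, that "spatial classification accuracy" means the fraction of training clips classified correctly at each location, and that the "spatial discrimination factors" are those accuracies normalized to sum to one and used to pool the feature maps. The function names, array shapes, and the normalization are all illustrative assumptions; in particular, the abstract does not say how the feature maps enter the factor computation, so here they appear only in the pooling step.

```python
import numpy as np


def spatial_accuracy(pred_labels, target_labels):
    """Per-location accuracy over a training set (hypothetical helper).

    pred_labels:   (N, H, W) int array, predicted class at each location.
    target_labels: (N,) int array, ground-truth class of each clip.
    Returns an (H, W) float array of accuracies in [0, 1].
    """
    correct = pred_labels == target_labels[:, None, None]
    return correct.mean(axis=0)


def discrimination_factors(accuracy_map, eps=1e-8):
    """Normalize an accuracy map into weights that sum to 1 (assumed form)."""
    total = accuracy_map.sum()
    if total < eps:
        # No location is ever correct: fall back to uniform weights.
        return np.full_like(accuracy_map, 1.0 / accuracy_map.size)
    return accuracy_map / total


def weighted_pool(feature_maps, factors):
    """Pool (N, C, H, W) feature maps into (N, C) using (H, W) weights."""
    return np.einsum("nchw,hw->nc", feature_maps, factors)


# Toy data: 2 clips, a 2x2 spatial grid, 3 feature channels.
pred = np.array([[[0, 1], [1, 1]],
                 [[1, 1], [0, 1]]])
target = np.array([1, 1])
features = np.ones((2, 3, 2, 2))

acc = spatial_accuracy(pred, target)
factors = discrimination_factors(acc)
pooled = weighted_pool(features, factors)
```

In this toy setup the right-hand column of the grid is always classified correctly, so it receives twice the weight of the left-hand column when the feature maps are pooled.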
Public/Granted literature
- US20180032846A1 FUSING MULTILAYER AND MULTIMODAL DEEP NEURAL NETWORKS FOR VIDEO CLASSIFICATION Public/Granted day: 2018-02-01