DEEP 3D ATTENTION LONG SHORT-TERM MEMORY FOR VIDEO-BASED ACTION RECOGNITION

Invention Application

US20170293804A1 DEEP 3D ATTENTION LONG SHORT-TERM MEMORY FOR VIDEO-BASED ACTION RECOGNITION Under examination (published)

Patent Title: DEEP 3D ATTENTION LONG SHORT-TERM MEMORY FOR VIDEO-BASED ACTION RECOGNITION
Application No.: US15479408

Application Date: 2017-04-05
Publication No.: US20170293804A1

Publication Date: 2017-10-12
Inventor: Renqiang Min, Yang Gao, Eric Cosatto
Applicant: NEC Laboratories America, Inc.
Main IPC: G06K9/00
IPC: G06K9/00; G06K9/62; G06N3/04

Abstract:

A method, a computer program product, and a system are provided for video-based action recognition. The system includes a processor. One or more frames from one or more video sequences are received. A feature vector for each patch of the one or more frames is generated using a deep convolutional neural network. An attention factor for the feature vectors is generated based on a within-frame attention and a between-frame attention. A target action is identified using a multi-layer deep long short-term memory process applied to the attention factor, said target action representing at least one of the one or more video sequences. An operation of a processor-based machine is controlled to change a state of the processor-based machine, responsive to the at least one of the one or more video sequences including the identified target action.
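
The two attention stages named in the abstract can be illustrated with a minimal sketch. It is not the claimed method: the abstract does not say how the attention scores are computed, so the dot-product scoring, the `query` vector, and the function names `softmax`, `dot`, and `attend` are all assumptions made for illustration. The patch feature vectors are taken as given (in the abstract they come from a deep convolutional neural network), and the multi-layer long short-term memory stage that would consume the result is omitted.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def attend(video, query):
    """Hypothetical two-stage attention.

    video: list of frames; each frame is a list of patch feature vectors.
    query: assumed scoring vector (not specified in the source).
    Returns (within-frame weights, between-frame weights, weighted frame vectors).
    """
    within = []
    frame_vectors = []
    for patches in video:
        # Within-frame attention: one weight per patch, summing to 1 per frame.
        weights = softmax([dot(p, query) for p in patches])
        within.append(weights)
        dim = len(patches[0])
        # Pool the patches of this frame into a single vector.
        pooled = [sum(w * p[d] for w, p in zip(weights, patches)) for d in range(dim)]
        frame_vectors.append(pooled)
    # Between-frame attention: one weight per frame, summing to 1 over the video.
    between = softmax([dot(f, query) for f in frame_vectors])
    attended = [[b * x for x in f] for b, f in zip(between, frame_vectors)]
    return within, between, attended

# Toy input: 2 frames, 2 patches per frame, 2-dimensional features.
video = [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 1.0]]]
within, between, attended = attend(video, query=[1.0, 0.0])
```

In this sketch the `attended` sequence, one weighted vector per frame, stands in for the input that a recurrent model would classify into an action label.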

Public/Granted literature

US10296793B2 Deep 3D attention long short-term memory for video-based action recognition Public/Granted day: 2019-05-21

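IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)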