Invention Application
- Patent Title: ACTION RECOGNITION IN VIDEOS USING 3D SPATIO-TEMPORAL CONVOLUTIONAL NEURAL NETWORKS
-
Application No.: US16681671Application Date: 2019-11-12
-
Publication No.: US20200125852A1Publication Date: 2020-04-23
- Inventor: Joao Carreira , Andrew Zisserman
- Applicant: DeepMind Technologies Limited
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06T7/269 ; G06K9/62 ; G06N3/04 ; G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing video data. An example system receives video data and generates optical flow data. An image sequence from the video data is provided to a first 3D spatio-temporal convolutional neural network to process the image data in at least three space-time dimensions and to provide a first convolutional neural network output. A corresponding sequence of optical flow image frames is provided to a second 3D spatio-temporal convolutional neural network to process the optical flow data in at least three space-time dimensions and to provide a second convolutional neural network output. The first and second convolutional neural network outputs are combined to provide a system output.
Public/Granted literature
- US10789479B2 Action recognition in videos using 3D spatio-temporal convolutional neural networks Public/Granted day:2020-09-29
Information query