Patent search ap:("NVIDIA Corporation") AND inv:"Fitsum A. Reda" Page 1

1.

发明申请
VIDEO PREDICTION USING SPATIALLY DISPLACED CONVOLUTION 审中-公开

公开(公告)号：US20190297326A1

公开(公告)日：2019-09-26

申请号：US16360853

申请日：2019-03-21

Applicant: NVIDIA Corporation

Inventor： Fitsum A. Reda , Guilin Liu , Kevin Shih , Robert Kirby , Jonathan Barker , David Tarjan , Andrew Tao , Bryan Catanzaro

IPC: H04N19/139 , G06N3/08 , G06N20/10 , G06N3/04 , G06N20/20 , H04N19/587 , H04N19/132 , H04N19/172

Abstract: A neural network architecture is disclosed for performing video frame prediction using a sequence of video frames and corresponding pairwise optical flows. The neural network processes the sequence of video frames and optical flows utilizing three-dimensional convolution operations, where time (or multiple video frames in the sequence of video frames) provides the third dimension in addition to the two-dimensional pixel space of the video frames. The neural network generates a set of parameters used to predict a next video frame in the sequence of video frames by sampling a previous video frame utilizing spatially-displaced convolution operations. In one embodiment, the set of parameters includes a displacement vector and at least one convolution kernel per pixel. Generating a pixel value in the next video frame includes applying the convolution kernel to a corresponding patch of pixels in the previous video frame based on the displacement vector.

2.

发明申请
IMAGE IN-PAINTING FOR IRREGULAR HOLES USING PARTIAL CONVOLUTIONS 审中-公开

公开(公告)号：US20190295228A1

公开(公告)日：2019-09-26

申请号：US16360895

申请日：2019-03-21

Applicant: NVIDIA Corporation

Inventor： Guilin Liu , Fitsum A. Reda , Kevin Shih , Ting-Chun Wang , Andrew Tao , Bryan Catanzaro

IPC: G06T5/00 , G06T5/20 , G06N3/08 , G06N20/10 , G06T3/40

Abstract: A neural network architecture is disclosed for performing image in-painting using partial convolution operations. The neural network processes an image and a corresponding mask that identifies holes in the image utilizing partial convolution operations, where the mask is used by the partial convolution operation to zero out coefficients of the convolution kernel corresponding to invalid pixel data for the holes. The mask is updated after each partial convolution operation is performed in an encoder section of the neural network. In one embodiment, the neural network is implemented using an encoder-decoder framework with skip links to forward representations of the features at different sections of the encoder to corresponding sections of the decoder.

Patent Agency Ranking