-
公开(公告)号:US10373332B2
公开(公告)日:2019-08-06
申请号:US15836549
申请日:2017-12-08
Applicant: NVIDIA Corporation
Inventor: Jinwei Gu , Xiaodong Yang , Shalini De Mello , Jan Kautz
Abstract: A method, computer readable medium, and system are disclosed for dynamic facial analysis. The method includes the steps of receiving video data representing a sequence of image frames including at least one head and extracting, by a neural network, spatial features comprising pitch, yaw, and roll angles of the at least one head from the video data. The method also includes the step of processing, by a recurrent neural network, the spatial features for two or more image frames in the sequence of image frames to produce head pose estimates for the at least one head.
-
公开(公告)号:US20180293737A1
公开(公告)日:2018-10-11
申请号:US15942213
申请日:2018-03-30
Applicant: NVIDIA Corporation
Inventor: Deqing Sun , Xiaodong Yang , Ming-Yu Liu , Jan Kautz
CPC classification number: G06T7/207 , G06N3/0454 , G06N3/08 , G06N5/046 , G06T3/0093 , G06T7/246 , G06T7/251 , G06T7/97 , G06T2200/28 , G06T2207/10016 , G06T2207/20016 , G06T2207/20032 , G06T2207/20084
Abstract: A method, computer readable medium, and system are disclosed for estimating optical flow between two images. A first pyramidal set of features is generated for a first image and a partial cost volume for a level of the first pyramidal set of features is computed, by a neural network, using features at the level of the first pyramidal set of features and warped features extracted from a second image, where the partial cost volume is computed across a limited range of pixels that is less than a full resolution of the first image, in pixels, at the level. The neural network processes the features and the partial cost volume to produce a refined optical flow estimate for the first image and the second image.
-
公开(公告)号:US09934714B2
公开(公告)日:2018-04-03
申请号:US14660637
申请日:2015-03-17
Applicant: NVIDIA Corporation
Inventor: Felix Heide , Douglas Lanman , Dikpal Reddy , Jan Kautz , Kari Pulli , David Luebke
CPC classification number: G09G3/20 , G09G3/007 , G09G3/2025 , G09G3/36 , G09G2300/023 , G09G2340/0407 , G09G2340/0435
Abstract: System and method of displaying images in temporal superresolution by multiplicative superposition of cascaded display layers integrated in a display device. Using an original video with a target temporal resolution as a priori, a factorization process is performed to derive respective image data for presentation on each display layer. The multiple layers are refreshed in staggered intervals to synthesize a video with an effective refresh rate exceeding that of each individual display layer, e.g., by a factor equal to the number of layers. Further optically averaging neighboring pixels can minimize artifacts.
-
公开(公告)号:US09905196B2
公开(公告)日:2018-02-27
申请号:US15276626
申请日:2016-09-26
Applicant: NVIDIA Corporation
Inventor: Dawid Stanislaw Pajak , Felix Heide , Nagilla Dikpal Reddy , Mushfiqur Rouf , Jan Kautz , Kari Pulli , Orazio Gallo
CPC classification number: G09G5/02 , G06T3/4015 , G06T5/001 , G06T5/002 , G09G5/026 , G09G5/363 , G09G2320/0238 , G09G2320/0242 , G09G2320/0247 , G09G2320/066 , G09G2360/08
Abstract: A computer implemented method of determining a latent image from an observed image is disclosed. The method comprises implementing a plurality of image processing operations within a single optimization framework, wherein the single optimization framework comprises solving a linear minimization expression. The method further comprises mapping the linear minimization expression onto at least one non-linear solver. Further, the method comprises using the non-linear solver, iteratively solving the linear minimization expression in order to extract the latent image from the observed image, wherein the linear minimization expression comprises: a data term, and a regularization term, and wherein the regularization term comprises a plurality of non-linear image priors.
-
公开(公告)号:US09892669B2
公开(公告)日:2018-02-13
申请号:US14660030
申请日:2015-03-17
Applicant: NVIDIA Corporation
Inventor: Felix Heide , Douglas Lanman , Dikpal Reddy , Jan Kautz , Kari Pulli , David Luebke
CPC classification number: G09G3/20 , G09G3/007 , G09G3/2025 , G09G3/36 , G09G2300/023 , G09G2340/0407 , G09G2340/0435
Abstract: System and method of displaying images in spatial/temporal superresolution by multiplicative superposition of cascaded display layers integrated in a display device. Using an original image with a target spatial/temporal resolution as a priori, a factorization process is performed to derive respective image data for presentation on each display layer. The cascaded display layers may be progressive and laterally shifted with each other, resulting in an effective spatial resolution exceeding the native display resolutions of the display layers. Factorized images may be refreshed on respective display layers in synchronization or out of synchronization.
-
公开(公告)号:US20170249401A1
公开(公告)日:2017-08-31
申请号:US15055440
申请日:2016-02-26
Applicant: NVIDIA Corporation
Inventor: Benjamin David Eckart , Kihwan Kim , Alejandro Jose Troccoli , Jan Kautz
CPC classification number: G06F17/5009 , G06F17/18 , G06F2217/16 , G06K9/00986 , G06K9/6219 , G06K9/6277 , G06K9/6282 , G06N5/003 , G06N7/005
Abstract: A method, computer readable medium, and system are disclosed for generating a Gaussian mixture model hierarchy. The method includes the steps of receiving point cloud data defining a plurality of points; defining a Gaussian Mixture Model (GMM) hierarchy that includes a number of mixels, each mixel encoding parameters for a probabilistic occupancy map; and adjusting the parameters for one or more probabilistic occupancy maps based on the point cloud data utilizing a number of iterations of an Expectation-Maximum (EM) algorithm.
-
公开(公告)号:US20150310798A1
公开(公告)日:2015-10-29
申请号:US14660637
申请日:2015-03-17
Applicant: NVIDIA Corporation
Inventor: Felix Heide , Douglas Lanman , Dikpal Reddy , Jan Kautz , Kari Pulli , David Luebke
IPC: G09G3/20
CPC classification number: G09G3/20 , G09G3/007 , G09G3/2025 , G09G3/36 , G09G2300/023 , G09G2340/0407 , G09G2340/0435
Abstract: System and method of displaying images in temporal superresolution by multiplicative superposition of cascaded display layers integrated in a display device. Using an original video with a target temporal resolution as a priori, a factorization process is performed to derive respective image data for presentation on each display layer. The multiple layers are refreshed in staggered intervals to synthesize a video with an effective refresh rate exceeding that of each individual display layer, e.g., by a factor equal to the number of layers. Further optically averaging neighboring pixels can minimize artifacts.
Abstract translation: 通过在显示装置中集成的级联显示层的乘法叠加在时间超分辨率中显示图像的系统和方法。 使用具有目标时间分辨率的原始视频作为先验,执行因式分解处理以导出用于在每个显示层上呈现的各个图像数据。 多层以交错的间隔刷新以合成具有超过每个单独显示层的有效刷新率的视频,例如等于层数的因子。 进一步光学平均相邻像素可以最小化伪像。
-
公开(公告)号:US20150310789A1
公开(公告)日:2015-10-29
申请号:US14660030
申请日:2015-03-17
Applicant: NVIDIA Corporation
Inventor: Felix Heide , Douglas Lanman , Dikpal Reddy , Jan Kautz , Kari Pulli , David Luebke
IPC: G09G3/20
CPC classification number: G09G3/20 , G09G3/007 , G09G3/2025 , G09G3/36 , G09G2300/023 , G09G2340/0407 , G09G2340/0435
Abstract: System and method of displaying images in spatial/temporal superresolution by multiplicative superposition of cascaded display layers integrated in a display device. Using an original image with a target spatial/temporal resolution as a priori, a factorization process is performed to derive respective image data for presentation on each display layer. The cascaded display layers may be progressive and laterally shifted with each other, resulting in an effective spatial resolution exceeding the native display resolutions of the display layers. Factorized images may be refreshed on respective display layers in synchronization or out of synchronization.
Abstract translation: 通过在显示装置中集成的级联显示层的乘法叠加在空间/时间超分辨率中显示图像的系统和方法。 作为先验使用具有目标空间/时间分辨率的原始图像,执行因式分解处理以导出用于在每个显示层上呈现的各个图像数据。 级联的显示层可以是逐行的并且横向移位,导致超过显示层的本机显示分辨率的有效的空间分辨率。 分解图像可以在同步或不同步的情况下在各个显示层上刷新。
-
公开(公告)号:US20240404174A1
公开(公告)日:2024-12-05
申请号:US18653723
申请日:2024-05-02
Applicant: NVIDIA Corporation
Inventor: Xueting Li , Shalini De Mello , Sifei Liu , Koki Nagano , Umar Iqbal , Jan Kautz
Abstract: Systems and methods are disclosed that animate a source portrait image with motion (i.e., pose and expression) from a target image. In contrast to conventional systems, given an unseen single-view portrait image, an implicit three-dimensional (3D) head avatar is constructed that not only captures photo-realistic details within and beyond the face region, but also is readily available for animation without requiring further optimization during inference. In an embodiment, three processing branches of a system produce three tri-planes representing coarse 3D geometry for the head avatar, detailed appearance of a source image, as well as the expression of a target image. By applying volumetric rendering to a combination of the three tri-planes, an image of the desired identity, expression and pose is generated.
-
公开(公告)号:US20240338871A1
公开(公告)日:2024-10-10
申请号:US18746911
申请日:2024-06-18
Applicant: NVIDIA Corporation
Inventor: Donghoom LEE , Sifei Liu , Jinwei Gu , Ming-Yu Liu , Jan Kautz
CPC classification number: G06T11/60 , G06F18/217 , G06F18/24 , G06T3/02 , G06T7/30 , G06V30/274 , G06T7/70 , G06T2207/20081 , G06T2207/20084 , G06T2210/12
Abstract: One embodiment of a method includes applying a first generator model to a semantic representation of an image to generate an affine transformation, where the affine transformation represents a bounding box associated with at least one region within the image. The method further includes applying a second generator model to the affine transformation and the semantic representation to generate a shape of an object. The method further includes inserting the object into the image based on the bounding box and the shape.
-
-
-
-
-
-
-
-
-