Patent search ap:("Nvidia Corporation") AND inv:"Jan Kautz" Page 10

91.

发明申请
FAST MULTI-SCALE POINT CLOUD REGISTRATION WITH A HIERARCHICAL GAUSSIAN MIXTURE 审中-公开

公开(公告)号：US20190319851A1

公开(公告)日：2019-10-17

申请号：US16351312

申请日：2019-03-12

Applicant: NVIDIA Corporation

Inventor： Benjamin David Eckart , Kihwan Kim , Jan Kautz

IPC: H04L12/24 , H04L29/08 , G06F16/22

Abstract: Point cloud registration sits at the core of many important and challenging 3D perception problems including autonomous navigation, object/scene recognition, and augmented reality (AR). A new registration algorithm is presented that achieves speed and accuracy by registering a point cloud to a representation of a reference point cloud. A target point cloud is registered to the reference point cloud by iterating through a number of cycles of an EM algorithm where, during an Expectation step, each point in the target point cloud is associated with a node of a hierarchical tree data structure and, during a Maximization step, an estimated transformation is determined based on the association of the points with corresponding nodes of the hierarchical tree data structure. The estimated transformation is determined by solving a minimization problem associated with a sum, over a number of mixture components, over terms related to a Mahalanobis distance.

92.

发明授权
System and method for optical flow estimation 有权

公开(公告)号：US10424069B2

公开(公告)日：2019-09-24

申请号：US15942213

申请日：2018-03-30

Applicant: NVIDIA Corporation

Inventor： Deqing Sun , Xiaodong Yang , Ming-Yu Liu , Jan Kautz

IPC: G06T7/207 , G06N5/04 , G06T3/00 , G06T7/00 , G06T7/246 , G06N3/04 , G06N3/08

Abstract: A method, computer readable medium, and system are disclosed for estimating optical flow between two images. A first pyramidal set of features is generated for a first image and a partial cost volume for a level of the first pyramidal set of features is computed, by a neural network, using features at the level of the first pyramidal set of features and warped features extracted from a second image, where the partial cost volume is computed across a limited range of pixels that is less than a full resolution of the first image, in pixels, at the level. The neural network processes the features and the partial cost volume to produce a refined optical flow estimate for the first image and the second image.

93.

发明申请
Domain Stylization Using a Neural Network Model 审中-公开

公开(公告)号：US20190244060A1

公开(公告)日：2019-08-08

申请号：US16265725

申请日：2019-02-01

Applicant: NVIDIA Corporation

Inventor： Aysegul Dundar , Ming-Yu Liu , Ting-Chun Wang , John Zedlewski , Jan Kautz

IPC: G06K9/62 , G06N3/08 , G06N3/04 , G06K9/32 , G06T3/00 , G06T7/10

CPC classification number: G06K9/6256 , G06K9/3233 , G06K9/6267 , G06N3/0454 , G06N3/08 , G06T3/0056 , G06T7/10

Abstract: A style transfer neural network may be used to generate stylized synthetic images, where real images provide the style (e.g., seasons, weather, lighting) for transfer to synthetic images. The stylized synthetic images may then be used to train a recognition neural network. In turn, the trained neural network may be used to predict semantic labels for the real images, providing recognition data for the real images. Finally, the real training dataset (real images and predicted recognition data) and the synthetic training dataset are used by the style transfer neural network to generate stylized synthetic images. The training of the neural network, prediction of recognition data for the real images, and stylizing of the synthetic images may be repeated for a number of iterations. The stylization operation more closely aligns a covariate of the synthetic images to the covariate of the real images, improving accuracy of the recognition neural network.

94.

发明申请
SWITCHABLE PROPAGATION NEURAL NETWORK 审中-公开

公开(公告)号：US20190213439A1

公开(公告)日：2019-07-11

申请号：US16353835

申请日：2019-03-14

Applicant: NVIDIA Corporation

Inventor： Sifei Liu , Shalini De Mello , Jinwei Gu , Varun Jampani , Jan Kautz

IPC: G06K9/62 , G06K9/00 , G06T7/90 , G06T5/00 , G06T7/10 , G06T5/50 , G06N3/08 , G06N3/04

CPC classification number: G06K9/6215 , G06K9/00744 , G06K9/6256 , G06N3/04 , G06N3/08 , G06N3/084 , G06T5/009 , G06T5/50 , G06T7/10 , G06T7/90 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/20208

Abstract: A temporal propagation network (TPN) system learns the affinity matrix for video image processing tasks. An affinity matrix is a generic matrix that defines the similarity of two points in space. The TPN system includes a guidance neural network model and a temporal propagation module and is trained for a particular computer vision task to propagate visual properties from a key-frame represented by dense data (color), to another frame that is represented by coarse data (grey-scale). The guidance neural network model generates an affinity matrix referred to as a global transformation matrix from task-specific data for the key-frame and the other frame. The temporal propagation module applies the global transformation matrix to the key-frame property data to produce propagated property data (color) for the other frame. For example, the TPN system may be used to colorize several frames of greyscale video using a single manually colorized key-frame.

95.

发明申请
SYSTEMS AND METHODS FOR DYNAMIC FACIAL ANALYSIS USING A RECURRENT NEURAL NETWORK 审中-公开

公开(公告)号：US20190180469A1

公开(公告)日：2019-06-13

申请号：US15836549

申请日：2017-12-08

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Xiaodong Yang , Shalini De Mello , Jan Kautz

IPC: G06T7/73 , G06N3/08

CPC classification number: G06T7/73 , G06N3/08 , G06T3/4046 , G06T13/40 , G06T2207/10016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201 , G06T2207/30204

Abstract: A method, computer readable medium, and system are disclosed for dynamic facial analysis. The method includes the steps of receiving video data representing a sequence of image frames including at least one head and extracting, by a neural network, spatial features comprising pitch, yaw, and roll angles of the at least one head from the video data. The method also includes the step of processing, by a recurrent neural network, the spatial features for two or more image frames in the sequence of image frames to produce head pose estimates for the at least one head.

96.

发明申请
CREATING AN IMAGE UTILIZING A MAP REPRESENTING DIFFERENT CLASSES OF PIXELS 审中-公开

公开(公告)号：US20190147296A1

公开(公告)日：2019-05-16

申请号：US16188920

申请日：2018-11-13

Applicant: NVIDIA Corporation

Inventor： Ting-Chun Wang , Ming-Yu Liu , Bryan Christopher Catanzaro , Jan Kautz , Andrew J. Tao

IPC: G06K9/62 , G06K9/68 , G06K9/72

Abstract: A method, computer readable medium, and system are disclosed for creating an image utilizing a map representing different classes of specific pixels within a scene. One or more computing systems use the map to create a preliminary image. This preliminary image is then compared to an original image that was used to create the map. A determination is made whether the preliminary image matches the original image, and results of the determination are used to adjust the computing systems that created the preliminary image, which improves a performance of such computing systems. The adjusted computing systems are then used to create images based on different input maps representing various object classes of specific pixels within a scene.

97.

发明申请
Learning-Based Camera Pose Estimation From Images of an Environment 审中-公开

公开(公告)号：US20190108651A1

公开(公告)日：2019-04-11

申请号：US16137064

申请日：2018-09-20

Applicant: NVIDIA Corporation

Inventor： Jinwei Gu , Samarth Manoj Brahmbhatt , Kihwan Kim , Jan Kautz

IPC: G06T7/80 , G06T7/00

Abstract: A deep neural network (DNN) system learns a map representation for estimating a camera position and orientation (pose). The DNN is trained to learn a map representation corresponding to the environment, defining positions and attributes of structures, trees, walls, vehicles, walls, etc. The DNN system learns a map representation that is versatile and performs well for many different environments (indoor, outdoor, natural, synthetic, etc.). The DNN system receives images of an environment captured by a camera (observations) and outputs an estimated camera pose within the environment. The estimated camera pose is used to perform camera localization, i.e., recover the three-dimensional (3D) position and orientation of a moving camera, which is a fundamental task in computer vision with a wide variety of applications in robot navigation, car localization for autonomous driving, device localization for mobile navigation, and augmented/virtual reality.

98.

发明申请
UNIFIED OPTIMIZATION METHOD FOR END-TO-END CAMERA IMAGE PROCESSING FOR TRANSLATING A SENSOR CAPTURED IMAGE TO A DISPLAY IMAGE 审中-公开
Title translation: 用于将传感器捕获的图像转换为显示图像的端到端相机图像处理的统一优化方法

公开(公告)号：US20170011710A1

公开(公告)日：2017-01-12

申请号：US15276626

申请日：2016-09-26

Applicant: NVIDIA Corporation

Inventor： Dawid Stanislaw Pajak , Felix Heide , Nagilla Dikpal Reddy , Mushfiqur Rouf , Jan Kautz , Kari Pulli , Orazio Gallo

IPC: G09G5/02 , G09G5/36 , G06T3/40 , G06T5/00

CPC classification number: G09G5/02 , G06T3/4015 , G06T5/001 , G06T5/002 , G09G5/026 , G09G5/363 , G09G2320/0238 , G09G2320/0242 , G09G2320/0247 , G09G2320/066 , G09G2360/08

Abstract: A computer implemented method of determining a latent image from an observed image is disclosed. The method comprises implementing a plurality of image processing operations within a single optimization framework, wherein the single optimization framework comprises solving a linear minimization expression. The method further comprises mapping the linear minimization expression onto at least one non-linear solver. Further, the method comprises using the non-linear solver, iteratively solving the linear minimization expression in order to extract the latent image from the observed image, wherein the linear minimization expression comprises: a data term, and a regularization term, and wherein the regularization term comprises a plurality of non-linear image priors.

Abstract translation: 公开了一种从观察图像确定潜像的计算机实现方法。该方法包括在单个优化框架内实现多个图像处理操作，其中单个优化框架包括求解线性最小化表达式。该方法还包括将线性最小化表达映射到至少一个非线性求解器上。此外，该方法包括使用非线性求解器，迭代地求解线性最小化表达以从观察图像中提取潜像，其中线性最小化表达式包括：数据项和正则化项，其中正则化术语包括多个非线性图像先验。

99.

发明申请
SYNTHETIC BRACKETING FOR EXPOSURE CORRECTION 有权

公开(公告)号：US20250069191A1

公开(公告)日：2025-02-27

申请号：US18452634

申请日：2023-08-21

Applicant: NVIDIA Corporation

Inventor： Iuri Frosio , Mayoore Selvarasa Jaiswal , Jan Kautz , Jianyuan Min

IPC: G06T5/50 , H04N23/743

Abstract: Systems and methods are disclosed related to synthetic bracketing for exposure correction. A deep learning based method and system produces a set of differently exposed images from a single input image. The images in the set may be combined to produce an output image with improved global and local exposure compared with the input image. An image encoder applies learned parameters to each input image to generate a set of image features including local exposure estimates for each of two or more regions of the input image and a low resolution latent representation of the input image. A decoder receives the local exposure estimates, the latent representation, and target enhancements that are processed to generate synthesized transformations. When applied to the input image, the synthesized transformations produce the set of transformed images. Each transformed image is a version of the input image synthesized to correspond to a respective target enhancement.

100.

发明申请
VARIATIONAL INFERENCING BY A DIFFUSION MODEL 有权

公开(公告)号：US20250045892A1

公开(公告)日：2025-02-06

申请号：US18593742

申请日：2024-03-01

Applicant: NVIDIA Corporation

Inventor： Morteza Mardani , Jiaming Song , Jan Kautz , Arash Vahdat

IPC: G06T7/00 , G06T3/40 , G06T5/70 , G06T5/73 , G06T5/77

Abstract: Diffusion models are machine learning algorithms that are uniquely trained to generate high-quality data from an input lower-quality data. For example, they can be trained in the image domain, for example, to perform specific image restoration tasks, such as inpainting (e.g. completing an incomplete image), deblurring (e.g. removing blurring from an image), and super-resolution (e.g. increasing a resolution of an image), or they can be trained to perform image rendering tasks, including 2D-to-3D image generation tasks. However, current approaches to training diffusion models only allow the models to be optimized for a specific task such that they will not achieve high-quality results when used for other tasks. The present disclosure provides a diffusion model that uses variational inferencing to approximate a distribution of data, which allows the diffusion model to universally solve different tasks without having to be re-trained specifically for each task.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification