FEATURE EXTRACTION WITH THREE-DIMENSIONAL INFORMATION

    Publication No.: US20250131680A1

    Publication Date: 2025-04-24

    Application No.: US18802288

    Filing Date: 2024-08-13

    Abstract: Disclosed are systems and methods relating to extracting 3D features, such as bounding boxes. The systems can apply an epipolar geometric warping to one or more features of a source image that depicts a scene using a first set of camera parameters, based on a condition view image associated with the source image, to determine a second set of camera parameters. The systems can generate, using a neural network, a synthetic image representing the one or more features and corresponding to the second set of camera parameters.
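The abstract's core geometric tool, epipolar warping between two sets of camera parameters, rests on the fundamental matrix relating corresponding pixels in two views. A minimal sketch of that relation (the intrinsics, pose, and point values below are illustrative assumptions, not values from the patent):

```python
import numpy as np

def skew(t):
    # Skew-symmetric cross-product matrix [t]_x
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

def fundamental_matrix(K1, K2, R, t):
    # F = K2^{-T} [t]_x R K1^{-1}: maps a pixel in view 1
    # to its epipolar line in view 2.
    return np.linalg.inv(K2).T @ skew(t) @ R @ np.linalg.inv(K1)

# Assumed intrinsics and relative pose (hypothetical values).
K = np.diag([800.0, 800.0, 1.0])
K[0, 2], K[1, 2] = 320.0, 240.0
R = np.eye(3)                   # second camera: same orientation,
t = np.array([0.2, 0.0, 0.0])   # translated along x (a stereo pair)

F = fundamental_matrix(K, K, R, t)

# A 3D point projected into both views satisfies x2^T F x1 = 0,
# which is what lets features be warped along epipolar lines.
X = np.array([0.5, -0.3, 4.0])
x1 = K @ X;            x1 /= x1[2]
x2 = K @ (R @ X + t);  x2 /= x2[2]
print(abs(x2 @ F @ x1))  # near 0: the epipolar constraint holds
```

Any pixel in the source view is thus constrained to a one-dimensional epipolar line in the condition view, which is what makes warping features between the two camera-parameter sets well posed.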

    NEURAL RENDERING FOR INVERSE GRAPHICS GENERATION

    Publication No.: US20210279952A1

    Publication Date: 2021-09-09

    Application No.: US17193405

    Filing Date: 2021-03-05

    Abstract: Approaches are presented for training an inverse graphics network. An image synthesis network can generate training data for an inverse graphics network. In turn, the inverse graphics network can teach the synthesis network about the physical three-dimensional (3D) controls. Such an approach can provide for accurate 3D reconstruction of objects from 2D images using the trained inverse graphics network, while requiring little annotation of the provided training data. Such an approach can extract and disentangle 3D knowledge learned by generative models by utilizing differentiable renderers, enabling a disentangled generative model to function as a controllable 3D “neural renderer,” complementing traditional graphics renderers.
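The differentiable renderer is what makes this inverse-graphics loop possible: gradients of a rendering loss can flow back to the 3D parameters. A toy sketch of that principle, using a single projected point as a stand-in for a full renderer (all values and the linear setup are assumptions for illustration):

```python
import numpy as np

# Toy "differentiable renderer": perspective projection of one
# 3D point (a stand-in for a full renderer, for illustration only).
def render(p, f=1.0):
    return f * p[:2] / p[2]

def loss_and_grad(p, target, f=1.0):
    # Analytic gradient of the squared reprojection error w.r.t.
    # the 3D point -- the property a differentiable renderer adds.
    x, y, z = p
    r = render(p, f) - target
    J = np.array([[f / z, 0.0, -f * x / z**2],
                  [0.0, f / z, -f * y / z**2]])  # d(proj)/d(p)
    return r @ r, 2.0 * J.T @ r

# "Inverse graphics": recover 3D parameters from a 2D observation
# by gradient descent through the renderer.
target = np.array([0.25, -0.1])   # observed 2D projection
p = np.array([0.0, 0.0, 2.0])     # initial 3D guess
for _ in range(200):
    loss, grad = loss_and_grad(p, target)
    p -= 0.5 * grad               # gradient step
print(render(p))  # projection has converged toward the target
```

In the patented approach the same idea operates at full scale: the generative model's outputs are pushed through a differentiable renderer so that 3D pose, shape, and texture controls can be disentangled and supervised with only 2D images.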

    NEURAL RENDERING FOR INVERSE GRAPHICS GENERATION

    Publication No.: US20230134690A1

    Publication Date: 2023-05-04

    Application No.: US17981770

    Filing Date: 2022-11-07

    Abstract: Approaches are presented for training an inverse graphics network. An image synthesis network can generate training data for an inverse graphics network. In turn, the inverse graphics network can teach the synthesis network about the physical three-dimensional (3D) controls. Such an approach can provide for accurate 3D reconstruction of objects from 2D images using the trained inverse graphics network, while requiring little annotation of the provided training data. Such an approach can extract and disentangle 3D knowledge learned by generative models by utilizing differentiable renderers, enabling a disentangled generative model to function as a controllable 3D “neural renderer,” complementing traditional graphics renderers.

    HIGH-PRECISION SEMANTIC IMAGE EDITING USING NEURAL NETWORKS FOR SYNTHETIC DATA GENERATION SYSTEMS AND APPLICATIONS

    Publication No.: US20220383570A1

    Publication Date: 2022-12-01

    Application No.: US17827394

    Filing Date: 2022-05-27

    Abstract: In various examples, high-precision semantic image editing for machine learning systems and applications is described. For example, a generative adversarial network (GAN) may be used to jointly model images and their semantic segmentations based on a same underlying latent code. Image editing may be achieved by using segmentation mask modifications (e.g., provided by a user, or otherwise) to optimize the latent code to be consistent with the updated segmentation, thus effectively changing the original (e.g., RGB) image. To improve efficiency of the system, and to avoid requiring a separate optimization for each edit on each image, editing vectors may be learned in latent space that realize the edits, and that can be directly applied to other images with or without additional optimization. As a result, a GAN in combination with the optimization approaches described herein may simultaneously allow for high-precision editing in real time with straightforward compositionality of multiple edits.
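The mechanism in the abstract (edit a segmentation mask, then optimize the shared latent code until the generated mask matches, and keep the latent offset as a reusable editing vector) can be sketched in miniature. The linear decoders, sizes, and learning rate below are assumptions made so the sketch stays self-contained; they are not the patented architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for a jointly trained GAN: one latent code z
# is decoded into both an "image" and its "segmentation" (linear
# decoders are assumed so the sketch stays tractable).
W_img = rng.normal(size=(16, 8))
W_seg = rng.normal(size=(16, 8))
G = lambda z: (W_img @ z, W_seg @ z)

z0 = rng.normal(size=8)
_, seg0 = G(z0)
edited_seg = seg0.copy()
edited_seg[:4] += 1.0     # a user's hypothetical mask modification

# Optimize the latent code so the generated segmentation matches the
# edited mask; because image and mask share one latent code, the
# image changes consistently with the edit as a side effect.
z = z0.copy()
for _ in range(1000):
    _, seg = G(z)
    z -= 0.005 * 2.0 * W_seg.T @ (seg - edited_seg)  # gradient step

# The optimized offset can be reused as an "editing vector" that
# applies the same edit to other latents without re-optimizing.
edit_vector = z - z0
```

The reusable `edit_vector` is what removes the need for a fresh optimization per edit per image: adding it to another image's latent code applies the same semantic change directly.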
