-
公开(公告)号:WO2021067126A1
公开(公告)日:2021-04-08
申请号:PCT/US2020/052665
申请日:2020-09-25
Applicant: NVIDIA CORPORATION
Inventor: LIU, Ming-Yu
IPC: G06T11/00
Abstract: Apparatuses, systems, and techniques are presented to generate or manipulate digital images. In at least one embodiment, a network is trained to generate modified images including user-selected features.
-
公开(公告)号:WO2021086796A1
公开(公告)日:2021-05-06
申请号:PCT/US2020/057408
申请日:2020-10-26
Applicant: NVIDIA CORPORATION
Inventor: TREMBLAY, Jonathan , LIU, Ming-Yu , FOX, Dieter , AMMIRATO, Philip
IPC: G06K9/32
Abstract: Apparatuses, systems, and techniques to determine orientation of an objects in an image. In at least one embodiment, images are processed using a neural network trained to determine orientation of an object.
-
公开(公告)号:WO2023004030A1
公开(公告)日:2023-01-26
申请号:PCT/US2022/037852
申请日:2022-07-21
Applicant: NVIDIA CORPORATION
Inventor: LIU, Ming-Yu , NAGANO, Koki , SEOL, Yeongho , VALLE GOMES DA COSTA, Jose, Rafael , SEO, Jaewoo , WANG, Ting-Chun , MALLYA, Arun , KHAMIS, Sameh , PING, Wei , BADLANI, Rohan , SHIH, Kevin, Jonathan , CATANZARO, Bryan , YUEN, Simon , KAUTZ, Jan
Abstract: Apparatuses, systems, and techniques are presented to generate media content. In at least one embodiment, a first neural network is used to generate first video information based, at least in part, upon voice information corresponding to one or more users, and a second neural network is used to generate second video information corresponding to the one or more users based, at least in part, upon the first video information and one or more images corresponding to the one or more users.
-
公开(公告)号:WO2022133883A1
公开(公告)日:2022-06-30
申请号:PCT/CN2020/138937
申请日:2020-12-24
Applicant: NVIDIA CORPORATION
Inventor: LIU, Ming-Yu , WANG, Ting-Chun , LIU, Xihui
IPC: G06T19/00
Abstract: Apparatuses, systems, and techniques to produce an image of a first subject positioned in a pose demonstrated by an image of a second subject. In at least one embodiment, an image of a first subject can be generated from a variety of points of view.
-
公开(公告)号:WO2021211750A1
公开(公告)日:2021-10-21
申请号:PCT/US2021/027343
申请日:2021-04-14
Applicant: NVIDIA CORPORATION
Inventor: LIU, Ming-Yu , WANG, Ting-Chun , MALLYA, Arun Mohanray , KARRAS, Tero Tapani , LAINE, Samuli Matias , LUEBKE, David Patrick , LEHTINEN, Jaakko , AITTALA, Miika Samuli , AILA, Timo Oskari
Abstract: Apparatuses, systems, and techniques to perform compression of video data using neural networks to facilitate video streaming, such as video conferencing. In at least one embodiment, a sender transmits to a receiver a key frame from video data and one or more keypoints identified by a neural network from said video data, and a receiver reconstructs video data using said key frame and one or more received keypoints.
-
公开(公告)号:WO2021050581A1
公开(公告)日:2021-03-18
申请号:PCT/US2020/049987
申请日:2020-09-09
Applicant: NVIDIA CORPORATION
Inventor: VAHDAT, Arash , MALLYA, Arun Mohanray , LIU, Ming-Yu , KAUTZ, Jan
Abstract: In at least one embodiment, differentiable neural architecture search and reinforcement learning are combined under one framework to discover network architectures with desired properties such as high accuracy, low latency, or both. In at least one embodiment, an objective function for search based on generalization error prevents the selection of architectures prone to overfitting.
-
公开(公告)号:WO2020102733A1
公开(公告)日:2020-05-22
申请号:PCT/US2019/061820
申请日:2019-11-15
Applicant: NVIDIA CORPORATION
Inventor: KAR, Amlan , PRAKASH, Aayush , LIU, Ming-Yu , ACUNA MARRERO, David Jesus , BARRIUSO, Antonio Torralba , FIDLER, Sanja
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar – such as a probabilistic grammar – and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:WO2022250796A1
公开(公告)日:2022-12-01
申请号:PCT/US2022/024306
申请日:2022-04-11
Applicant: NVIDIA CORPORATION
Inventor: SHEN, Tianchang , GAO, Jun , YIN, Kangxue , LIU, Ming-Yu , FIDLER, Sanja
IPC: G06T17/20
Abstract: In various examples, a deep three-dimensional (3D) conditional generative model is implemented that can synthesize high resolution 3D shapes using simple guides – such as coarse voxels, point clouds, etc. – by marrying implicit and explicit 3D representations into a hybrid 3D representation. The present approach may directly optimize for the reconstructed surface, allowing for the synthesis of finer geometric details with fewer artifacts. The systems and methods described herein may use a deformable tetrahedral grid that encodes a discretized signed distance function (SDF) and a differentiable marching tetrahedral layer that converts the implicit SDF representation to an explicit surface mesh representation. This combination allows joint optimization of the surface geometry and topology as well as generation of the hierarchy of subdivisions using reconstruction and adversarial losses defined explicitly on the surface mesh.
-
公开(公告)号:WO2022119844A1
公开(公告)日:2022-06-09
申请号:PCT/US2021/061238
申请日:2021-11-30
Applicant: NVIDIA CORPORATION
Inventor: HAO, Zekun , LIU, Ming-Yu , MALLYA, Arun Mohanray
Abstract: Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon one or more semantic features projected from a three-dimensional environment.
-
公开(公告)号:WO2020198084A1
公开(公告)日:2020-10-01
申请号:PCT/US2020/024083
申请日:2020-03-21
Applicant: NVIDIA CORPORATION
Inventor: LI, Daiqing , FIDLER, Sanja , CHU, Hang , ACUNA MARRERO, David Jesus , KAR, Amlan , SHUGRINA, Maria , LIU, Ming-Yu , TORRALBA, Antonio
IPC: G06F30/27 , G06F30/13 , G06F16/901 , G06T3/00
Abstract: A generative model can be used for generation of spatial layouts and graphs. Such a model can progressively grow these layouts and graphs based on local statistics, where nodes can represent spatial control points of the layout, and edges can represent segments or paths between nodes, such as may correspond to road segments. A generative model can utilize an encoder-decoder architecture where the encoder is a recurrent neural network (RNN) that encodes local incoming paths into a node and the decoder is another RNN that generates outgoing nodes and edges connecting an existing node to the newly generated nodes. Generation is done iteratively, and can finish once all nodes are visited or another end condition is satisfied. Such a model can generate layouts by additionally conditioning on a set of attributes, giving control to a user in generating the layout.
-
-
-
-
-
-
-
-
-