-
公开(公告)号:US11715251B2
公开(公告)日:2023-08-01
申请号:US17507620
申请日:2021-10-21
Applicant: NVIDIA Corporation
Inventor: Jonathan Tremblay , Aayush Prakash , Mark A. Brophy , Varun Jampani , Cem Anil , Stanley Thomas Birchfield , Thang Hong To , David Jesus Acuna Marrero
IPC: G06T15/00 , G06T15/04 , G06T15/50 , G06T15/20 , G06F18/214 , G06F18/211 , G06V10/774 , G06V10/82 , G06N3/04 , G06N3/084
CPC classification number: G06T15/00 , G06F18/211 , G06F18/2148 , G06T15/04 , G06T15/20 , G06T15/50 , G06V10/7747 , G06V10/82 , G06N3/04 , G06N3/084 , G06T2210/12 , G06V2201/07
Abstract: Training deep neural networks requires a large amount of labeled training data. Conventionally, labeled training data is generated by gathering real images that are manually labelled which is very time-consuming. Instead of manually labelling a training dataset, domain randomization technique is used generate training data that is automatically labeled. The generated training data may be used to train neural networks for object detection and segmentation (labelling) tasks. In an embodiment, the generated training data includes synthetic input images generated by rendering three-dimensional (3D) objects of interest in a 3D scene. In an embodiment, the generated training data includes synthetic input images generated by rendering 3D objects of interest on a 2D background image. The 3D objects of interest are objects that a neural network is trained to detect and/or label.
-
公开(公告)号:US20210374489A1
公开(公告)日:2021-12-02
申请号:US17226561
申请日:2021-04-09
Applicant: Nvidia Corporation
Inventor: Aayush Prakash , Shoubhik Debnath , Jean-Francois Lafleche , Eric Cameracci , Gavriel State , Marc Teva Law
Abstract: Approaches are presented for training and using scene graph generators for transfer learning. A scene graph generation technique can decompose a domain gap into individual types of discrepancies, such as may relate to appearance, label, and prediction discrepancies. These discrepancies can be reduced, at least in part, by aligning the corresponding latent and output distributions using one or more gradient reversal layers (GRLs). Label discrepancies can be addressed using self-pseudo-statistics collected from target data. Pseudo statistic-based self-learning and adversarial techniques can be used to manage these discrepancies without the need for costly supervision from a real-world dataset.
-
公开(公告)号:US11574155B2
公开(公告)日:2023-02-07
申请号:US17226561
申请日:2021-04-09
Applicant: Nvidia Corporation
Inventor: Aayush Prakash , Shoubhik Debnath , Jean-Francois Lafleche , Eric Cameracci , Gavriel State , Marc Teva Law
Abstract: Approaches are presented for training and using scene graph generators for transfer learning. A scene graph generation technique can decompose a domain gap into individual types of discrepancies, such as may relate to appearance, label, and prediction discrepancies. These discrepancies can be reduced, at least in part, by aligning the corresponding latent and output distributions using one or more gradient reversal layers (GRLs). Label discrepancies can be addressed using self-pseudo-statistics collected from target data. Pseudo statistic-based self-learning and adversarial techniques can be used to manage these discrepancies without the need for costly supervision from a real-world dataset.
-
公开(公告)号:US20210097346A1
公开(公告)日:2021-04-01
申请号:US17119971
申请日:2020-12-11
Applicant: NVIDIA Corporation
Inventor: Jonathan Tremblay , Aayush Prakash , Mark A. Brophy , Varun Jampani , Cem Anil , Stanley Thomas Birchfield , Thang Hong To , David Jesus Acuna Marrero
Abstract: Training deep neural networks requires a large amount of labeled training data. Conventionally, labeled training data is generated by gathering real images that are manually labelled which is very time-consuming. Instead of manually labelling a training dataset, domain randomization technique is used generate training data that is automatically labeled. The generated training data may be used to train neural networks for object detection and segmentation (labelling) tasks. In an embodiment, the generated training data includes synthetic input images generated by rendering three-dimensional (3D) objects of interest in a 3D scene. In an embodiment, the generated training data includes synthetic input images generated by rendering 3D objects of interest on a 2D background image. The 3D objects of interest are objects that a neural network is trained to detect and/or label.
-
公开(公告)号:US20200160178A1
公开(公告)日:2020-05-21
申请号:US16685795
申请日:2019-11-15
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06N3/04 , G06T11/60
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US20230229919A1
公开(公告)日:2023-07-20
申请号:US18186696
申请日:2023-03-20
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06T11/60 , G06N3/045 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/426
CPC classification number: G06N3/08 , G06F16/9024 , G06T11/60 , G06N3/045 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/426 , G06T2210/61
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar— and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US11610115B2
公开(公告)日:2023-03-21
申请号:US16685795
申请日:2019-11-15
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06T11/60 , G06N3/04
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US20230004760A1
公开(公告)日:2023-01-05
申请号:US17361202
申请日:2021-06-28
Applicant: NVIDIA Corporation
Inventor: Siva Karthik Mustikovela , Shalini De Mello , Aayush Prakash , Umar Iqbal , Sifei Liu , Jan Kautz
IPC: G06K9/62
Abstract: Apparatuses, systems, and techniques to identify objects within an image using self-supervised machine learning. In at least one embodiment, a machine learning system is trained to recognize objects by training a first network to recognize objects within images that are generated by a second network. In at least one embodiment, the second network is a controllable network.
-
公开(公告)号:US20190251397A1
公开(公告)日:2019-08-15
申请号:US16256820
申请日:2019-01-24
Applicant: NVIDIA Corporation
Inventor: Jonathan Tremblay , Aayush Prakash , Mark A. Brophy , Varun Jampani , Cem Anil , Stanley Thomas Birchfield , Thang Hong To , David Jesus Acuna Marrero
CPC classification number: G06K9/6257 , G06K9/6228 , G06K2209/21 , G06N3/04 , G06N3/084 , G06T15/04 , G06T15/20 , G06T15/50 , G06T2210/12
Abstract: Training deep neural networks requires a large amount of labeled training data. Conventionally, labeled training data is generated by gathering real images that are manually labelled which is very time-consuming. Instead of manually labelling a training dataset, domain randomization technique is used generate training data that is automatically labeled. The generated training data may be used to train neural networks for object detection and segmentation (labelling) tasks. In an embodiment, the generated training data includes synthetic input images generated by rendering three-dimensional (3D) objects of interest in a 3D scene. In an embodiment, the generated training data includes synthetic input images generated by rendering 3D objects of interest on a 2D background image. The 3D objects of interest are objects that a neural network is trained to detect and/or label.
-
公开(公告)号:US20240320993A1
公开(公告)日:2024-09-26
申请号:US18673785
申请日:2024-05-24
Applicant: Nvidia Corporation
Inventor: Aayush Prakash , Shoubhik Debnath , Jean-Francois Lafleche , Eric Cameracci , Gavriel State , Marc Teva Law
IPC: G06V20/70 , G06F18/10 , G06F18/20 , G06F18/24 , G06V10/764 , G06V10/82 , G06V10/84 , G06V20/00 , G06V20/56
CPC classification number: G06V20/70 , G06F18/10 , G06F18/24 , G06F18/29 , G06V10/764 , G06V10/82 , G06V10/84 , G06V20/00 , G06V20/56
Abstract: Approaches are presented for training and using scene graph generators for transfer learning. A scene graph generation technique can decompose a domain gap into individual types of discrepancies, such as may relate to appearance, label, and prediction discrepancies. These discrepancies can be reduced, at least in part, by aligning the corresponding latent and output distributions using one or more gradient reversal layers (GRLs). Label discrepancies can be addressed using self-pseudo-statistics collected from target data. Pseudo statistic-based self-learning and adversarial techniques can be used to manage these discrepancies without the need for costly supervision from a real-world dataset.
-
-
-
-
-
-
-
-
-