-
公开(公告)号:US20210275918A1
公开(公告)日:2021-09-09
申请号:US17117425
申请日:2020-12-10
Applicant: Nvidia Corporation
Inventor: Jeevan Devaranjan , Sanja Fidler , Amlan Kar
Abstract: A rule set or scene grammar can be used to generate a scene graph that represents the structure and visual parameters of objects in a scene. A renderer can take this scene graph as input and, with a library of content for assets identified in the scene graph, can generate a synthetic image of a scene that has the desired scene structure without the need for manual placement of any of the objects in the scene. Images or environments synthesized in this way can be used to, for example, generate training data for real world navigational applications, as well as to generate virtual worlds for games or virtual reality experiences.
-
公开(公告)号:US11816790B2
公开(公告)日:2023-11-14
申请号:US17117425
申请日:2020-12-10
Applicant: Nvidia Corporation
Inventor: Jeevan Devaranjan , Sanja Fidler , Amlan Kar
IPC: G06T17/00 , A63F13/52 , G06F16/51 , G06F16/54 , G06N3/08 , G06N5/025 , G06T15/20 , G06N7/01 , G06V10/25 , G06V10/774 , G06V20/20 , G06V20/40 , G06F18/214
CPC classification number: G06T17/00 , A63F13/52 , G06F16/51 , G06F16/54 , G06N3/08 , G06N5/025 , G06N7/01 , G06T15/205 , G06V10/25 , G06V10/774 , G06V20/20 , G06F18/2148 , G06F18/2155 , G06T2210/61 , G06V20/40
Abstract: A rule set or scene grammar can be used to generate a scene graph that represents the structure and visual parameters of objects in a scene. A renderer can take this scene graph as input and, with a library of content for assets identified in the scene graph, can generate a synthetic image of a scene that has the desired scene structure without the need for manual placement of any of the objects in the scene. Images or environments synthesized in this way can be used to, for example, generate training data for real world navigational applications, as well as to generate virtual worlds for games or virtual reality experiences.
-
公开(公告)号:US20200160178A1
公开(公告)日:2020-05-21
申请号:US16685795
申请日:2019-11-15
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06N3/04 , G06T11/60
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US20240256831A1
公开(公告)日:2024-08-01
申请号:US18159815
申请日:2023-01-26
Applicant: NVIDIA Corporation
Inventor: Daiqing Li , Huan Ling , Seung Wook Kim , Karsten Julian Kreis , Antonio Torralba Barriuso , Sanja Fidler , Amlan Kar
IPC: G06N3/045 , G06T5/00 , G06V10/774 , G06V10/82
CPC classification number: G06N3/045 , G06T5/70 , G06V10/7753 , G06V10/82
Abstract: In various examples, systems and methods are disclosed relating to generating a response from image and/or video input for image/video-based artificial intelligence (AI) systems and applications. Systems and methods are disclosed for a first model (e.g., a teacher model) distilling its knowledge to a second model (a student model). The second model receives a downstream image in a downstream task and generates at least one feature. The first model generates first features corresponding to an image which can be a real image or a synthetic image. The second model generates second features using the image as an input to the second model. Loss with respect to first features is determined. The second model is updated using the loss.
-
公开(公告)号:US20240161396A1
公开(公告)日:2024-05-16
申请号:US18505283
申请日:2023-11-09
Applicant: Nvidia Corporation
Inventor: Jeevan Devaranjan , Sanja Fidler , Amlan Kar
IPC: G06T17/00 , A63F13/52 , G06F16/51 , G06F16/54 , G06N3/08 , G06N5/025 , G06N7/01 , G06T15/20 , G06V10/25 , G06V10/774 , G06V20/20
CPC classification number: G06T17/00 , A63F13/52 , G06F16/51 , G06F16/54 , G06N3/08 , G06N5/025 , G06N7/01 , G06T15/205 , G06V10/25 , G06V10/774 , G06V20/20 , G06T2210/61 , G06V20/40
Abstract: A rule set or scene grammar can be used to generate a scene graph that represents the structure and visual parameters of objects in a scene. A renderer can take this scene graph as input and, with a library of content for assets identified in the scene graph, can generate a synthetic image of a scene that has the desired scene structure without the need for manual placement of any of the objects in the scene. Images or environments synthesized in this way can be used to, for example, generate training data for real world navigational applications, as well as to generate virtual worlds for games or virtual reality experiences.
-
公开(公告)号:US20230229919A1
公开(公告)日:2023-07-20
申请号:US18186696
申请日:2023-03-20
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06T11/60 , G06N3/045 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/426
CPC classification number: G06N3/08 , G06F16/9024 , G06T11/60 , G06N3/045 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/426 , G06T2210/61
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar— and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US11610115B2
公开(公告)日:2023-03-21
申请号:US16685795
申请日:2019-11-15
Applicant: NVIDIA Corporation
Inventor: Amlan Kar , Aayush Prakash , Ming-Yu Liu , David Jesus Acuna Marrero , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06N3/08 , G06F16/901 , G06T11/60 , G06N3/04
Abstract: In various examples, a generative model is used to synthesize datasets for use in training a downstream machine learning model to perform an associated task. The synthesized datasets may be generated by sampling a scene graph from a scene grammar—such as a probabilistic grammar—and applying the scene graph to the generative model to compute updated scene graphs more representative of object attribute distributions of real-world datasets. The downstream machine learning model may be validated against a real-world validation dataset, and the performance of the model on the real-world validation dataset may be used as an additional factor in further training or fine-tuning the generative model for generating the synthesized datasets specific to the task of the downstream machine learning model.
-
公开(公告)号:US20250061153A1
公开(公告)日:2025-02-20
申请号:US18935222
申请日:2024-11-01
Applicant: Nvidia Corporation
Inventor: Hang Chu , Daiqing Li , David Jesus Acuna Marrero , Amlan Kar , Maria Shugrina , Ming-Yu Liu , Antonio Torralba Barriuso , Sanja Fidler
IPC: G06F16/901 , G06F30/13 , G06F30/27 , G06N3/044 , G06N3/045 , G06N3/047 , G06N3/08 , G06N3/084 , G06N5/04 , G06N20/10 , G06N20/20 , G06V10/764 , G06V10/82 , G06V10/84 , G06V20/10
Abstract: A generative model can be used for generation of spatial layouts and graphs. Such a model can progressively grow these layouts and graphs based on local statistics, where nodes can represent spatial control points of the layout, and edges can represent segments or paths between nodes, such as may correspond to road segments. A generative model can utilize an encoder-decoder architecture where the encoder is a recurrent neural network (RNN) that encodes local incoming paths into a node and the decoder is another RNN that generates outgoing nodes and edges connecting an existing node to the newly generated nodes. Generation is done iteratively, and can finish once all nodes are visited or another end condition is satisfied. Such a model can generate layouts by additionally conditioning on a set of attributes, giving control to a user in generating the layout.
-
公开(公告)号:US20200302250A1
公开(公告)日:2020-09-24
申请号:US16825199
申请日:2020-03-20
Applicant: Nvidia Corporation
Inventor: Hang Chu , Daiqing Li , David Jesus Acuna Marrero , Amlan Kar , Maria Shugrina , Ming-Yu Liu , Antonio Torralba Barriuso , Sanja Fidler
Abstract: A generative model can be used for generation of spatial layouts and graphs. Such a model can progressively grow these layouts and graphs based on local statistics, where nodes can represent spatial control points of the layout, and edges can represent segments or paths between nodes, such as may correspond to road segments. A generative model can utilize an encoder-decoder architecture where the encoder is a recurrent neural network (RNN) that encodes local incoming paths into a node and the decoder is another RNN that generates outgoing nodes and edges connecting an existing node to the newly generated nodes. Generation is done iteratively, and can finish once all nodes are visited or another end condition is satisfied. Such a model can generate layouts by additionally conditioning on a set of attributes, giving control to a user in generating the layout.
-
-
-
-
-
-
-
-