-
1.
公开(公告)号:US20230368468A1
公开(公告)日:2023-11-16
申请号:US17744467
申请日:2022-05-13
Applicant: NVIDIA CORPORATION
Inventor: Ben ECKART , Christopher CHOY , Chao LIU , Yurong YOU
CPC classification number: G06T17/10 , G06N3/0454 , G06N3/084 , G06T19/20 , G06T2219/2016
Abstract: In various embodiments, an unsupervised training application executes a neural network on a first point cloud to generate keys and values. The unsupervised training application generates output vectors based on a first query set, the keys, and the values and then computes spatial features based on the output vectors. The unsupervised training application computes quantized context features based on the output vectors and a first set of codes representing a first set of 3D geometry blocks. The unsupervised training application modifies the first neural network based on a likelihood of reconstructing the first point cloud, the quantized context features, and the spatial features to generate an updated neural network. A trained machine learning model includes the updated neural network, a second query set, and a second set of codes representing a second set of 3D geometry blocks and maps a point cloud to a representation of 3D geometry instances.
-
2.
公开(公告)号:US20230368032A1
公开(公告)日:2023-11-16
申请号:US17744456
申请日:2022-05-13
Applicant: NVIDIA CORPORATION
Inventor: Ben ECKART , Christopher CHOY , Chao LIU , Yurong YOU
Abstract: In various embodiments, an unsupervised training application trains a machine learning model to generate representations of point clouds. The unsupervised training application executes a neural network on a first point cloud representing a first three-dimensional (3D) scene to generate segmentations. Based on the segmentations, the unsupervised training application computes spatial features. The unsupervised training application computes quantized context features based on the segmentations and a first set of codes representing a first set of 3D geometry blocks. The unsupervised training application modifies the neural network based on a likelihood of reconstructing the first point cloud, the quantized context features, and the spatial features to generate an updated neural network. A trained machine learning model that includes the updated neural network and a second set of codes representing a second set of 3D geometry blocks maps a point cloud representing a 3D scene to a representation of 3D geometry instances.
-