-
公开(公告)号:US20250104277A1
公开(公告)日:2025-03-27
申请号:US18608804
申请日:2024-03-18
Applicant: NVIDIA CORPORATION
Inventor: Jonathan TREMBLAY , Stanley BIRCHFIELD , Valts BLUKIS , Balakumar SUNDARALINGAM , Stephen TYREE , Bowen WEN
Abstract: One embodiment of a method for determining object poses includes receiving first sensor data and second sensor data, where the first sensor data is associated with a first modality, and the second sensor data is associated with a second modality that is different from the first modality, and performing one or more iterative operations to determine a pose of an object based on one or more comparisons of (i) one or more renderings of a three-dimensional (3D) representation of the object in the first modality with the first sensor data, and (ii) one or more renderings of the 3D representation of the object in the second modality with the second sensor data.
-
公开(公告)号:US20240161468A1
公开(公告)日:2024-05-16
申请号:US18453248
申请日:2023-08-21
Applicant: NVIDIA CORPORATION
Inventor: Xueting LI , Stanley BIRCHFIELD , Shalini DE MELLO , Sifei LIU , Jiaming SONG , Yufei YE
IPC: G06V10/774 , G06T5/00 , G06T7/11 , G06V10/82 , G06V40/10
CPC classification number: G06V10/774 , G06T5/002 , G06T5/005 , G06T7/11 , G06V10/82 , G06V40/11 , G06T2207/20081 , G06T2207/20084 , G06T2207/30196
Abstract: Techniques are disclosed herein for generating an image. The techniques include performing one or more first denoising operations based on a first machine learning model and an input image that includes a first object to generate a mask that indicates a spatial arrangement associated with a second object interacting with the first object, and performing one or more second denoising operations based on a second machine learning model, the input image, and the mask to generate an image of the second object interacting with the first object.
-
公开(公告)号:US20250124654A1
公开(公告)日:2025-04-17
申请号:US18740264
申请日:2024-06-11
Applicant: NVIDIA CORPORATION
Inventor: Bowen WEN , Stanley BIRCHFIELD , Jonathan TREMBLAY , Valts BLUKIS , Dieter FOX , Yijia WENG
Abstract: One embodiment of a method for generating an articulation model includes receiving a first set of images of an object in a first articulation and a second set of images of the object in a second articulation, performing one or more operations to generate first three-dimensional (3D) geometry based on the first set of images, performing one or more operations to generate second 3D geometry based on the second set of images, and performing one or more operations to generate an articulation model of the object based on the first 3D geometry and the second 3D geometry.
-
公开(公告)号:US20240066710A1
公开(公告)日:2024-02-29
申请号:US18168482
申请日:2023-02-13
Applicant: NVIDIA CORPORATION
Inventor: Balakumar SUNDARALINGAM , Stanley BIRCHFIELD , Zhenggang TANG , Jonathan TREMBLAY , Stephen TYREE , Bowen WEN , Ye YUAN , Charles LOOP
CPC classification number: B25J9/1697 , B25J9/163 , B25J9/1664 , B25J9/1676 , B25J19/023
Abstract: One embodiment of a method for controlling a robot includes generating a representation of spatial occupancy within an environment based on a plurality of red, green, blue (RGB) images of the environment, determining one or more actions for the robot based on the representation of spatial occupancy and a goal, and causing the robot to perform at least a portion of a movement based on the one or more actions.
-
-
-