-
公开(公告)号:US20230281847A1
公开(公告)日:2023-09-07
申请号:US17592096
申请日:2022-02-03
Applicant: NVIDIA Corporation
Inventor: Yiran Zhong , Charles Loop , Nikolai Smolyanskiy , Ke Chen , Stan Birchfield , Alexander Popov
CPC classification number: G06T7/55 , G06T7/70 , G06V10/462 , G06T2207/20081 , G06T2207/30252
Abstract: In various examples, methods and systems are provided for estimating depth values for images (e.g., from a monocular sequence). Disclosed approaches may define a search space of potential pixel matches between two images using one or more depth hypothesis planes based at least on a camera pose associated with one or more cameras used to generate the images. A machine learning model(s) may use this search space to predict likelihoods of correspondence between one or more pixels in the images. The predicted likelihoods may be used to compute depth values for one or more of the images. The predicted depth values may be transmitted and used by a machine to perform one or more operations.
-
公开(公告)号:US11062471B1
公开(公告)日:2021-07-13
申请号:US16868342
申请日:2020-05-06
Applicant: NVIDIA Corporation
Inventor: Yiran Zhong , Wonmin Byeon , Charles Loop , Stanley Thomas Birchfield
Abstract: Stereo matching generates a disparity map indicating pixels offsets between matched points in a stereo image pair. A neural network may be used to generate disparity maps in real time by matching image features in stereo images using only 2D convolutions. The proposed method is faster than 3D convolution-based methods, with only a slight accuracy loss and higher generalization capability. A 3D efficient cost aggregation volume is generated by combining cost maps for each disparity level. Different disparity levels correspond to different amounts of shift between pixels in the left and right image pair. In general, each disparity level is inversely proportional to a different distance from the viewpoint.
-