-
公开(公告)号:US12131557B2
公开(公告)日:2024-10-29
申请号:US17521193
申请日:2021-11-08
Applicant: NEC Laboratories America, Inc.
Inventor: Buyu Liu , Bingbing Zhuang , Manmohan Chandraker
IPC: G06T7/10 , B60W10/18 , B60W10/20 , B60W30/02 , B60W30/09 , G06F18/214 , G06V20/56 , G06V30/262
CPC classification number: G06V20/588 , B60W10/18 , B60W10/20 , B60W30/02 , B60W30/09 , G06F18/2155 , G06T7/10 , G06V30/274 , B60W2420/403 , G06T2207/10024 , G06T2207/20081 , G06T2207/30256
Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.
-
公开(公告)号:US12131422B2
公开(公告)日:2024-10-29
申请号:US17963471
申请日:2022-10-11
Applicant: NEC Laboratories America, Inc.
Inventor: Bingbing Zhuang , Samuel Schulter , Yi-Hsuan Tsai , Buyu Liu , Nanbo Li
CPC classification number: G06T15/20 , G06T15/00 , G06T17/00 , G06V10/774 , G06V10/82 , G06V20/41 , G06T2200/08 , G06T2210/56
Abstract: A method for achieving high-fidelity novel view synthesis and 3D reconstruction for large-scale scenes is presented. The method includes obtaining images from a video stream received from a plurality of video image capturing devices, grouping the images into different image clusters representing a large-scale 3D scene, training a neural radiance field (NeRF) and an uncertainty multilayer perceptron (MLP) for each of the image clusters to generate a plurality of NeRFs and a plurality of uncertainty MLPs for the large-scale 3D scene, applying a rendering loss and an entropy loss to the plurality of NeRFs, performing uncertainty-based fusion to the plurality of NeRFs to define a fused NeRF, and jointly fine-tuning the plurality of NeRFs and the plurality of uncertainty MLPs, and during inference, applying the fused NeRF for novel view synthesis of the large-scale 3D scene.
-
公开(公告)号:US20240037187A1
公开(公告)日:2024-02-01
申请号:US18484832
申请日:2023-10-11
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Xiang Yu , Bingbing Zhuang , Manmohan Chandraker , Donghyun Kim
IPC: G06F18/213 , G06N3/08 , G06V10/75 , G06F18/22 , G06F18/214
CPC classification number: G06F18/213 , G06N3/08 , G06V10/751 , G06F18/22 , G06F18/2155
Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.
-
公开(公告)号:US20240037186A1
公开(公告)日:2024-02-01
申请号:US18484826
申请日:2023-10-11
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Xiang Yu , Bingbing Zhuang , Manmohan Chandraker , Donghyun Kim
IPC: G06F18/213 , G06N3/08 , G06V10/75 , G06F18/22 , G06F18/214
CPC classification number: G06F18/213 , G06N3/08 , G06V10/751 , G06F18/22 , G06F18/2155
Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.
-
公开(公告)号:US11455813B2
公开(公告)日:2022-09-27
申请号:US17096111
申请日:2020-11-12
Applicant: NEC Laboratories America, Inc.
Inventor: Buyu Liu , Bingbing Zhuang , Samuel Schulter , Manmohan Chandraker
IPC: G06V30/422 , G06T7/00 , G06V40/12
Abstract: Systems and methods are provided for producing a road layout model. The method includes capturing digital images having a perspective view, converting each of the digital images into top-down images, and conveying a top-down image of time t to a neural network that performs a feature transform to form a feature map of time t. The method also includes transferring the feature map of the top-down image of time t to a feature transform module to warp the feature map to a time t+1, and conveying a top-down image of time t+1 to form a feature map of time t+1. The method also includes combining the warped feature map of time t with the feature map of time t+1 to form a combined feature map, transferring the combined feature map to a long short-term memory (LSTM) module to generate the road layout model, and displaying the road layout model.
-
公开(公告)号:US20220148220A1
公开(公告)日:2022-05-12
申请号:US17519894
申请日:2021-11-05
Applicant: NEC Laboratories America, Inc.
Inventor: Bingbing Zhuang , Manmohan Chandraker
Abstract: A computer-implemented method for fusing geometrical and Convolutional Neural Network (CNN) relative camera pose is provided. The method includes receiving two images having different camera poses. The method further includes inputting the two images into a geometric solver branch to return, as a first solution, an estimated camera pose and an associated pose uncertainty value determined from a Jacobian of a reproduction error function. The method also includes inputting the two images into a CNN branch to return, as a second solution, a predicted camera pose and an associated pose uncertainty value. The method additionally includes fusing, by a processor device, the first solution and the second solution in a probabilistic manner using Bayes' rule to obtain a fused pose.
-
公开(公告)号:US20220147746A1
公开(公告)日:2022-05-12
申请号:US17521193
申请日:2021-11-08
Applicant: NEC Laboratories America, Inc.
Inventor: Buyu Liu , Bingbing Zhuang , Manmohan Chandraker
Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.
-
公开(公告)号:US11132586B2
公开(公告)日:2021-09-28
申请号:US16593247
申请日:2019-10-04
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Bingbing Zhuang , Pan Ji , Manmohan Chandraker
Abstract: A method for correcting rolling shutter (RS) effects is presented. The method includes generating a plurality of images from a camera, synthesizing RS images from global shutter (GS) counterparts to generate training data to train the structure-and-motion-aware convolutional neural network (CNN), and predicting an RS camera motion and an RS depth map from a single RS image by employing a structure-and-motion-aware CNN to remove RS distortions from the single RS image.
-
公开(公告)号:US20250148697A1
公开(公告)日:2025-05-08
申请号:US18936290
申请日:2024-11-04
Applicant: NEC Laboratories America, Inc.
Inventor: Ziyu Jiang , Bingbing Zhuang , Manmohan Chandraker
IPC: G06T15/08
Abstract: Methods and systems include training a model for rendering a three-dimensional volume using a loss function that includes a depth loss term and a distribution loss term that regularize an output of the model to produce realistic scenarios. A simulated scenario is generated based on an original scenario, with the simulated scenario including a different position and pose relative to the original scenario in a three-dimensional (3D) scene that is generated by the model from the original scenario. A self-driving model is trained for an autonomous vehicle using the simulated scenario.
-
公开(公告)号:US20250115250A1
公开(公告)日:2025-04-10
申请号:US18903538
申请日:2024-10-01
Applicant: NEC Laboratories America, Inc.
Inventor: Bingbing Zhuang , Manmohan Chandraker , Di Liu
Abstract: Methods and systems for motion detection include performing a first prediction to predict voxel occupancy based on a sequence of input point clouds including a current point cloud and a set of previous point clouds. A second prediction is performed to predict voxel occupancy for the sequence of input point clouds using predicted voxel occupancy between the input point clouds. Motion detection is performed based on the completed voxel occupancy. An action is performed responsive to a detected motion.
-
-
-
-
-
-
-
-
-