Patent search ap:("NEC Laboratories America Page Inc.") AND inv:"Bingbing Zhuang"

21.

Granted invention patent
End-to-end parametric road layout prediction with cheap supervision

Status: Active

Publication number: US12131557B2

Publication date: 2024-10-29

Application number: US17521193

Application date: 2021-11-08

Applicant: NEC Laboratories America, Inc.

Inventor: Buyu Liu, Bingbing Zhuang, Manmohan Chandraker

IPC: G06T7/10, B60W10/18, B60W10/20, B60W30/02, B60W30/09, G06F18/214, G06V20/56, G06V30/262

CPC classification number: G06V20/588, B60W10/18, B60W10/20, B60W30/02, B60W30/09, G06F18/2155, G06T7/10, G06V30/274, B60W2420/403, G06T2207/10024, G06T2207/20081, G06T2207/30256

Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.
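The perspective-to-top-view step under a road plane assumption can be illustrated with a minimal sketch. It is not the patented method, which learns the mapping; it only shows the flat-ground geometry such a mapping relies on. The function name, the pinhole model, the zero pitch and roll, and all parameter names are assumptions made for illustration.

```python
def pixel_to_ground(u, v, f, cx, cy, cam_height):
    """Back-project pixel (u, v) onto a flat road plane.

    Assumptions (not from the abstract): a pinhole camera with focal
    length f in pixels, principal point (cx, cy), zero pitch and roll,
    mounted cam_height metres above a perfectly flat road.
    Returns (x, z): lateral offset and forward distance in metres.
    """
    dv = v - cy  # pixel rows below the horizon line
    if dv <= 0:
        raise ValueError("pixel is at or above the horizon and never meets the road plane")
    z = f * cam_height / dv   # forward distance, from similar triangles
    x = (u - cx) * z / f      # lateral offset at that distance
    return x, z


# A pixel 100 rows below the image centre, seen by a camera 1.5 m above the road.
print(pixel_to_ground(u=740, v=460, f=1000, cx=640, cy=360, cam_height=1.5))
```

Applying this to every road-labelled pixel of a segmentation map gives a top-view point set; pixels above the horizon have no ground intersection and are rejected.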

22.

Granted invention patent
Uncertainty-aware fusion towards large-scale NeRF

Status: Active

Publication number: US12131422B2

Publication date: 2024-10-29

Application number: US17963471

Application date: 2022-10-11

Applicant: NEC Laboratories America, Inc.

Inventor: Bingbing Zhuang, Samuel Schulter, Yi-Hsuan Tsai, Buyu Liu, Nanbo Li

IPC: G06T17/00, G06T15/00, G06T15/20, G06V10/774, G06V10/82, G06V20/40

CPC classification number: G06T15/20, G06T15/00, G06T17/00, G06V10/774, G06V10/82, G06V20/41, G06T2200/08, G06T2210/56

Abstract: A method for achieving high-fidelity novel view synthesis and 3D reconstruction for large-scale scenes is presented. The method includes obtaining images from a video stream received from a plurality of video image capturing devices, grouping the images into different image clusters representing a large-scale 3D scene, training a neural radiance field (NeRF) and an uncertainty multilayer perceptron (MLP) for each of the image clusters to generate a plurality of NeRFs and a plurality of uncertainty MLPs for the large-scale 3D scene, applying a rendering loss and an entropy loss to the plurality of NeRFs, performing uncertainty-based fusion to the plurality of NeRFs to define a fused NeRF, and jointly fine-tuning the plurality of NeRFs and the plurality of uncertainty MLPs, and during inference, applying the fused NeRF for novel view synthesis of the large-scale 3D scene.

23.

Invention publication
VIDEO DOMAIN ADAPTATION VIA CONTRASTIVE LEARNING

Status: Under examination (published)

Publication number: US20240037187A1

Publication date: 2024-02-01

Application number: US18484832

Application date: 2023-10-11

Applicant: NEC Laboratories America, Inc.

Inventor: Yi-Hsuan Tsai, Xiang Yu, Bingbing Zhuang, Manmohan Chandraker, Donghyun Kim

IPC: G06F18/213, G06N3/08, G06V10/75, G06F18/22, G06F18/214

CPC classification number: G06F18/213, G06N3/08, G06V10/751, G06F18/22, G06F18/2155

Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.
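A loss with a cross-domain part and a cross-modality part can be sketched with a generic InfoNCE contrastive term. The abstract does not say which contrastive objective is used, how positive and negative pairs are chosen, or how the two parts are weighted, so `info_nce`, `combined_loss`, the weights and the temperature are all hypothetical choices for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two feature vectors given as lists."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss: -log softmax score of the positive pair."""
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, n) / temperature for n in negatives]
    m = max(logits)  # subtract the max for a stable log-sum-exp
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[0]


def combined_loss(src_feat, tgt_feat, mod1_feat, mod2_feat, negatives,
                  w_domain=1.0, w_modality=1.0):
    """Hypothetical weighted sum of a cross-domain and a cross-modality term."""
    cross_domain = info_nce(src_feat, tgt_feat, negatives)
    cross_modality = info_nce(mod1_feat, mod2_feat, negatives)
    return w_domain * cross_domain + w_modality * cross_modality


print(combined_loss([1.0, 0.0], [0.9, 0.1], [1.0, 0.0], [0.8, 0.2],
                    negatives=[[0.0, 1.0], [-1.0, 0.0]]))
```

Each term shrinks as the paired features align and the negatives move away, which is the behaviour a contrastive regularizer is meant to encourage.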

24.

Invention publication
VIDEO DOMAIN ADAPTATION VIA CONTRASTIVE LEARNING

Status: Under examination (published)

Publication number: US20240037186A1

Publication date: 2024-02-01

Application number: US18484826

Application date: 2023-10-11

Applicant: NEC Laboratories America, Inc.

Inventor: Yi-Hsuan Tsai, Xiang Yu, Bingbing Zhuang, Manmohan Chandraker, Donghyun Kim

IPC: G06F18/213, G06N3/08, G06V10/75, G06F18/22, G06F18/214

CPC classification number: G06F18/213, G06N3/08, G06V10/751, G06F18/22, G06F18/2155

Abstract: Video methods and systems include extracting features of a first modality and a second modality from a labeled first training dataset in a first domain and an unlabeled second training dataset in a second domain. A video analysis model is trained using contrastive learning on the extracted features, including optimization of a loss function that includes a cross-domain regularization part and a cross-modality regularization part.

25.

Granted invention patent
Parametric top-view representation of complex road scenes

Status: Active

Publication number: US11455813B2

Publication date: 2022-09-27

Application number: US17096111

Application date: 2020-11-12

Applicant: NEC Laboratories America, Inc.

Inventor: Buyu Liu, Bingbing Zhuang, Samuel Schulter, Manmohan Chandraker

IPC: G06V30/422, G06T7/00, G06V40/12

Abstract: Systems and methods are provided for producing a road layout model. The method includes capturing digital images having a perspective view, converting each of the digital images into top-down images, and conveying a top-down image of time t to a neural network that performs a feature transform to form a feature map of time t. The method also includes transferring the feature map of the top-down image of time t to a feature transform module to warp the feature map to a time t+1, and conveying a top-down image of time t+1 to form a feature map of time t+1. The method also includes combining the warped feature map of time t with the feature map of time t+1 to form a combined feature map, transferring the combined feature map to a long short-term memory (LSTM) module to generate the road layout model, and displaying the road layout model.

26.

Invention application
LEARNING TO FUSE GEOMETRICAL AND CNN RELATIVE CAMERA POSE VIA UNCERTAINTY

Status: Active

Publication number: US20220148220A1

Publication date: 2022-05-12

Application number: US17519894

Application date: 2021-11-05

Applicant: NEC Laboratories America, Inc.

Inventor: Bingbing Zhuang, Manmohan Chandraker

IPC: G06T7/73, G06T7/77, G06T1/00

Abstract: A computer-implemented method for fusing geometrical and Convolutional Neural Network (CNN) relative camera pose is provided. The method includes receiving two images having different camera poses. The method further includes inputting the two images into a geometric solver branch to return, as a first solution, an estimated camera pose and an associated pose uncertainty value determined from a Jacobian of a reproduction error function. The method also includes inputting the two images into a CNN branch to return, as a second solution, a predicted camera pose and an associated pose uncertainty value. The method additionally includes fusing, by a processor device, the first solution and the second solution in a probabilistic manner using Bayes' rule to obtain a fused pose.
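Probabilistic fusion of two estimates with Bayes' rule can be illustrated with the textbook product of two Gaussians. This is a simplified sketch, not the patented procedure: it assumes each pose is a plain vector in some local parametrization with independent Gaussian noise per component, whereas a real system would likely use full covariances and a rotation-aware parametrization. The function and argument names are hypothetical.

```python
def fuse_gaussian(mean_a, var_a, mean_b, var_b):
    """Fuse two independent Gaussian estimates component by component.

    For each component the fused precision is the sum of the two
    precisions, and the fused mean is the precision-weighted average.
    All inputs are equal-length lists; variances must be positive.
    """
    fused_mean, fused_var = [], []
    for ma, va, mb, vb in zip(mean_a, var_a, mean_b, var_b):
        precision = 1.0 / va + 1.0 / vb
        fused_var.append(1.0 / precision)
        fused_mean.append((ma / va + mb / vb) / precision)
    return fused_mean, fused_var


# Hypothetical geometric-solver and CNN estimates of one pose component:
# the solver is confident (variance 1.0), the CNN less so (variance 3.0).
mean, var = fuse_gaussian([0.0], [1.0], [2.0], [3.0])
print(mean, var)
```

The fused estimate sits closer to whichever branch reports the smaller variance, and its variance is never larger than either input, which is why reliable per-branch uncertainty values matter for this kind of fusion.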

27.

Invention application
END-TO-END PARAMETRIC ROAD LAYOUT PREDICTION WITH CHEAP SUPERVISION

Status: Active

Publication number: US20220147746A1

Publication date: 2022-05-12

Application number: US17521193

Application date: 2021-11-08

Applicant: NEC Laboratories America, Inc.

Inventor: Buyu Liu, Bingbing Zhuang, Manmohan Chandraker

IPC: G06K9/00, G06T7/10, G06K9/72, G06K9/62, B60W30/09, B60W10/18, B60W10/20, B60W30/02

Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.

28.

Granted invention patent
Rolling shutter rectification in images/videos using convolutional neural networks with applications to SFM/SLAM with rolling shutter images/videos

Status: Active

Publication number: US11132586B2

Publication date: 2021-09-28

Application number: US16593247

Application date: 2019-10-04

Applicant: NEC Laboratories America, Inc.

Inventor: Quoc-Huy Tran, Bingbing Zhuang, Pan Ji, Manmohan Chandraker

IPC: H04N5/225, G06N3/08, G06N3/04, H04N5/232, G06T7/20, G06K9/62

Abstract: A method for correcting rolling shutter (RS) effects is presented. The method includes generating a plurality of images from a camera, synthesizing RS images from global shutter (GS) counterparts to generate training data to train the structure-and-motion-aware convolutional neural network (CNN), and predicting an RS camera motion and an RS depth map from a single RS image by employing a structure-and-motion-aware CNN to remove RS distortions from the single RS image.

29.

Invention application
PHOTOREALISTIC TRAINING DATA AUGMENTATION

Status: Active

Publication number: US20250148697A1

Publication date: 2025-05-08

Application number: US18936290

Application date: 2024-11-04

Applicant: NEC Laboratories America, Inc.

Inventor: Ziyu Jiang, Bingbing Zhuang, Manmohan Chandraker

IPC: G06T15/08

Abstract: Methods and systems include training a model for rendering a three-dimensional volume using a loss function that includes a depth loss term and a distribution loss term that regularize an output of the model to produce realistic scenarios. A simulated scenario is generated based on an original scenario, with the simulated scenario including a different position and pose relative to the original scenario in a three-dimensional (3D) scene that is generated by the model from the original scenario. A self-driving model is trained for an autonomous vehicle using the simulated scenario.

30.

Invention application
INSTANTANEOUS PERCEPTION OF FINE-GRAINED 3D MOTION

Status: Active

Publication number: US20250115250A1

Publication date: 2025-04-10

Application number: US18903538

Application date: 2024-10-01

Applicant: NEC Laboratories America, Inc.

Inventor: Bingbing Zhuang, Manmohan Chandraker, Di Liu

IPC: B60W40/10, B60W10/18, B60W10/20, G06T3/18, G06T7/246

Abstract: Methods and systems for motion detection include performing a first prediction to predict voxel occupancy based on a sequence of input point clouds including a current point cloud and a set of previous point clouds. A second prediction is performed to predict voxel occupancy for the sequence of input point clouds using predicted voxel occupancy between the input point clouds. Motion detection is performed based on the completed voxel occupancy. An action is performed responsive to a detected motion.
