-
公开(公告)号:US20220148220A1
公开(公告)日:2022-05-12
申请号:US17519894
申请日:2021-11-05
Applicant: NEC Laboratories America, Inc.
Inventor: Bingbing Zhuang , Manmohan Chandraker
Abstract: A computer-implemented method for fusing geometrical and Convolutional Neural Network (CNN) relative camera pose is provided. The method includes receiving two images having different camera poses. The method further includes inputting the two images into a geometric solver branch to return, as a first solution, an estimated camera pose and an associated pose uncertainty value determined from a Jacobian of a reproduction error function. The method also includes inputting the two images into a CNN branch to return, as a second solution, a predicted camera pose and an associated pose uncertainty value. The method additionally includes fusing, by a processor device, the first solution and the second solution in a probabilistic manner using Bayes' rule to obtain a fused pose.
-
公开(公告)号:US20220148189A1
公开(公告)日:2022-05-12
申请号:US17520207
申请日:2021-11-05
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Masoud Faraki , Yumin Suh , Sparsh Garg , Manmohan Chandraker , Dongwan Kim
Abstract: Methods and systems for training a model include combining data from multiple datasets, the datasets having different respective label spaces. Relationships between labels in the different label spaces are identified. A unified neural network model is trained, using the combined data and the identified relationships to generate a unified model, with a class relational binary cross-entropy loss.
-
公开(公告)号:US20220147746A1
公开(公告)日:2022-05-12
申请号:US17521193
申请日:2021-11-08
Applicant: NEC Laboratories America, Inc.
Inventor: Buyu Liu , Bingbing Zhuang , Manmohan Chandraker
Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.
-
公开(公告)号:US20220147735A1
公开(公告)日:2022-05-12
申请号:US17519986
申请日:2021-11-05
Applicant: NEC Laboratories America, Inc.
Inventor: Yumin Suh , Xiang Yu , Yi-Hsuan Tsai , Masoud Faraki , Manmohan Chandraker
Abstract: A method for employing facial information in unsupervised person re-identification is presented. The method includes extracting, by a body feature extractor, body features from a first data stream, extracting, by a head feature extractor, head features from a second data stream, outputting a body descriptor vector from the body feature extractor, outputting a head descriptor vector from the head feature extractor, and concatenating the body descriptor vector and the head descriptor vector to enable a model to generate a descriptor vector.
-
195.
公开(公告)号:US11321853B2
公开(公告)日:2022-05-03
申请号:US16939604
申请日:2020-07-27
Applicant: NEC Laboratories America, Inc.
Inventor: Pan Ji , Quoc-Huy Tran , Manmohan Chandraker , Yuliang Zou
Abstract: A computer-implemented method for implementing a self-supervised visual odometry framework using long-term modeling includes, within a pose network of the self-supervised visual odometry framework including a plurality of pose encoders, a convolution long short-term memory (ConvLSTM) module having a first-layer ConvLSTM and a second-layer ConvLSTM, and a pose prediction layer, performing a first stage of training over a first image sequence using photometric loss, depth smoothness loss and pose cycle consistency loss, and performing a second stage of training to finetune the second-layer ConvLSTM over a second image sequence longer than the first image sequence.
-
公开(公告)号:US11222238B2
公开(公告)日:2022-01-11
申请号:US17094261
申请日:2020-11-10
Applicant: NEC Laboratories America, Inc.
Inventor: Samuel Schulter , Gaurav Sharma , Yi-Hsuan Tsai , Manmohan Chandraker , Xiangyun Zhao
Abstract: Methods and systems for object detection include training dataset-specific object detectors using respective annotated datasets, each of the annotated datasets including annotations for a respective set of one or more object classes. The annotated datasets are cross-annotated using the dataset-specific object detectors. A unified object detector is trained, using the cross-annotated datasets, to detect all of the object classes of the annotated datasets. Objects are detected in an input image using the unified object detector.
-
公开(公告)号:US11132586B2
公开(公告)日:2021-09-28
申请号:US16593247
申请日:2019-10-04
Applicant: NEC Laboratories America, Inc.
Inventor: Quoc-Huy Tran , Bingbing Zhuang , Pan Ji , Manmohan Chandraker
Abstract: A method for correcting rolling shutter (RS) effects is presented. The method includes generating a plurality of images from a camera, synthesizing RS images from global shutter (GS) counterparts to generate training data to train the structure-and-motion-aware convolutional neural network (CNN), and predicting an RS camera motion and an RS depth map from a single RS image by employing a structure-and-motion-aware CNN to remove RS distortions from the single RS image.
-
公开(公告)号:US20210276547A1
公开(公告)日:2021-09-09
申请号:US17187157
申请日:2021-02-26
Applicant: NEC Laboratories America, Inc.
Inventor: Sriram Nochur Narayanan , Buyu Liu , Ramin Moslemi , Francesco Pittaluga , Manmohan Chandraker
IPC: B60W30/095 , B60W60/00 , G06K9/62 , G06N3/08 , G06K9/00
Abstract: Methods and systems for training a trajectory prediction model and performing a vehicle maneuver include encoding a set of training data to generate encoded training vectors, where the training data includes trajectory information for agents over time. Trajectory scenarios are simulated based on the encoded training vectors, with each simulated trajectory scenario representing one or more agents with respective agent trajectories, to generate simulated training data. A predictive neural network model is trained using the simulated training data to generate predicted trajectory scenarios based on a detected scene.
-
公开(公告)号:US20210142043A1
公开(公告)日:2021-05-13
申请号:US17091011
申请日:2020-11-06
Applicant: NEC Laboratories America, Inc.
Inventor: Xiang Yu , Manmohan Chandraker , Kihyuk Sohn , Yichun Shi
Abstract: A computer-implemented method for implementing face recognition includes receiving training data including a plurality of augmented images each corresponding to a respective one of a plurality of input images augmented by one of a plurality of variations, splitting a feature embedding generated from the training data into a plurality of sub-embeddings each associated with one of the plurality of variations, associating each of the plurality of sub-embeddings with respective ones of a plurality of confidence values, and applying a plurality of losses including a confidence-aware identification loss and a variation-decorrelation loss to the plurality of sub-embeddings and the plurality of confidence values to improve face recognition performance by learning the plurality of sub-embeddings.
-
公开(公告)号:US10915792B2
公开(公告)日:2021-02-09
申请号:US16535681
申请日:2019-08-08
Applicant: NEC Laboratories America, Inc.
Inventor: Yi-Hsuan Tsai , Kihyuk Sohn , Buyu Liu , Manmohan Chandraker , Jong-Chyi Su
Abstract: Systems and methods for domain adaptation are provided. The system aligns image level features between a source domain and a target domain based on an adversarial learning process while training a domain discriminator. The system selects, using the domain discriminator, unlabeled samples from the target domain that are far away from existing annotated samples from the target domain. The system selects, based on a prediction score of each of the unlabeled samples, samples with lower prediction scores. The system annotates the samples with the lower prediction scores.
-
-
-
-
-
-
-
-
-